Scalable Scheduling for Sub-Second Parallel Jobs
Patrick Wendell
EECS Department, University of California, Berkeley
Technical Report No. UCB/EECS-2013-79
May 16, 2013
http://www2.eecs.berkeley.edu/Pubs/TechRpts/2013/EECS-2013-79.pdf
Large-scale data analytics frameworks are shifting towards shorter task durations and larger degrees of parallelism to provide low latency. However, scheduling highly parallel jobs that com- plete in hundreds of milliseconds poses a major challenge for cluster schedulers, which will need to place millions of tasks per second on appropriate nodes while offering millisecond-level la- tency and high availability. We demonstrate that a decentralized, randomized sampling approach provides near-optimal performance while avoiding the throughput and availability limitations of a centralized design. We implement and deploy our scheduler, Sparrow, on a real cluster and demon- strate that Sparrow performs within 14% of an ideal scheduler.
Advisors: Ion Stoica
BibTeX citation:
@mastersthesis{Wendell:EECS-2013-79,
Author= {Wendell, Patrick},
Title= {Scalable Scheduling for Sub-Second Parallel Jobs},
School= {EECS Department, University of California, Berkeley},
Year= {2013},
Month= {May},
Url= {http://www2.eecs.berkeley.edu/Pubs/TechRpts/2013/EECS-2013-79.html},
Number= {UCB/EECS-2013-79},
Abstract= {Large-scale data analytics frameworks are shifting towards shorter task durations and larger
degrees of parallelism to provide low latency. However, scheduling highly parallel jobs that com-
plete in hundreds of milliseconds poses a major challenge for cluster schedulers, which will need
to place millions of tasks per second on appropriate nodes while offering millisecond-level la-
tency and high availability. We demonstrate that a decentralized, randomized sampling approach
provides near-optimal performance while avoiding the throughput and availability limitations of a
centralized design. We implement and deploy our scheduler, Sparrow, on a real cluster and demon-
strate that Sparrow performs within 14% of an ideal scheduler.},
}
EndNote citation:
%0 Thesis %A Wendell, Patrick %T Scalable Scheduling for Sub-Second Parallel Jobs %I EECS Department, University of California, Berkeley %D 2013 %8 May 16 %@ UCB/EECS-2013-79 %U http://www2.eecs.berkeley.edu/Pubs/TechRpts/2013/EECS-2013-79.html %F Wendell:EECS-2013-79