MapReduce is a programming model and its associated run-time system proposed in 2004, which can process large scale of data in clusters with simple program logic. MapReduce has a potential problem running on a cooperative cluster which is combined with machines having different configurations. The problem will cause unexpected performance degradations and should be avoided. In this paper, a task scheduling policy is proposed to take higher utilization of all computing nodes in heterogeneous clusters.
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
Tel.: +1 703 830 6300
Fax: +1 703 830 2300 firstname.lastname@example.org
(Corporate matters and books only) IOS Press c/o Accucoms US, Inc.
For North America Sales and Customer Service
West Point Commons
Lansdale PA 19446
Tel.: +1 866 855 8967
Fax: +1 215 660 5042 email@example.com