Scheduling Job Queue On Hadoop

Durga Achuta Toram, B.P.N.Madhu Kumar

Abstract


Hadoop is a free, Java-based programming system that backings the preparing of vast informational collections in a Parallel and disseminated figuring condition. Enormous Data in many organizations are handled by Hadoop by presenting the employments to Master. Estimate based booking with maturing has been perceived as a compelling way to deal with certification powerful and close ideal framework reaction times. Hadoop Fair Sojourn Protocol (HFSP), “a scheduler acquainting this procedure with a genuine, multi-server, complex and generally utilized framework, for example, Hadoop”. In this paper, we introduce the plan of another booking convention that caters both to a reasonable and productive use of bunch assets, while endeavoring to accomplish short reaction times. Our answer actualizes a size-based, preemptive planning discipline. The scheduler apportions group assets with the end goal that employment measure data is surmised while the occupation gains ground toward its fruition. Planning choices utilize the idea of virtual time and bunch assets are centered around employments as per their need, processed through maturing. This guarantees neither little nor extensive employments experience the ill effects of starvation. The result of our work appears as an undeniable scheduler usage that coordinates consistently in Hadoop named HFSP. Measure based planning for HFSP receives offering need to little occupations that they won't be backed off by expansive ones. The “Shortest Remaining Processing Time (SRPT) strategy, which organizes occupations that need minimal measure of work to finish, is the one that limits the mean reaction time (or visit time), that is the time that goes between an occupation accommodation and its fruition”. We Extend HFSP to respite occupations with Higher SRPT and permit other holding up employments in Queue in view of FCFS.


Keywords


Hadoop, Hadoop Fair Sojourn Protocol, Shortest Remaining Processing Time.

References


Hadoop, the Apache Software Foundation, May 2012, 1.0.3.

Thusoo,Ashish,(2012), “Hive: a warehousing solution over map-reduce framework.” Proceedings of the VLDB Endowment 2.2 pp. 1626-1629,2012.

Ananthanarayanan G, Ghodsi A, Shenker S, Stoica I. Effective straggler mitigation: Attack of the clones. National Spatial Data Infrastructure. 2013; 13.

Zaharia, Chowdhury M, Das T, Dave A, Ma J, McCauley M, Franklin MJ, Shenker S, Stoica I. Resilient distributed datasets: a fault- tolerant abstraction for in-memory cluster computing. Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation; 2012. p. 2–2.

Zahariaet M. Delay scheduling: a simple technique for achieving locality and fairness in cluster scheduling. Proceedings of ACM EuroSys; 2010.

Dean J, Ghemawat S. Map reduce: simplified data processing on large clusters. Proceedings of USENIX OSDI; 2004.

SCHRAGE, L. E. A proof of the optimality of the shortest remaining processing time discipline. Operations Research 16 (1968), 678–690.

SCHRAGE, L. E., AND MILLER, L. W. The queue M/G/1 with the shortest remaining processing time discipline. Operations Research 14 (1966), 670–684.

PERERA, R. The variance of delay time in queueing system M/G/1 with optimal strategy SRPT. Archiv fur Elektronik und Uebertragungstechnik 47, 2 (1993), 110–114.

BUX, W. Analysis of a local-area bus system with controlled access. IEEE Transactions on Computers 32, 8 (1983), 760– 763.

Y. Chen, S. Alspaugh, and R. Katz, “Interactive query processing in big data systems: A cross-industry study of MapReduce workloads,” in Proc. of VLDB, 2012.

K. Ren et al., “Hadoop’s adolescence: An analysis of Hadoop usage in scientific workloads,” in Proc. of VLDB, 2013.

A. Verma, L. Cherkasova, and R. H. Campbell, “Aria: automatic resource inference and allocation for MapReduce environments,” in Proc. of ICAC, 2011.

D. Lu, H. Sheng, and P. Dinda, “Size-based scheduling policies with inaccurate scheduling information,” in Proc. of IEEE MASCOTS, 2004.

H. Chang et al. Scheduling in MapReduce-like Systems for Fast Completion Time. In Proc. of IEEE INFOCOM, pages 3074–3082, 2011.

B. Moseley, A. Dasgupta, R. Kumar, and T. Sarlos. On scheduling in map-reduce and flow-shops. In In Proc. of ACM SPAA, pages 289–298, 2011.

J. Tan, X. Meng, and L. Zhang. Delay tails in MapReduce scheduling. In Proc. of ACM SIGMETRICS, pages 5–16, 2012.


Full Text: PDF [Full Text]

Refbacks

  • There are currently no refbacks.


Copyright © 2013, All rights reserved.| ijseat.com

Creative Commons License
International Journal of Science Engineering and Advance Technology is licensed under a Creative Commons Attribution 3.0 Unported License.Based on a work at IJSEat , Permissions beyond the scope of this license may be available at http://creativecommons.org/licenses/by/3.0/deed.en_GB.