The Open Cybernetics & Systemics Journal
2015, 9 : 253-261Published online 2015 April 17. DOI: 10.2174/1874110X01509010253
Publisher ID: TOCSJ-9-253
A New MapReduce Framework Based on Virtual IP Mechanism and Load Balancing Strategy
ABSTRACT
MapReduce is an important method for large-scale data processing on parallel architecture. In Hadoop ecosystem, MapReduce runs on the application-level, thus it provides system with flexibility. MapReduce is good at offline batch processing and it could accelerate the whole execution time. The deficiency of the MapReduce architecture is a lack in balancing and scalability, thus leads to low efficiency when dealing with large-scale data. In this paper, we propose a new MapReduce framework that is more suitable for Hadoop ecosystem. The framework is based on the virtual IP mechanism and load balancing strategy. Comparative experiments indicate that the new framework achieve twice the performance compared to the original MapReduce. Besides, the framework fully meets the environment of Hadoop ecosystem, and provides a stable and efficient data processing.