The Open Automation and Control Systems Journal
2014, 6 : 1463-1467Published online 2014 December 31. DOI: 10.2174/1874444301406011463
Publisher ID: TOAUTOCJ-6-1463
Research on Database Massive Data Processing and Mining Method based on Hadoop Cloud Platform
ABSTRACT
This paper establishes the massive data processing mathematical model and algorithm of cloud computing, and the Hadoop distributed computing method is introduced to the database management system, to realize the automatic partition database data and master-slave node set. The master-slave nodes distributed algorithm is complied by using the MATLAB software realizing the data distributed computing function, and through the numerical simulation, we can compute the data processing speed, transmission rate, capacity and other system parameters. Compared with the Hadoop distributed processing algorithms and two kinds of traditional data processing algorithm, we can find that the data processing speed of Hadoop distributed computing algorithm is faster than the general algorithm, the amount of information storage, information transmission speed, which can satisfy the need of high data processing.