The Open Automation and Control Systems Journal

2014, 6 : 1463-1467
Published online 2014 December 31. DOI: 10.2174/1874444301406011463
Publisher ID: TOAUTOCJ-6-1463

Research on Database Massive Data Processing and Mining Method based on Hadoop Cloud Platform

Zhao Xiaoyong and Yang Chunrong
Mathematics and Computer Science Institute, XinYu University, JiangXi, 338004, China.

ABSTRACT

This paper establishes the massive data processing mathematical model and algorithm of cloud computing, and the Hadoop distributed computing method is introduced to the database management system, to realize the automatic partition database data and master-slave node set. The master-slave nodes distributed algorithm is complied by using the MATLAB software realizing the data distributed computing function, and through the numerical simulation, we can compute the data processing speed, transmission rate, capacity and other system parameters. Compared with the Hadoop distributed processing algorithms and two kinds of traditional data processing algorithm, we can find that the data processing speed of Hadoop distributed computing algorithm is faster than the general algorithm, the amount of information storage, information transmission speed, which can satisfy the need of high data processing.

Keywords:

Hadoop, Massive data , Cloud computing, Large capacity, Distributed computing, Master slave node.