The Open Automation and Control Systems Journal

2015, 7 : 1144-1152
Published online 2015 September 14. DOI: 10.2174/1874444301507011144
Publisher ID: TOAUTOCJ-7-1144

The Cooperative Study Between the Hadoop Big Data Platform and the Traditional Data Warehouse

Ping Hu
Tongren University Information Engineering College, Tongren, Guizhou, 554300, China.

ABSTRACT

In this paper, based on the application conditions of the existing traditional data warehouse and the future forecast of the Hadoop big data platform, this paper proposes the new framework of the cooperation of Hadoop and traditional data warehouse which focus on the cooperation between the traditional data warehouse and the Hadoop technique to solve the problem that the traditional data warehouse can hardly meet customers' demands. The new framework originated from the thoughts of the designers of Cloudera and Teradata, and in this paper, the new architecture is divided into three modules: data acquisition, data storage and data applications, this paper mainly discusses the consideration of structured and unstructured data collection, storage and application problem, and researches the Hadoop and traditional data warehouse in collaboration of data storage and data application. According to data collection and transmission problem, this paper uses the Apache Sqoop technology as the solution; and relies on Hadoop HDFS file system and the Hive data warehouse to store the data. At the same time, this paper also introduces the data application in the Hive. Finally, the prototype system proves the feasibility of the designed structure.

Keywords:

Big data, hadoop, data warehouse, traditional data warehouse.