The Open Automation and Control Systems Journal

2015, 7: 1660-1666
Published online 2015 October 9. DOI: 10.2174/1874444301507011660
Publisher ID: TOAUTOCJ-7-1660

Combining Semantic Comprehension and Machine Learning for Chinese Sentiment Classification

Jianfeng Xu, Yuan Xu, Yuanjian Zhang and Yu Li
Software College of Nanchang University, Nanchang 330047, China.

ABSTRACT

Semantic comprehension-based methods and machine learning-based methods are the two major approaches to Chinese sentiment classification. The advantage of the semantic comprehension-based approach is that it can classify text across domains and achieves satisfactory portability; however, its classification accuracy is limited. Supervised machine learning achieves much better accuracy, but its portability is rather poor because samples are selected randomly and semantic orientation is labeled subjectively. In this paper, a hybrid framework combining the advantages of the two methods is proposed. Text features are first extracted based on semantic comprehension and then optimized by a novel information gain method. The features, expressed in the vector space model, are integrated with traditional machine learning algorithms. Experiments show that the support vector machine has the best discriminative power among the machine learning algorithms compared. Additionally, the framework improves portability and accuracy compared with both semantic comprehension-based methods and machine learning-based methods.
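As a rough illustration of the feature optimization and vector space steps mentioned in the abstract, the following minimal sketch ranks terms by standard (textbook) information gain and maps documents to binary vectors. It is not the novel information gain variant proposed in the paper, whose details are not given here. The function names, the set-of-tokens document representation, and the toy data are all assumptions made for illustration.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    counts = Counter(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def information_gain(docs, labels, term):
    """Standard information gain of a term's presence/absence.

    docs is a list of token sets; labels is the matching list of classes.
    This is the textbook definition, not the paper's modified version.
    """
    with_term = [y for d, y in zip(docs, labels) if term in d]
    without_term = [y for d, y in zip(docs, labels) if term not in d]
    n = len(labels)
    conditional = (len(with_term) / n) * entropy(with_term) \
        + (len(without_term) / n) * entropy(without_term)
    return entropy(labels) - conditional


def select_features(docs, labels, k):
    """Keep the k terms with the highest information gain."""
    vocab = sorted({t for d in docs for t in d})
    ranked = sorted(vocab,
                    key=lambda t: information_gain(docs, labels, t),
                    reverse=True)
    return ranked[:k]


def to_vector(doc, features):
    """Binary vector space representation of one document."""
    return [1 if f in doc else 0 for f in features]


# Hypothetical toy corpus (English tokens stand in for segmented Chinese words).
docs = [{"good", "movie"}, {"great", "movie"},
        {"bad", "movie"}, {"awful", "movie"}]
labels = ["pos", "pos", "neg", "neg"]

features = select_features(docs, labels, k=4)
vectors = [to_vector(d, features) for d in docs]
```

In this toy corpus the term "movie" appears in every document, so it carries no class information and is the one term dropped by the selection. The resulting vectors could then be passed to any classifier, such as a support vector machine.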

Keywords:

Chinese information processing, sentiment analysis, semantic comprehension, machine learning.