The Open Cybernetics & Systemics Journal
2014, 8 : 938-943Published online 2014 December 31. DOI: 10.2174/1874110X01408010938
Publisher ID: TOCSJ-8-938
Aspect Clustering Combined N-gram for Reviews
School of Computer Science
and Technology, Beijing University of Posts and Telecommunications,
Haidian District, Beijing, 100876, China.
ABSTRACT
With the increase in popularity of e-commerce, more and more customer reviews are available online, it’s usually hard to go through each of them. Latent Dirichlet Allocation (LDA) was used to mine product aspect. Considering the weakness of standard LDA when processing review text, we defined the product aspect model, and predefined aspects for different domains, and proposed aspect model combined N-gram based on sentence, which can automate aspect clustering. Our experimental results show that the proposed model can cluster mostly aspects and recognize representative words for the aspect with more than one word, and achieve better sentence-level aspect precision than previously proposed aspect models.