The Open Cybernetics & Systemics Journal

2014, 8 : 938-943
Published online 2014 December 31. DOI: 10.2174/1874110X01408010938
Publisher ID: TOCSJ-8-938

Aspect Clustering Combined N-gram for Reviews

Shibo Zhang and Xiaojie Wang
School of Computer Science and Technology, Beijing University of Posts and Telecommunications, Haidian District, Beijing, 100876, China.

ABSTRACT

With the growing popularity of e-commerce, more and more customer reviews are available online, and it is usually hard to read through all of them. We use Latent Dirichlet Allocation (LDA) to mine product aspects. Considering the weakness of standard LDA in processing review text, we define a product aspect model, predefine aspects for different domains, and propose a sentence-based aspect model combined with N-grams, which automates aspect clustering. Our experimental results show that the proposed model can cluster most aspects, recognize representative expressions of more than one word for an aspect, and achieve better sentence-level aspect precision than previously proposed aspect models.

Keywords:

Aspect model, N-gram, reviews.