The Open Cybernetics & Systemics Journal

2014, 8 : 1266-1270
Published online 2014 December 31. DOI: 10.2174/1874110X01408011266
Publisher ID: TOCSJ-8-1266

An Improved Lda Model in Micro-Blog Tags Extracting Based on Multi- Tags

Jianfang Wang , Kunxiao Shen , Anfeng Xu and Yihua Lan
School of Computer and Information Technology, Nanyang Normal University, Nanyang, 473000, P.R. China.

ABSTRACT

This article mainly discusses how to extract the interested information from massive amounts of micro-blogs and recommend right information to user, which is a hot research area in recommendation systems and social networks, too. To solve this problem, a model called Multi-tags Latent Dirichlet Allocation is proposed. Using this model, topics paid attention by users can be mined effectively and the defect of low degree of differentiation for the short blog content is settled. Experiments showed that the tags of user’s micro-blog can be figured out with this model which makes users manage their resources at their convenience and others find their needed resources through tags. The results, experimented on real micro-blog data set, indicate that this model works better than traditional model on extracting tags. Standard measuring index Perplexity is applied to this model to estimate the likelihood of new text. If the number of topics is selected appropriately, the accuracy will be raised to almost 10%.

Keywords:

LDA model, micro-blog tags, MTLDA, resource sharing.