The Open Cybernetics & Systemics Journal

2015, 9 : 1170-1176
Published online 2015 September 10. DOI: 10.2174/1874110X01509011170
Publisher ID: TOCSJ-9-1170

On-Line Labeled Topic Model

YongHeng Chen , Yaojin Lin and Hao Yue
China College of Computer Science, Minnan Normal University, Zhangzhou, Fujian 363000, China.

ABSTRACT

A large number of electronic documents are labeled using human-interpretable annotations. High-efficiency text mining on such data set requires generative model that can flexibly comprehend the significance of observed labels while simultaneously uncovering topics within unlabeled documents. This paper presents a novel and generalized on-line labeled topic model (OLT) tracking the time development of extracted topics through a structured multi-labeled data set. Our topic model has an incrementally updated principle based on time slices in an on-line fashion, and can detect dynamic trending for labeled topics in parallel. Empirical results are presented to demonstrate lower perplexity and high performance of our proposed model when compared with other models.

Keywords:

Bayesian models, Gibbs sampling, topic modeling, variational expectation-maximization.