Professional Documents
Culture Documents
Available at http://www.ijcsonline.com/
Abstract
Mining is a process of knowledge extraction with some meaningful information. Topic Mining provides a convenient way
to analyze large of unclassified text. A topic contains a cluster of words that frequently occur together. A topic modeling
can connect words with similar meanings and distinguish between uses of words with multiple meanings. This paper
provides two categories that can be under the field of topic modeling. First one discusses the area of methods of topic
modeling, which has four methods that can be considerable under this category. These methods are Latent semantic
analysis (LSA), Probabilistic latent semantic analysis (PLSA), Latent Dirichlet allocation (LDA), and Correlated topic
model (CTM). All the above models can consider time as one of the important factor for constructing topics. So in the
second category, different models are discussed such as topic over time (TOT), dynamic topic models (DTM), multiscale
topic tomography, dynamic topic correlation detection all these models are used in various applications.
Keywords: Topic Modeling, Methods of Topic Modeling, Latent semantic analysis (LSA), Probabilistic latent semantic
analysis (PLSA), Latent Dirichlet allocation (LDA), Correlated topic model (CTM), Topic Evolution Modeling.
I.
INTRODUCTION
407 | International Journal of Computer Systems, ISSN-(2394-1065), Vol. 03, Issue 05, May, 2016
M Trupthi et al
408 | International Journal of Computer Systems, ISSN-(2394-1065), Vol. 03, Issue 05, May, 2016
M Trupthi et al
409 | International Journal of Computer Systems, ISSN-(2394-1065), Vol. 03, Issue 05, May, 2016
M Trupthi et al
Characteristics
Latent
Semantic
Analysis (LSA)
Probabilistic Latent
Semantic
Analysis
(PLSA)
Latent
Dirichelet
Allocation (LDA)
Limitations
- It is hard to obtain and to determine the
number of topics.
- To interpret loading values with probability
meaning, it is hard to operate it.
Probabilistic Latent
Semantic Analysis
(PLSA)
Latent
Dirichelet
Allocation (LDA)
Correlated
Topic
Model (CTM)
410 | International Journal of Computer Systems, ISSN-(2394-1065), Vol. 03, Issue 05, May, 2016
M Trupthi et al
411 | International Journal of Computer Systems, ISSN-(2394-1065), Vol. 03, Issue 05, May, 2016
M Trupthi et al
REFERENCES
[1]
Models
[2]
[3]
2)Multiscale topic tomography
[4]
[5]
[6]
[7]
[9]
[10]
[11]
[12]
[13]
[14]
[15]
[16]
[17]
[18]
412 | International Journal of Computer Systems, ISSN-(2394-1065), Vol. 03, Issue 05, May, 2016
M Trupthi et al
413 | International Journal of Computer Systems, ISSN-(2394-1065), Vol. 03, Issue 05, May, 2016