jnanliu / TopicModel

Implementation of a variety of topic models, include plsa, lda and so on

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

TopicModel

Simple implementation of some topic models, such as plsa, lda and so on

TODO

  • plsa
  • lda
  • ...

Dependencies

All code only runs on Windows. I'm not sure if it can run on other platforms

Data

I crawled some articles from bilibili.

stopwords list is from stopwords

Train

You can run as follow

python main.py \
data_dir=data/bilibiliarticle \
number_of_topics=100 
max_iters=100

Note

  • The code are not well tested, so it may contain bugs.

Reference

About

Implementation of a variety of topic models, include plsa, lda and so on