Simple implementation of some topic models, such as plsa, lda and so on
- plsa
- lda
- ...
All code only runs on Windows. I'm not sure if it can run on other platforms
I crawled some articles from bilibili.
stopwords list is from stopwords
You can run as follow
python main.py \
data_dir=data/bilibiliarticle \
number_of_topics=100
max_iters=100
- The code are not well tested, so it may contain bugs.