laserwave / lda_gibbs_sampling

a python implementation of latent dirichlet allocation(lda) using gibbs sampling algorithm

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

LDA (Latent Dirichlet Allocation)

This is a python implementation of LDA using gibbs sampling algorithm.

The following picture shows the top 10 words in the 10 topics (set K = 10) generated by this algorithm over 16 sentences about one piece on wikipedia.

res1

The following picture shows the top 10 words in the 10 topics (set K = 10) generated by this algorithm over 5000 chinese sina social news.

res2

The following picture shows the top 10 words in the 30 topics (set K = 30) generated by this algorithm over 5000 chinese sina social news.

res3

Author

About

a python implementation of latent dirichlet allocation(lda) using gibbs sampling algorithm


Languages

Language:Python 100.0%