abhijeet3922 / Topic-Modelling-on-Wiki-corpus

It uses Latent Dirichlet Allocation algorithm to discover hidden topics from the articles. It is trained on 60,000 articles taken from simple wikipedia english corpus. Finally, It can extract the topic of the given input text article.

Home Page:https://appliedmachinelearning.wordpress.com/2017/08/28/topic-modelling-part-1-creating-article-corpus-from-simple-wikipedia-dump/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

abhijeet3922/Topic-Modelling-on-Wiki-corpus Stargazers