leej35 / PLSA

PLSA implementation via EM algorithm

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

This is a PLSA (Probabilistic Latent Semantic Analysis) implementation via the EM (Expectation-Maximization) algorithm.

Current issues:

  1. The code are not well tested, so it may contain bugs. The test text are in the folder ./texts and ./test.
  2. The code seems not working well with small datasets, such as ./test

Reference:

EM introduction: http://blog.tomtung.com/2011/10/em-algorithm

PLSA introduction: http://blog.tomtung.com/2011/10/plsa

My lda-with-gibbs repo

Note:

A Tutorial on Probabilistic Latent Semantic Analysis by Liangjie Hong is not a very good PLSA introduction material. There are some known bugs.

About

PLSA implementation via EM algorithm


Languages

Language:Python 100.0%