askerlee / topicvec

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

LogLikelihood

gabrer opened this issue · comments

Hi askerlee!

I would ask you if the logLikelihood computed by "calcLoglikelihood" function is normalised by the number of words in the corpus?
If not, it could be easily done?

Thank you!

It's not normalized by the document length (i.e. the number of words in a document). You could divide it by the document length to get the average (log) perplexity.

So, I need only to divide it by the document length, great!

Thank you for your prompt reply!