LogLikelihood
gabrer opened this issue
Gabriele Pergola commented
Hi askerlee!
I'd like to ask whether the log-likelihood computed by the "calcLoglikelihood" function is normalised by the number of words in the corpus.
If not, could that be done easily?
Thank you!
askerlee commented
It's not normalized by the document length (i.e. the number of words in a document). You could divide it by the document length to get the average per-word log-likelihood; the negative of that value is the (log) perplexity.
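A minimal sketch of that normalisation, assuming the value you get back is a natural-log likelihood for one document and that you know its word count. The helper names and the numbers below are hypothetical, for illustration only; they are not part of this repository, and the sketch does not call "calcLoglikelihood" itself.

```python
import math


def per_word_loglikelihood(loglik, num_words):
    """Average log-likelihood per word: loglik / num_words."""
    if num_words <= 0:
        raise ValueError("num_words must be positive")
    return loglik / num_words


def perplexity(loglik, num_words):
    """Perplexity = exp(-average per-word log-likelihood), natural log assumed."""
    return math.exp(-per_word_loglikelihood(loglik, num_words))


def corpus_per_word_loglikelihood(logliks, lengths):
    """Corpus-level average: total log-likelihood over total word count."""
    return per_word_loglikelihood(sum(logliks), sum(lengths))


# Hypothetical numbers, not taken from any real run.
doc_loglik = -1500.0   # unnormalised log-likelihood of one document
doc_len = 200          # number of words in that document

avg = per_word_loglikelihood(doc_loglik, doc_len)   # -7.5
ppl = perplexity(doc_loglik, doc_len)               # exp(7.5)

# Two documents: (-1500 + -500) / (200 + 50) = -8.0
corpus_avg = corpus_per_word_loglikelihood([-1500.0, -500.0], [200, 50])
```

For a corpus-level figure, summing the log-likelihoods and dividing by the total word count weights every word equally, whereas averaging the per-document averages would give short documents more influence.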
Gabriele Pergola commented
So I only need to divide it by the document length. Great!
Thank you for your prompt reply!