LogLikelihood

Question

LogLikelihood

gabrer opened this issue 7 years ago · comments

Hi askerlee!

I would ask you if the logLikelihood computed by "calcLoglikelihood" function is normalised by the number of words in the corpus?
If not, it could be easily done?

Thank you!

askerlee · Answer 1 · Wed Jun 07 2017 12:01:46 GMT+0800 (China Standard Time)

It's not normalized by the document length (i.e. the number of words in a document). You could divide it by the document length to get the average (log) perplexity.

Gabriele Pergola · Answer 2 · Thu Jun 08 2017 20:52:42 GMT+0800 (China Standard Time)

So, I need only to divide it by the document length, great!

Thank you for your prompt reply!