jhlau / topic-coherence-sensitivity

Code to compute topic coherence for several topic cardinalities and aggregate scores across them

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

score over the whole topics

un-lock-me opened this issue · comments

commented

Thanks for sharing the code. I tried this code, however I got one question left. why you reported the coherence score per topic?
Usually in the papers they report the score over all the topics. for example if we run a topic modeling on the 20 newsgroup data set, and we set number of topic 20, then we report only one coherence score related to all the topics.

can you please share your idea how can I achieve that score. Is only averaging all the coherence per topics makes sense?

Thanks

commented

It depends on what you want to do. If you want to compare topic models, you'd aggregate the score (e.g. by taking mean). But if you want to filter out bad topics, you'd want to look at the individual coherence score for each topic.