MaartenGr / KeyBERT

Minimal keyword extraction with BERT

Home Page: https://MaartenGr.github.io/KeyBERT/

Document Clustering between KeyBERT and Sentence Transformer?

km5ar opened this issue

commented

Has anyone compared KeyBERT and Sentence Transformers for document clustering?

KeyBERT already uses SentenceTransformers to extract the document and word embeddings. It might be interesting to compare how well clustering works on the keyword embeddings versus the document embeddings, but unfortunately I have not tried it out yet.
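For anyone who wants to try that comparison, here is a minimal sketch rather than a tested recipe. It assumes each document can be represented either by the embedding of its full text or by the mean embedding of its extracted keywords, and that k-means agreement (adjusted Rand index) is a reasonable way to compare the two. The helper names (`document_features`, `keyword_features`, `clustering_agreement`, `keybert_backends`) are hypothetical, and the model name and `top_n` value are only illustrative choices.

```python
from typing import Callable, List, Sequence, Tuple

import numpy as np

# Hypothetical type aliases for this sketch.
Encoder = Callable[[Sequence[str]], np.ndarray]  # texts -> (n, dim) array
Extractor = Callable[[str], List[str]]           # document -> keywords


def document_features(docs: Sequence[str], encode: Encoder) -> np.ndarray:
    """One vector per document: the embedding of the full text."""
    return np.asarray(encode(list(docs)), dtype=float)


def keyword_features(docs: Sequence[str], extract: Extractor, encode: Encoder) -> np.ndarray:
    """One vector per document: the mean embedding of its extracted keywords."""
    rows = []
    for doc in docs:
        # Assumption: fall back to the full text if no keywords are extracted.
        keywords = extract(doc) or [doc]
        rows.append(np.asarray(encode(keywords), dtype=float).mean(axis=0))
    return np.vstack(rows)


def clustering_agreement(features_a: np.ndarray, features_b: np.ndarray,
                         n_clusters: int, seed: int = 0) -> float:
    """Cluster both feature sets with k-means and return their adjusted Rand index."""
    # Imported here so the feature helpers above only need NumPy.
    from sklearn.cluster import KMeans
    from sklearn.metrics import adjusted_rand_score

    labels_a = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed).fit_predict(features_a)
    labels_b = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed).fit_predict(features_b)
    return adjusted_rand_score(labels_a, labels_b)


def keybert_backends(model_name: str = "all-MiniLM-L6-v2",
                     top_n: int = 5) -> Tuple[Encoder, Extractor]:
    """Build (encode, extract) from the real libraries; downloads a model on first use."""
    from keybert import KeyBERT
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer(model_name)
    kw_model = KeyBERT(model=model)  # share one model for documents and keywords

    def encode(texts: Sequence[str]) -> np.ndarray:
        return model.encode(list(texts))

    def extract(doc: str) -> List[str]:
        # extract_keywords returns (keyword, score) pairs for a single document.
        return [kw for kw, _score in kw_model.extract_keywords(doc, top_n=top_n)]

    return encode, extract
```

To use it, call `encode, extract = keybert_backends()`, build both feature matrices for your documents, and pass them to `clustering_agreement`. A score near 1 would suggest the two representations group the documents similarly. If you have ground-truth labels, comparing each clustering against those labels is likely more informative.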

commented

@MaartenGr Yeah! I read your official docs a few months ago, and I remember a section where you suggested using KeyBERT first and then clustering. I recently tried to find that section again but could not locate it.

I actually do not remember writing about such a use case for KeyBERT in the documentation. It may have been PolyFuzz; KeyBERT is not generally used for clustering unless the word embeddings are themselves clustered.