MaartenGr / KeyBERT

Minimal keyword extraction with BERT

Home Page:https://MaartenGr.github.io/KeyBERT/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

How to use leaderboards?

sdspieg opened this issue · comments

Can you recommend a way to find the most appropriate, pretrained recent language model(s) that focuses on semantic similarity AND should work with KeyBERT? E.g. these seem to be appropriate models that should also work for Dutch. But how can we tell which ones would work? Also, we'd like to run KeyBERT with different models, in order to be able to see the differences - you don't happen to have any Jupyter notebooks that would show us how to do this? Thanks!

Personally, I would advise looking at the MTEB Leaderboard. These models are optimized for sentence similarity tasks and especially the top models ("bge-" and "gte-") work incredibly well and can be used within KeyBERT.

Great! Thanks much Maarten...