koaning / manyterms

Many terms for whatever purposes (weak labelling)

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

manyterms

The goal of this project is to collect lists of terms that might be used for:

  • weak labelling
  • text classification
  • entity detection
  • term training for annotation
  • fun

High Quality?

The goal of these wordlists is to be low-effort, but we cannot guarantee high quality. Maintaining high quality wordlists is hard work and outside of the scope of this project. If there is a serious issue with a word-list feel free to make a PR though.

Contributing

You're free to add a list yourself, but we require that you always add a source with a permissive license.

About

Many terms for whatever purposes (weak labelling)