kimeshan / xhosa-nlp

πŸ”€ Extracting most frequent words in the Xhosa language using NLP

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Xhosa NLP πŸ”€

GitHub license PRs Welcome

Quickstart

  1. Install NLTK
  2. Run python most_frequent_words.py
  3. Open results.csv to view results

Source of Corpus

Leipzig Corpora Collection

CITE: D. Goldhahn, T. Eckart & U. Quasthoff: Building Large Monolingual Dictionaries at the Leipzig Corpora Collection: From 100 to 200 Languages. In: Proceedings of the 8th International Language Ressources and Evaluation (LREC'12), 2012

About

πŸ”€ Extracting most frequent words in the Xhosa language using NLP


Languages

Language:Python 100.0%