yash8589 / CL1-Project-1-WordCloud

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

For English

$cd english
$bash English.sh


The bash script will run all the codes present here in a methodical way and end after the wordCloud has been displayed.

For Hindi

$cd Hindi
$bash Hindi.sh


The bash script will run all the codes present here in a methodical way and end after the wordCloud has been displayed.
If on windows, open the bash file in any editor and check out the python commands for the order of execution


NOTE

  • POS and Lemmatization of Hindi is done using Stanza which can be slow in case of a large dataset.
  • POS and Lemmatization for Hindi runs in POSandLemma.py , freq_POS.py and freq_lemma.py

About


Languages

Language:Python 92.3%Language:Shell 7.7%