how to get the monolingual corpus
cocaer opened this issue · comments
The code seems written for downloading parallel corpus. If I want to do some work with UNMT, how I can get the monolingual Common Crawl corpus referred in the paper? Nepali and Sinhala corpus are not found on paracrawl.eu homepage.