facebookresearch / flores

Facebook Low Resource (FLoRes) MT Benchmark

How to get the monolingual corpus

cocaer opened this issue

The code seems to be written for downloading the parallel corpus. If I want to do some work with UNMT, how can I get the monolingual Common Crawl corpus referred to in the paper? I could not find the Nepali and Sinhala corpora on the paracrawl.eu homepage.