hplt-project / OpusCleaner

OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.

Home Page:https://pypi.org/project/opuscleaner/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Download all of them!

kpu opened this issue · comments

There should be a function to download all corpora for a particular language pair. Otherwise what am I supposed to do, guess from the name and size which ones are good then click madly?