hplt-project / OpusCleaner

OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.

Home Page:https://pypi.org/project/opuscleaner/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

num_mismatch filter does not recognize equivalent numbers

jindrahelcl opened this issue · comments

06 and 6 is considered a mismatch