Marta Bañón's repositories
binonymizer
Anonymizer module for Bicleaner's pipeline (WIP)
benchmarks
Several benchmarks on sentence splitting and language identification
fastspell-dictionaries
Dictionaries for FastSpell
dictionaries
Hunspell dictionaries in UTF-8
Language: JavaScript, License: MIT
flask-api-demo
Flask RESTful example project, including JWT authentication, rq asynchronous tasks, Swagger documentation, Redoc documentation, Docker deployment, uwsgi, supervisor…
Language: Python
loomchild-segment-py
Python module to interface with Java Loomchild sentence segmenter
Language: Python, License: GPL-3.0
OpusCleaner
OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.
Language: Python
stopwords
Stopwords removal:
Language: Python, License: ISC