Jeremy Gwinnup's repositories
wmt19-tmx-extract
Extraction script for Paracrawl tmx files for use in WMT19
CoMMuTE-Arabic
MSA Arabic translations of v1 of the commute dataset
compare_attention
Compare soft attention outputs from decodes
differential
Differential multiple word alignment comparison tool
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
langid.py
Stand-alone language identification system
lrp_contrast
Streamlit LRP Contrast viewer
moses-rest
Moses with a Rest interface
moses2-test
Test scripts for moses2 debugging
mosesdecoder
Moses, the machine translation system
mtdata
A tool that locates, downloads, and extracts machine translation corpora
mtma17-lab
Docker File to build container used for MTMA17 labs
mtma22-marian-logs
Example Marian training and validation logs
multi-bleu-smooth
Katherine Young's smoothed-bleu modifications to multi-bleu.pl
nlposs.github.io
Workshop for Natural Language Processing Open Source Software (NLP-OSS)
papillote-mtma18
Development data for start/stop curriculum training wrapper. Check this module out to grab the other pieces as submodules
projtest
Project Test
tokenZone
Zone on punctuation in Moses-Tokenized data
viewdiffaug
Compare stock vs generated images