sumeet's starred repositories
C4_200M-synthetic-dataset-for-grammatical-error-correction
This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences from C4 with a tagged corruption model. Stahlberg and Kumar (2021) describe the approach and the dataset in more detail (https://www.aclweb.org/anthology/2021.bea-1.4/).
transformer-pytorch
Transformer implementation in PyTorch.
reconstructing_faces_from_voices
[NeurIPS 2019] Face Reconstruction from Voice using Generative Adversarial Networks