varepsilon

Fast + Non-Autoregressive Grammatical Error Correction using BERT. Code and Pre-trained models for paper "Parallel Iterative Edit Models for Local Sequence Transduction": www.aclweb.org/anthology/D19-1435.pdf (EMNLP-IJCNLP 2019)

Language:Macaulay2MIT226 9 24

yolopandas

Language:PythonMIT193 6 5

C4_200M-synthetic-dataset-for-grammatical-error-correction

This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences from C4 using a tagged corruption model. The approach and the dataset are described in more detail by Stahlberg and Kumar (2021) (https://www.aclweb.org/anthology/2021.bea-1.4/)

Language:PythonCC-BY-4.0152 11 7

m2scorer

MaxMatch (M^2) Scorer - Evaluation program for grammatical error correction systems.

Language:PythonGPL-2.0146 4 7

e2e-metrics

E2E NLG Challenge Evaluation metrics

Language:PythonNOASSERTION90 5 3

UNION

UNION: An Unreferenced Metric for Evaluating Open-ended Story Generation

Language:Python57 8 7

yandex-tank

Technical fork. All issues, requests etc. should be done in yandex/yandex-tank

Language:PythonLGPL-2.148 40

X-MAML

Code base for " Zero-Shot Cross-Lingual Transfer with Meta Learning" papaer

Language:Python33 7 3

discofuse

32 9 3

FairRecSys

[Official Codes] Experiments on Generalizability of User-Oriented Fairness in Recommender Systems (SIGIR 2022)

Language:Jupyter Notebook32 40

user-satisfaction-simulation

"Simulating User Satisfaction for the Evaluation of Task-oriented Dialogue Systems" in SIGIR'21

Language:Python32 2 6

CWEB

Language:Macaulay216 20

ZEST

Language:PythonMIT9 3 4

clse

The Corpus of Linguistically Significant Entities (CLSE) is a dataset of named entities annotated by linguist experts. It includes 34 languages and covers 74 different semantic types to support various applications from airline ticketing to video games. The aim of the corpus is to facilitate the creation of more linguistically diverse NLG datasets.

Language:Python7 40

varepsilon

Aleksandr Chuklin's starred repositories

stable-diffusion

langchain

Prompt-Engineering-Guide

powerlevel10k

iodine

YaLM-100B

SDV

portfolio

pyserini

bleurt

aclpubcheck

convai

ua-gec

PIE