Beast code in Giters

ConvoKit is a toolkit for extracting conversational features and analyzing social phenomena in conversations. It includes several large conversational datasets along with scripts exemplifying the use of the toolkit on these datasets.

Language:Jupyter NotebookMIT010

Creative-Commons-Markdown

Markdown-formatted Creative Commons licenses

010

disaster_tweets

010

Discourse-Phenomena-in-Document-level-Neural-Machine-Translation

Datasets for "A Test Suite for Evaluating Discourse Phenomena in Document-level Neural Machine Translation" accepted by Proceedings of the Second International Workshop of Discourse Processing

010

DMRST_Parser

One implementation of the paper "DMRST: A Joint Framework for Document-Level Multilingual RST Discourse Segmentation and Parsing".

Language:Python000

dockerfiles

Language:Dockerfile02 1

good-translation-wrong-in-context

This is a repository with the data and code for the ACL 2019 paper "When a Good Translation is Wrong in Context: ..." and the EMNLP 2019 paper "Context-Aware Monolingual Repair for Neural Machine Translation"

Language:Ruby010

google-research

Google Research

Language:Jupyter NotebookApache-2.0010

kmeans_pytorch

kmeans using PyTorch

Language:Jupyter NotebookMIT010

korean_wordlist

korean wordlist

Language:Python010

language-programmes

Language:Jupyter Notebook000

Large-contrastive-pronoun-testset-EN-FR

Language:PLSQLMIT010

mtdlc

Library for parsing document-level corpora for machine translation

Apache-2.0020

Pytorch-Sequence-Bucket-Iterator

A minimal sampler example for bucketing sequences of similar lengths in Pytorch based off of @TrentBrick script https://gist.github.com/TrentBrick/bac21af244e7c772dc8651ab9c58328c.

Language:PythonApache-2.0010