Arya McCarthy's repositories
basic-color-terms
Paper to accompany "Modeling Color Terminology Across Thousands of Languages", accepted at EMNLP 2019
gapjunctions
Fast, accurate simulation of gap junctions in neural networks
gender-partitions
Code for McCarthy et al. (2020): Measuring the Similarity of Grammatical Gender Systems by Comparing Partitions
political-dynamics
A study of American National Election Studies (ANES) data over time.
acl-anthology
Data and software for building the ACL Anthology.
boilerplate
Boilerplate for Python data projects
brown-cluster
C++ implementation of the Brown word clustering algorithm.
clsp-pubs
The code used for crawling CLSP faculty publications from Semantic Scholar
DTL
Repository for DirecTL+
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
jekyll-now
Build a Jekyll blog in minutes, without touching the command line.
littlebird
Basic utils for Tweet processing
meseq
Rewriting fairseq from the ground up
rules_python
Bazel Python Rules
sacreBLEU
Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons
sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
tada2022
Text as Data 2022
translate
Translate - a PyTorch Language Library
worcomal
Word compounding across languages
yorick
Code for Defending Against Neural Fake News, https://rowanzellers.com/grover/