Vincent Hellendoorn's repositories
ICLR20-Great
Data and Code for Reproducing "Global Relational Models of Source Code"
neural-nlp
[PNAS2021] The neural architecture of language: Integrative modeling converges on predictive processing
plur
PLUR (Programming-Language Understanding and Repair) is a collection of source code datasets suitable for graph-based machine learning. We provide scripts for downloading, processing, and loading the datasets. This is done by offering a unified API and data structures for all datasets.
sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
flash-attention-jax
Implementation of Flash Attention in Jax