David Herel's starred repositories
collapse-lm-iclr
"Collapse of Self-trained Language Models" codebase for ICLR 2024
arctic_shift
Making Reddit data accessible to researchers, moderators and everyone else. Interact with the data through large dumps, an API or web interface.
TAADpapers
Must-read Papers on Textual Adversarial Attack and Defense
EvolvingModularRobots_Unity
Software for Evolving Modular Robots in Unity
awd-lstm-lm
LSTM and QRNN Language Model Toolkit for PyTorch
why-I-hate-wow-private-servers
Reasons why most of WoW private servers sucks
davidherel.github.io
Source code for my website: https://davidherel.com
semantics-preserving-encoder
Python library providing a simple, fully supervised sentence embedding technique for textual adversarial attacks.
From-Simple-Transformations-to-Highly-Efficient-Jobs
Spark Training
LaMDA-rlhf-pytorch
Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT.
deep_learning_curriculum
Language model alignment-focused deep learning curriculum
RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.