David Herel (DavidHerel)


Company: CIIRC

Location: Czech Republic

Home Page: davidherel.com


David Herel's starred repositories

collapse-lm-iclr

"Collapse of Self-trained Language Models" codebase for ICLR 2024

Language: Python · License: MIT · Stargazers: 2 · Issues: 0

fundus

A very simple news crawler with a funny name

Language: Python · License: MIT · Stargazers: 210 · Issues: 0
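
A usage sketch, assuming fundus's documented Crawler and PublisherCollection API (exact names may differ by version):

from fundus import PublisherCollection, Crawler

# Crawl a couple of articles from the collection of US publishers.
crawler = Crawler(PublisherCollection.us)
for article in crawler.crawl(max_articles=2):
    print(article)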

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language: Python · License: MIT · Stargazers: 8772 · Issues: 0
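
The core BPE training step is tiny; a minimal illustrative sketch (hypothetical helper names, not code from this repo): count adjacent token pairs, then merge the most frequent pair into a new token id.

from collections import Counter

def most_frequent_pair(ids):
    # Count adjacent token-id pairs; return the most common one.
    return Counter(zip(ids, ids[1:])).most_common(1)[0][0]

def merge(ids, pair, new_id):
    # Replace every occurrence of `pair` with the single token `new_id`.
    out, i = [], 0
    while i < len(ids):
        if i < len(ids) - 1 and (ids[i], ids[i + 1]) == pair:
            out.append(new_id)
            i += 2
        else:
            out.append(ids[i])
            i += 1
    return out

ids = list("aaabdaaabac".encode("utf-8"))  # start from raw bytes
pair = most_frequent_pair(ids)             # e.g. (97, 97)
ids = merge(ids, pair, 256)                # new token ids start at 256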

API

Documentation and Samples for the Official HN API

License: MIT · Stargazers: 11176 · Issues: 0
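
The API itself is served over Firebase; a minimal sketch of fetching the current top story via the endpoints this repo documents:

import json
import urllib.request

BASE = "https://hacker-news.firebaseio.com/v0"

def fetch(path):
    # Every endpoint returns JSON, e.g. /v0/topstories.json.
    with urllib.request.urlopen(f"{BASE}/{path}.json") as resp:
        return json.load(resp)

top_ids = fetch("topstories")         # ids of the current top stories
story = fetch(f"item/{top_ids[0]}")   # a single item (story, comment, ...)
print(story["title"], story.get("url"))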

GNews

A happy and lightweight Python package that provides an API to search for articles on Google News and returns a JSON response.

Language: Python · License: MIT · Stargazers: 593 · Issues: 0
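
A usage sketch, assuming the package's GNews class and get_news method (parameter and key names unverified here):

from gnews import GNews

google_news = GNews(language="en", country="US", max_results=5)
articles = google_news.get_news("language models")  # list of article dicts
for article in articles:
    print(article["title"], article["url"])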

litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Language: Python · License: Apache-2.0 · Stargazers: 8890 · Issues: 0
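
A usage sketch, assuming litgpt's high-level Python API (LLM.load / generate); the model id is only an example:

from litgpt import LLM

# Download (if needed) and load a checkpoint, then generate.
llm = LLM.load("microsoft/phi-2")
print(llm.generate("What do llamas eat?"))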

llama.cpp

LLM inference in C/C++

Language: C++ · License: MIT · Stargazers: 61907 · Issues: 0

llama2.c

Inference Llama 2 in one file of pure C

Language: C · License: MIT · Stargazers: 16861 · Issues: 0

gigaGPT

A small codebase for training large models.

Language: Python · License: Apache-2.0 · Stargazers: 244 · Issues: 0

arctic_shift

Making Reddit data accessible to researchers, moderators and everyone else. Interact with the data through large dumps, an API or web interface.

Language: TypeScript · Stargazers: 203 · Issues: 0

nanoRWKV

RWKV in nanoGPT style

Language: Python · License: MIT · Stargazers: 159 · Issues: 0

sota_lm

Repository containing a state-of-the-art ensemble for several language modelling benchmarks.

Language: HTML · License: MIT · Stargazers: 7 · Issues: 0

lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4-bit quantization, LoRA and LLaMA-Adapter fine-tuning, and pre-training. Apache 2.0-licensed.

Language: Python · License: Apache-2.0 · Stargazers: 5899 · Issues: 0

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language: Python · License: Apache-2.0 · Stargazers: 7363 · Issues: 0

TAADpapers

Must-read Papers on Textual Adversarial Attack and Defense

Language: Python · License: MIT · Stargazers: 1472 · Issues: 0

EvolvingModularRobots_Unity

Software for Evolving Modular Robots in Unity

Language: Python · License: GPL-3.0 · Stargazers: 11 · Issues: 0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python · License: MIT · Stargazers: 34930 · Issues: 0

llama

Inference code for Llama models

Language: Python · License: NOASSERTION · Stargazers: 54291 · Issues: 0

cc_net

Tools to download and clean up Common Crawl data.

Language: Python · License: MIT · Stargazers: 938 · Issues: 0

awd-lstm-lm

LSTM and QRNN Language Model Toolkit for PyTorch

Language: Python · License: BSD-3-Clause · Stargazers: 1959 · Issues: 0

why-I-hate-wow-private-servers

Reasons why most WoW private servers suck.

License: AGPL-3.0 · Stargazers: 93 · Issues: 0

davidherel.github.io

Source code for my website: https://davidherel.com

Language: HTML · Stargazers: 2 · Issues: 0

semantics-preserving-encoder

Python library providing a simple, fully supervised sentence embedding technique for textual adversarial attacks.

Language: Python · License: MIT · Stargazers: 12 · Issues: 0

TabPFN

Official implementation of the TabPFN paper (https://arxiv.org/abs/2207.01848) and the tabpfn package.

Language: Python · License: Apache-2.0 · Stargazers: 1144 · Issues: 0
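
A usage sketch in the package's scikit-learn style, assuming the tabpfn package exposes TabPFNClassifier with the usual fit/predict interface (constructor defaults unverified):

from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from tabpfn import TabPFNClassifier

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

clf = TabPFNClassifier()  # prior-fitted transformer: no per-dataset gradient training
clf.fit(X_train, y_train)
print(clf.score(X_test, y_test))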

LaMDA-rlhf-pytorch

Open-source pre-training implementation of Google's LaMDA in PyTorch, with RLHF added, similar to ChatGPT.

Language: Python · License: MIT · Stargazers: 458 · Issues: 0

deep_learning_curriculum

Language model alignment-focused deep learning curriculum

Stargazers: 1168 · Issues: 0

RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), combining the best of RNNs and transformers: great performance, fast inference, low VRAM usage, fast training, "infinite" ctx_len, and free sentence embedding.

Language: Python · License: Apache-2.0 · Stargazers: 12015 · Issues: 0
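
The heart of RWKV is a linear-time replacement for attention; a minimal sketch of the serial WKV recurrence for a single channel, following the RWKV-4 formulation (without the numerical-stability rescaling the real kernels use):

import numpy as np

def wkv(k, v, w, u):
    # k, v: per-step key/value scalars, shape (T,); w: decay > 0; u: current-token bonus.
    num = den = 0.0
    out = np.empty_like(v)
    for t in range(len(k)):
        # The current token gets an extra bonus weight e^(u + k_t).
        out[t] = (num + np.exp(u + k[t]) * v[t]) / (den + np.exp(u + k[t]))
        # Past contributions decay by e^(-w) at every step.
        num = np.exp(-w) * num + np.exp(k[t]) * v[t]
        den = np.exp(-w) * den + np.exp(k[t])
    return out

out = wkv(np.random.randn(16), np.random.randn(16), w=0.5, u=0.1)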