syncdoth

followers

following

stars

South Korea | Hong Kong

syncdoth.github.io

Sehyun Choi's starred repositories

grok-1

Grok open release

Language:PythonApache-2.049231 561 204

llm.c

LLM training in simple, raw C/CUDA

Language:CudaMIT22476 219 126

guidance

A guidance language for controlling large language models.

Language:Jupyter NotebookMIT18393 116 509

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookMIT13998 108 315

mamba

Mamba SSM architecture

Language:PythonApache-2.012025 98 456

litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Language:PythonApache-2.09248 87 714

outlines

Structured Text Generation

Language:PythonApache-2.07482 45 522

axolotl

Go ahead and axolotl questions

Language:PythonApache-2.06856 50 597

mesop

Build delightful web apps quickly in Python

Language:PythonApache-2.04940 33 334

mergekit

Tools for merging pretrained large language models.

Language:PythonLGPL-3.04239 45 268

NeMo-Guardrails

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

Language:PythonNOASSERTION3851 36 309

lmql

A language for constraint-guided and efficient LLM programming.

Language:PythonApache-2.03541 22 248

hivemind

Decentralized deep learning in PyTorch. Built to train models on thousands of volunteers across the world.

Language:PythonMIT1958 56 160

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language:PythonApache-2.01830 43 106

Awesome-LLM-KG

Awesome papers about unifying LLMs and KGs

BitNet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch

Language:PythonMIT1490 38 36

TransformerLens

A library for mechanistic interpretability of GPT-style language models

Language:PythonMIT1299 16 227

nanotron

Minimalistic large language model 3D-parallelism training

Language:PythonApache-2.01017 41 68

Sophia

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”

Language:PythonMIT917 15 40

safari

Convolutions for Sequence Modeling

Language:AssemblyApache-2.0857 35 38

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Language:PythonMIT792 21 31

LLM-Shearing

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Language:PythonMIT507 24 69

honest_llama

Inference-Time Intervention: Eliciting Truthful Answers from a Language Model

Language:PythonMIT410 9 33

DoLa

Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"

Language:Python384 3 15

predictive-forward-forward

Implementation/simulation of the predictive forward-forward credit assignment algorithm for training neurobiologically-plausible recurrent neural network models.

Language:PythonMIT54 4 1

pyhgf

PyHGF: A neural network library for predictive coding

Language:PythonGPL-3.040 2 59

miner-release

Stable Diffusion and LLM miner for Heurist

Language:PythonNOASSERTION37 7 5

Knowledge-Constrained-Decoding

Official Code for EMNLP2023 Main Conference paper: "KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection"

Language:Python26 1 4

pretraining

Pretraining

Language:PythonMIT15 10

FilmstripMaker

c++ Program for adding images together into a film strip. Useful in creating GUIs

Language:C++MIT500