Sehyun Choi (syncdoth)

syncdoth

Geek Repo

Location:South Korea | Hong Kong

Home Page:syncdoth.github.io

Twitter:@schoiaj

Github PK Tool:Github PK Tool

Sehyun Choi's starred repositories

grok-1

Grok open release

Language:PythonLicense:Apache-2.0Stargazers:49231Issues:561Issues:204

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:22476Issues:219Issues:126

guidance

A guidance language for controlling large language models.

Language:Jupyter NotebookLicense:MITStargazers:18393Issues:116Issues:509

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookLicense:MITStargazers:13998Issues:108Issues:315

mamba

Mamba SSM architecture

Language:PythonLicense:Apache-2.0Stargazers:12025Issues:98Issues:456

litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Language:PythonLicense:Apache-2.0Stargazers:9248Issues:87Issues:714

outlines

Structured Text Generation

Language:PythonLicense:Apache-2.0Stargazers:7482Issues:45Issues:522

axolotl

Go ahead and axolotl questions

Language:PythonLicense:Apache-2.0Stargazers:6856Issues:50Issues:597

mesop

Build delightful web apps quickly in Python

Language:PythonLicense:Apache-2.0Stargazers:4940Issues:33Issues:334

mergekit

Tools for merging pretrained large language models.

Language:PythonLicense:LGPL-3.0Stargazers:4239Issues:45Issues:268

NeMo-Guardrails

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

Language:PythonLicense:NOASSERTIONStargazers:3851Issues:36Issues:309

lmql

A language for constraint-guided and efficient LLM programming.

Language:PythonLicense:Apache-2.0Stargazers:3541Issues:22Issues:248

hivemind

Decentralized deep learning in PyTorch. Built to train models on thousands of volunteers across the world.

Language:PythonLicense:MITStargazers:1958Issues:56Issues:160

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language:PythonLicense:Apache-2.0Stargazers:1830Issues:43Issues:106

Awesome-LLM-KG

Awesome papers about unifying LLMs and KGs

BitNet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch

Language:PythonLicense:MITStargazers:1490Issues:38Issues:36

TransformerLens

A library for mechanistic interpretability of GPT-style language models

Language:PythonLicense:MITStargazers:1299Issues:16Issues:227

nanotron

Minimalistic large language model 3D-parallelism training

Language:PythonLicense:Apache-2.0Stargazers:1017Issues:41Issues:68

Sophia

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”

Language:PythonLicense:MITStargazers:917Issues:15Issues:40

safari

Convolutions for Sequence Modeling

Language:AssemblyLicense:Apache-2.0Stargazers:857Issues:35Issues:38

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Language:PythonLicense:MITStargazers:792Issues:21Issues:31

LLM-Shearing

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Language:PythonLicense:MITStargazers:507Issues:24Issues:69

honest_llama

Inference-Time Intervention: Eliciting Truthful Answers from a Language Model

Language:PythonLicense:MITStargazers:410Issues:9Issues:33

DoLa

Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"

predictive-forward-forward

Implementation/simulation of the predictive forward-forward credit assignment algorithm for training neurobiologically-plausible recurrent neural network models.

Language:PythonLicense:MITStargazers:54Issues:4Issues:1

pyhgf

PyHGF: A neural network library for predictive coding

Language:PythonLicense:GPL-3.0Stargazers:40Issues:2Issues:59

miner-release

Stable Diffusion and LLM miner for Heurist

Language:PythonLicense:NOASSERTIONStargazers:37Issues:7Issues:5

Knowledge-Constrained-Decoding

Official Code for EMNLP2023 Main Conference paper: "KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection"

pretraining

Pretraining

Language:PythonLicense:MITStargazers:15Issues:1Issues:0

FilmstripMaker

c++ Program for adding images together into a film strip. Useful in creating GUIs

Language:C++License:MITStargazers:5Issues:0Issues:0