Yeojoon's starred repositories
LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
simple-local-rag
Build a RAG (Retrieval Augmented Generation) pipeline from scratch and have it all run locally.
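The retrieve-then-generate flow this repo builds can be sketched in a few lines. This is a toy illustration, not the repo's code: real pipelines use vector embeddings and a local LLM, while here word overlap stands in for retrieval and a string template stands in for generation.

```python
# Toy sketch of a RAG pipeline: retrieve relevant context, then
# condition the "generation" step on it. Illustrative only.
def retrieve(query, documents, k=1):
    """Rank documents by naive word overlap with the query."""
    q = set(query.lower().split())
    scored = sorted(documents,
                    key=lambda d: len(q & set(d.lower().split())),
                    reverse=True)
    return scored[:k]

def generate(query, context):
    # Stand-in for a local LLM call: stitch prompt and context together.
    return f"Answer to '{query}' using: {context[0]}"

docs = ["RAG retrieves relevant text before generating.",
        "CUDA kernels run on the GPU."]
ctx = retrieve("what does RAG retrieve", docs)
answer = generate("what does RAG retrieve", ctx)
```

In a real local setup, `retrieve` would embed chunks and the query with a sentence-encoder and rank by cosine similarity, and `generate` would prompt a locally hosted model with the retrieved chunks.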
GPU-Puzzles
Solve puzzles. Learn CUDA.
bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
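The k-bit quantization idea behind this library can be shown with a minimal symmetric 8-bit round-trip. This is a conceptual sketch only; bitsandbytes itself uses more sophisticated block-wise and dynamic quantization schemes on GPU tensors.

```python
# Minimal sketch of symmetric 8-bit quantization: map floats to int8
# codes plus one scale factor, then reconstruct approximately.
def quantize_8bit(values):
    """Return int codes in [-127, 127] and a per-tensor scale."""
    scale = max(abs(v) for v in values) / 127 or 1.0
    codes = [round(v / scale) for v in values]
    return codes, scale

def dequantize_8bit(codes, scale):
    return [c * scale for c in codes]

weights = [0.5, -1.0, 0.25, 0.0]
codes, scale = quantize_8bit(weights)
restored = dequantize_8bit(codes, scale)
# Round-trip error is bounded by half a quantization step (scale / 2).
```

Storing one byte per value instead of four (fp32) is where the memory savings for large models come from; the accuracy cost is the bounded rounding error above.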
LLM-Adapters
Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"
lm-evaluation-harness
A framework for few-shot evaluation of language models.
flash-attention
Fast and memory-efficient exact attention
TensorRT-LLM
An easy-to-use Python API for defining Large Language Models (LLMs) and building TensorRT engines with state-of-the-art optimizations for efficient inference on NVIDIA GPUs, plus components for creating Python and C++ runtimes that execute those engines.
SqueezeLLM
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
low-bit-optimizers
Low-bit optimizers for PyTorch
FedML
FedML - A unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FedML Launch, a cross-cloud scheduler, runs AI jobs on any GPU cloud or on-premise cluster. TensorOpera AI (https://TensorOpera.ai), built on this library, is a generative AI platform at scale.
Awesome-Federated-Learning
FedML - The Research and Production Integrated Federated Learning Library: https://fedml.ai
FedAc-NeurIPS20
Code for "Federated Accelerated Stochastic Gradient Descent" (NeurIPS 2020)
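The federated repos above all build on the basic federated-averaging loop: clients take local gradient steps, then a server averages their weights. A minimal sketch (illustrative; FedAc adds acceleration on top of this, and FedML generalizes it to production settings):

```python
# Minimal federated-averaging sketch: local SGD steps on each client,
# then element-wise averaging of client weights on the server.
def local_sgd_step(weights, gradient, lr=0.1):
    """One plain SGD update on a client's copy of the model."""
    return [w - lr * g for w, g in zip(weights, gradient)]

def federated_average(client_weights):
    """Server step: average model weights element-wise across clients."""
    n = len(client_weights)
    return [sum(ws) / n for ws in zip(*client_weights)]

global_w = [0.0, 0.0]
client_grads = [[1.0, 2.0], [3.0, 4.0]]  # hypothetical per-client gradients
clients = [local_sgd_step(global_w, g) for g in client_grads]
global_w = federated_average(clients)  # averaged global model
```

The key property is that raw data never leaves the clients; only model weights (or updates) are communicated, which is what makes the communication-efficiency and acceleration questions studied in these repos interesting.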