kathir's repositories
Anima
33B Chinese LLM; DPO and QLoRA fine-tuning; 100K context; AirLLM 70B inference on a single 4GB GPU
fineweb-translation
A pipeline to translate the fineweb-edu dataset released by Hugging Face into Indian languages using the IndicTrans2 model.
indictrans2-tokenization
A codebase to split text into sentences and tokenize them in batches for inference using IndicTrans2
indictrans2-translation
Code to run inference on tokenized sentences using IndicTrans2
levanter
Legible, Scalable, Reproducible Foundation Models with Named Tensors and JAX
scalax
A simple library for scaling up JAX programs
autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
corenet
CoreNet: A library for training deep neural networks
deep-learning-with-python-notebooks
Jupyter notebooks for the code samples of the book "Deep Learning with Python"
executorch
On-device AI across mobile, embedded and edge for PyTorch
gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
IndicTrans2
Translation models for 22 scheduled languages of India
IndicTransTokenizer
A simple, consistent and extendable module for IndicTrans2 tokenizer
lit-gpt
Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
llama-agentic-system
Agentic components of the Llama Stack APIs
llama3
The official Meta Llama 3 GitHub site
maxtext
A simple, performant and scalable JAX LLM!
minbpe
Minimal, clean, code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
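The core of byte-level BPE training, as used in LLM tokenizers like the one minbpe implements, can be sketched in a few lines. This is an illustrative sketch, not minbpe's actual API: start from raw UTF-8 bytes, then repeatedly merge the most frequent adjacent pair of token ids into a new id.

```python
from collections import Counter

def train_bpe(text: str, num_merges: int):
    """Minimal byte-level BPE trainer (illustrative; not minbpe's API)."""
    # Token ids start as raw UTF-8 byte values (0..255).
    ids = list(text.encode("utf-8"))
    merges = {}  # (id, id) -> new merged id
    next_id = 256
    for _ in range(num_merges):
        # Count all adjacent pairs in the current sequence.
        pairs = Counter(zip(ids, ids[1:]))
        if not pairs:
            break
        pair = max(pairs, key=pairs.get)  # most frequent pair
        merges[pair] = next_id
        # Replace every occurrence of `pair` with the new id, left to right.
        new_ids, i = [], 0
        while i < len(ids):
            if i < len(ids) - 1 and (ids[i], ids[i + 1]) == pair:
                new_ids.append(next_id)
                i += 2
            else:
                new_ids.append(ids[i])
                i += 1
        ids = new_ids
        next_id += 1
    return ids, merges

# Example: two merges on "aaabdaaabac" first fuse "aa" (id 256),
# then "aa"+"a" → "aaa" (id 257), shrinking 11 bytes to 7 tokens.
ids, merges = train_bpe("aaabdaaabac", 2)
```

Encoding new text then just replays the learned merges in order; decoding expands merged ids back down to bytes.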
OLMo
Modeling, training, eval, and inference code for OLMo
paxml
Pax is a JAX-based machine learning framework for training large-scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry-leading model FLOPs utilization rates.
system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.