kathir's repositories

Language:PythonStargazers:1Issues:1Issues:0

Anima

33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

fineweb-translation

A process to translate fineweb-edu dataset released by huggingface to indian languages using indictrans2 model.

Language:PythonStargazers:0Issues:0Issues:0

indictrans2-tokenization

A codebase to split text into sentences and tokenize them in batches for inference using IndicTrans2

Language:PythonStargazers:0Issues:1Issues:0

indictrans2-translation

code to run inference on tokenized sentences using indictrans-2

Language:PythonStargazers:0Issues:1Issues:0

levanter

Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

scalax

A simple library for scaling up JAX programs

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

autogen

Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ

Language:Jupyter NotebookLicense:CC-BY-4.0Stargazers:0Issues:0Issues:0

corenet

CoreNet: A library for training deep neural networks

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

deep-learning-with-python-notebooks

Jupyter notebooks for the code samples of the book "Deep Learning with Python"

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

executorch

On-device AI across mobile, embedded and edge for PyTorch

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

IndicTrans2

Translation models for 22 scheduled languages of India

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

IndicTransTokenizer

A simple, consistent and extendable module for IndicTrans2 tokenizer

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

kathir-ks

Config files for my GitHub profile.

Stargazers:0Issues:1Issues:0

lit-gpt

Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

llama-agentic-system

Agentic components of the Llama Stack APIs

License:NOASSERTIONStargazers:0Issues:0Issues:0

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

maxtext

A simple, performant and scalable Jax LLM!

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

minbpe

Minimal, clean, code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:1Issues:0

OLMo

Modeling, training, eval, and inference code for OLMo

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

paxml

Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry leading model flop utilization rates.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0

system-design-primer

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0