arnav-gudibande

Perplexity AI

San Francisco Bay Area

Organizations

hackclub

mlberkeley

p2p-app

sfhacks

Arnav Gudibande's starred repositories

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language: Python | License: MIT | 165438 1557 2452

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python | License: MIT | 35351 360 307

ChatGPT

Reverse engineered ChatGPT API

Language: Python | License: GPL-2.0 | 27996 290 810

evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Language: Python | License: NOASSERTION | 14484 263 203

unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language: Python | License: Apache-2.0 | 13665 91 672

chatgpt-google-extension

This project is deprecated. Check my new project ChatHub:

Language: TypeScript | License: GPL-3.0 | 13258 112 316

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language: Python | License: Apache-2.0 | 11154 201 2177

dolly

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Language: Python | License: Apache-2.0 | 10808 136 162

outlines

Structured Text Generation

Language: Python | License: Apache-2.0 | 7467 45 520

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language: Python | License: Apache-2.0 | 7438 109 150

open_llama

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

License: Apache-2.0 | 7314 119 91

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python | License: MIT | 6043 35 980

ai-deadlines

AI conference deadline countdowns

Language: JavaScript | License: MIT | 5517 99 92

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language: Python | License: MIT | 4410 49 287

self-instruct

Aligning pretrained language models with instruction data generated by themselves.

Language: Python | License: Apache-2.0 | 3987 57 19

EasyLM

Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Language: Python | License: Apache-2.0 | 2342 42 86

sharegpt

Easily share permanent links to ChatGPT conversations with your friends

Language: TypeScript | License: MIT | 1724 19 91

DPR

Dense Passage Retriever (DPR) is a set of tools and models for open-domain Q&A tasks.

Language: Python | License: NOASSERTION | 1672 24 210

alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Language: Jupyter Notebook | License: Apache-2.0 | 1358 7 135

awesome-instruction-dataset

A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca)

backdoor-learning-resources

A list of backdoor learning resources

Instruction-Tuning-Papers

Reading list on instruction tuning. The trend starts from Natural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).

tyro

Zero-effort CLI interfaces & config objects, from types

Language: Python | License: MIT | 442 7 97

adept-inference

Inference code for Persimmon-8B

Language: Python | License: Apache-2.0 | 414 16 7

dpr-scale

Scalable training for dense retrieval models.

Language: Python | 262 19 13

JAXSeq

Train very large language models in Jax.

Language: Python | License: MIT | 188 10 7

JAX_llama

Inference code for LLaMA models in JAX

Language: Python | License: MIT | 106 10

GPT4ALL-collector

A semi-scalable system to scrape the ChatGPT API to make input/output pairs

Language: Python | 36 30

chess-llm

Play chess against large language models.

Language: Python | License: GPL-3.0 | 33 30

bairblog.github.io

Language: JavaScript | License: MIT | 8 130