Arnav Gudibande (arnav-gudibande)

Company: Perplexity AI

Location: San Francisco Bay Area


Organizations
hackclub
mlberkeley
p2p-app
sfhacks

Arnav Gudibande's starred repositories

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language: Python, License: MIT, Stargazers: 165438, Issues: 1557, Issues: 2452

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python, License: MIT, Stargazers: 35351, Issues: 360, Issues: 307

ChatGPT

Reverse engineered ChatGPT API

Language: Python, License: GPL-2.0, Stargazers: 27996, Issues: 290, Issues: 810

evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Language: Python, License: NOASSERTION, Stargazers: 14484, Issues: 263, Issues: 203

unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language: Python, License: Apache-2.0, Stargazers: 13665, Issues: 91, Issues: 672

chatgpt-google-extension

This project is deprecated. Check my new project ChatHub:

Language: TypeScript, License: GPL-3.0, Stargazers: 13258, Issues: 112, Issues: 316

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language: Python, License: Apache-2.0, Stargazers: 11154, Issues: 201, Issues: 2177

dolly

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Language: Python, License: Apache-2.0, Stargazers: 10808, Issues: 136, Issues: 162

outlines

Structured Text Generation

Language: Python, License: Apache-2.0, Stargazers: 7467, Issues: 45, Issues: 520

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language: Python, License: Apache-2.0, Stargazers: 7438, Issues: 109, Issues: 150

open_llama

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python, License: MIT, Stargazers: 6043, Issues: 35, Issues: 980

ai-deadlines

AI conference deadline countdowns

Language: JavaScript, License: MIT, Stargazers: 5517, Issues: 99, Issues: 92

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language: Python, License: MIT, Stargazers: 4410, Issues: 49, Issues: 287

self-instruct

Aligning pretrained language models with instruction data generated by themselves.

Language: Python, License: Apache-2.0, Stargazers: 3987, Issues: 57, Issues: 19

EasyLM

Large language models (LLMs) made easy. EasyLM is a one-stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Language: Python, License: Apache-2.0, Stargazers: 2342, Issues: 42, Issues: 86

sharegpt

Easily share permanent links to ChatGPT conversations with your friends

Language: TypeScript, License: MIT, Stargazers: 1724, Issues: 19, Issues: 91

DPR

Dense Passage Retriever is a set of tools and models for open-domain Q&A tasks.

Language: Python, License: NOASSERTION, Stargazers: 1672, Issues: 24, Issues: 210

alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 1358, Issues: 7, Issues: 135

awesome-instruction-dataset

A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca)

backdoor-learning-resources

A list of backdoor learning resources

Instruction-Tuning-Papers

Reading list on instruction tuning, a trend that started with Natural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).

tyro

Zero-effort CLI interfaces & config objects, from types

Language: Python, License: MIT, Stargazers: 442, Issues: 7, Issues: 97

adept-inference

Inference code for Persimmon-8B

Language: Python, License: Apache-2.0, Stargazers: 414, Issues: 16, Issues: 7

dpr-scale

Scalable training for dense retrieval models.

JAXSeq

Train very large language models in Jax.

Language: Python, License: MIT, Stargazers: 188, Issues: 10, Issues: 7

JAX_llama

Inference code for LLaMA models in JAX

Language: Python, License: MIT, Stargazers: 106, Issues: 1, Issues: 0

GPT4ALL-collector

A semi-scalable system to scrape the ChatGPT API to make input/output pairs

Language: Python, Stargazers: 36, Issues: 3, Issues: 0

chess-llm

Play chess against large language models.

Language: Python, License: GPL-3.0, Stargazers: 33, Issues: 3, Issues: 0

Language: JavaScript, License: MIT, Stargazers: 8, Issues: 13, Issues: 0