ChaosCodes's starred repositories

llama.cpp

LLM inference in C/C++

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:36585Issues:348Issues:1778

gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Language:PythonLicense:Apache-2.0Stargazers:32513Issues:170Issues:4784

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:31869Issues:204Issues:4916

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

academicpages.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptLicense:MITStargazers:11942Issues:91Issues:363

trl

Train transformer language models with reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:9603Issues:74Issues:1134

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonLicense:Apache-2.0Stargazers:7709Issues:108Issues:156

ai-town

A MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.

Language:TypeScriptLicense:MITStargazers:7432Issues:61Issues:98

sglang

SGLang is a fast serving framework for large language models and vision language models.

Language:PythonLicense:Apache-2.0Stargazers:5409Issues:55Issues:541

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Language:PythonLicense:Apache-2.0Stargazers:3832Issues:23Issues:520

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language:PythonLicense:Apache-2.0Stargazers:1971Issues:44Issues:125

AlpacaDataCleaned

Alpaca dataset from Stanford, cleaned and curated

Language:PythonLicense:Apache-2.0Stargazers:1503Issues:27Issues:25

lmms-eval

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Language:PythonLicense:NOASSERTIONStargazers:1405Issues:4Issues:155

llama-moe

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)

Language:PythonLicense:Apache-2.0Stargazers:861Issues:8Issues:19

DataDreamer

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.   🤖💤

Language:PythonLicense:MITStargazers:809Issues:8Issues:26

MergeLM

Codebase for Merging Language Models (ICML 2024)

Adan

Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

Language:PythonLicense:Apache-2.0Stargazers:748Issues:7Issues:35

vec2text

utilities for decoding deep representations (like sentence embeddings) back to text

Language:PythonLicense:NOASSERTIONStargazers:708Issues:10Issues:56

text-dedup

All-in-one text de-duplication

Language:PythonLicense:Apache-2.0Stargazers:593Issues:4Issues:67

LLMSpeculativeSampling

Fast inference from large lauguage models via speculative decoding

Language:PythonLicense:Apache-2.0Stargazers:530Issues:2Issues:14

instruct-eval

This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.

Language:PythonLicense:Apache-2.0Stargazers:517Issues:13Issues:28

Dataset_Quantization

[ICCV2023] Dataset Quantization

VCD

[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

Language:PythonLicense:Apache-2.0Stargazers:186Issues:6Issues:21

scattermoe

Triton-based implementation of Sparse Mixture of Experts.

Language:PythonLicense:Apache-2.0Stargazers:170Issues:5Issues:12

sailor-llm

⚓️ Sailor: Open Language Models for South-East Asia

Language:PythonLicense:MITStargazers:104Issues:7Issues:1

tangent_task_arithmetic

Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".

Language:PythonLicense:MITStargazers:81Issues:1Issues:4

CLEX

[ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models

Language:PythonLicense:MITStargazers:72Issues:4Issues:9

skill-it

Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:39Issues:12Issues:1