ChaosCodes

ChaosCodes's starred repositories

llama.cpp

LLM inference in C/C++

Language: C++ · License: MIT · 65813 550 3819

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language: Python · License: Apache-2.0 · 36585 348 1778

gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Language: Python · License: Apache-2.0 · 32513 170 4784

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language: Python · License: Apache-2.0 · 31869 204 4916

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

License: Apache-2.0 · 16397 134 125

academicpages.github.io

GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language: JavaScript · License: MIT · 11942 91 363

trl

Train transformer language models with reinforcement learning.

Language: Python · License: Apache-2.0 · 9603 74 1134

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language: Python · License: Apache-2.0 · 7709 108 156

ai-town

An MIT-licensed, deployable starter kit for building and customizing your own version of AI town, a virtual town where AI characters live, chat and socialize.

Language: TypeScript · License: MIT · 7432 61 98

sglang

SGLang is a fast serving framework for large language models and vision language models.

Language: Python · License: Apache-2.0 · 5409 55 541

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Language: Python · License: Apache-2.0 · 3832 23 520

weak-to-strong

Language: Python · License: MIT · 2491 32 18

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language: Python · License: Apache-2.0 · 1971 44 125

AlpacaDataCleaned

Alpaca dataset from Stanford, cleaned and curated

Language: Python · License: Apache-2.0 · 1503 27 25

lmms-eval

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Language: Python · License: NOASSERTION · 1405 4 155

llama-moe

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)

Language: Python · License: Apache-2.0 · 861 8 19

DataDreamer

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤

Language: Python · License: MIT · 809 8 26

MergeLM

Codebase for Merging Language Models (ICML 2024)

Language: Python · 749 7 40

Adan

Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

Language: Python · License: Apache-2.0 · 748 7 35

vec2text

Utilities for decoding deep representations (like sentence embeddings) back to text

Language: Python · License: NOASSERTION · 708 10 56

text-dedup

All-in-one text de-duplication

Language: Python · License: Apache-2.0 · 593 4 67

LLMSpeculativeSampling

Fast inference from large language models via speculative decoding

Language: Python · License: Apache-2.0 · 530 2 14

instruct-eval

This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.

Language: Python · License: Apache-2.0 · 517 13 28

Dataset_Quantization

[ICCV2023] Dataset Quantization

Language: Python · 250 7 13

VCD

[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

Language: Python · License: Apache-2.0 · 186 6 21

scattermoe

Triton-based implementation of Sparse Mixture of Experts.

Language: Python · License: Apache-2.0 · 170 5 12

sailor-llm

⚓️ Sailor: Open Language Models for South-East Asia

Language: Python · License: MIT · 104 7 1

tangent_task_arithmetic

Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".

Language: Python · License: MIT · 81 1 4

CLEX

[ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models

Language: Python · License: MIT · 72 4 9

skill-it

Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models

Language: Jupyter Notebook · License: Apache-2.0 · 39 12 1