Zewen Chi's starred repositories
LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow instructions within 1 hour using only 1.2M parameters
GPTQ-for-LLaMa
4-bit quantization of LLaMA using GPTQ
flops-counter.pytorch
FLOPs counter for convolutional networks in the PyTorch framework
LLMAgentPapers
Must-read papers on LLM agents.
prompt-in-context-learning
Awesome resources for in-context learning and prompt engineering: mastering LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date, cutting-edge content.
DeepSeek-LLM
DeepSeek LLM: Let there be answers
bigscience
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
proteinnet
Standardized dataset for machine learning of protein structure
ModuleFormer
ModuleFormer is a MoE-based architecture with two types of experts: stick-breaking attention heads and feedforward experts. We released a collection of ModuleFormer-based language models (MoLM) ranging from 4 billion to 8 billion parameters.
awesome-jekyll-websites
Awesome list of Jekyll websites and resources. Create a pull request to add your Jekyll website!
OpenAlpaca
OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA