Wenxuan Zhou (wzhouad)

wzhouad

Geek Repo

Company:Zoom

Location:Los Angeles

Home Page:https://wzhouad.github.io/

Github PK Tool:Github PK Tool

Wenxuan Zhou's starred repositories

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:36031Issues:348Issues:1739

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:24335Issues:219Issues:3809

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonLicense:MITStargazers:19321Issues:297Issues:1342

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:12917Issues:99Issues:1033

qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Language:Jupyter NotebookLicense:MITStargazers:9786Issues:84Issues:247

LLMsPracticalGuide

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Language:PythonLicense:Apache-2.0Stargazers:8164Issues:73Issues:399

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language:PythonLicense:MITStargazers:6357Issues:39Issues:926

awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

optimum

🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools

Language:PythonLicense:Apache-2.0Stargazers:2367Issues:57Issues:716

MeZO

[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333

Language:PythonLicense:MITStargazers:1008Issues:20Issues:33

RecurrentGPT

Official Code for Paper: RecurrentGPT: Interactive Generation of (Arbitrarily) Long Text

Language:PythonLicense:GPL-3.0Stargazers:943Issues:13Issues:22

ReWOO

Decoupling Reasoning from Observations for Efficient Augmented Language Models

Language:PythonLicense:MITStargazers:861Issues:22Issues:13

LLM-Blender

[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cut the weaknesses through ranking and integrate the strengths through fusing generation to enhance the capability of LLMs.

Language:PythonLicense:Apache-2.0Stargazers:837Issues:15Issues:23

ChatGenTitle

🌟 ChatGenTitle:使用百万arXiv论文信息在LLaMA模型上进行微调的论文题目生成模型

Language:PythonLicense:NOASSERTIONStargazers:829Issues:12Issues:9

xgen

Salesforce open-source LLMs with 8k sequence length.

Language:PythonLicense:Apache-2.0Stargazers:713Issues:12Issues:14
Language:PythonLicense:MITStargazers:674Issues:9Issues:27

Mind2Web

[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web"

Language:Jupyter NotebookLicense:MITStargazers:632Issues:22Issues:41

FLARE

Forward-Looking Active REtrieval-augmented generation (FLARE)

Language:PythonLicense:MITStargazers:560Issues:7Issues:21

self-refine

LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.

Language:PythonLicense:Apache-2.0Stargazers:534Issues:13Issues:20

SwiftSage

SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks

CALM-pytorch

Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google Deepmind

Language:PythonLicense:MITStargazers:146Issues:7Issues:3

LaMP

Codes for papers on Large Language Models Personalization (LaMP)

StructGPT

The code and data for "StructGPT: A general framework for Large Language Model to Reason on Structured Data"

Language:PythonLicense:Apache-2.0Stargazers:93Issues:4Issues:18

PASTA

PASTA: Post-hoc Attention Steering for LLMs

Language:PythonLicense:MITStargazers:89Issues:2Issues:7
Language:Jupyter NotebookLicense:MITStargazers:64Issues:3Issues:1