Haoxiang Wang's starred repositories
LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
mistral-src
Reference implementation of Mistral AI 7B v0.1 model.
llama-recipes
Scripts for fine-tuning Llama2 with composable FSDP & PEFT methods, covering single- and multi-node GPU setups. Supports default & custom datasets for applications such as summarization & question answering. Supports a number of candidate inference solutions, such as HF TGI and vLLM, for local or cloud deployment. Includes demo apps showcasing Llama2 for WhatsApp & Messenger.
mmsegmentation
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
lm-evaluation-harness
A framework for few-shot evaluation of language models.
opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMA2, Qwen, GLM, Claude, etc.) over 100+ datasets.
chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
ml-fastvit
This repository contains the official implementation of the research paper "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" (ICCV 2023).
tensor_parallel
Automatically split your PyTorch models on multiple GPUs for training & inference
panopticapi
COCO 2018 Panoptic Segmentation Task API (Beta version)
prometheus
[ICLR 2024 & NeurIPS 2023 WS] An evaluator LM that is open-source, offers reproducible evaluation, and is inexpensive to use. Specifically designed for fine-grained evaluation on a customized score rubric, Prometheus is a good alternative to human evaluation and GPT-4 evaluation.
mint-bench
Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Zihan Wang*, Jiateng Liu, Yangyi Chen, Lifan Yuan, Hao Peng and Heng Ji.
understanding-forgetting
Understanding Catastrophic Forgetting in Language Models via Implicit Inference