Haoxiang-Wang

Haoxiang Wang's starred repositories

fms-fsdp

🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash attention v2.

Language:PythonApache-2.013000

SciCode

A benchmark that challenges language models to code solutions for scientific problems

Language:PythonApache-2.05700

PINE

Offcial Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach""

Language:Python600

Adam-mini

Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793

Language:Python22600

reward-bench

RewardBench: the first evaluation tool for reward models.

Language:PythonApache-2.030800

hugo-PaperMod

A fast, clean, responsive Hugo theme.

Language:HTMLMIT929000

rlftqc

Reinforcement Learning for Fault-Tolerant Quantum Circuit Discovery

Language:PythonMIT600

qdx

Quantum error correction code AI-discovery with Jax

Language:Jupyter NotebookMIT1000

CodeUltraFeedback

CodeUltraFeedback: aligning large language models to coding preferences

Language:PythonMIT5700

2025QuantInternships

Public quant internship repository, maintained by NUFT but available for everyone.

107900

RLHF-Reward-Modeling

Recipes to train reward model for RLHF.

Language:PythonApache-2.050800

grok-1

Grok open release

Language:PythonApache-2.04921800

Directional-Preference-Alignment

Directional Preference Alignment

Apache-2.04300

flash-attention

Fast and memory-efficient exact attention

Language:PythonBSD-3-Clause1265900

easse

Easier Automatic Sentence Simplification Evaluation

Language:RoffGPL-3.015600

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonApache-2.01110600

[ICLR 2024 & NeurIPS 2023 WS] An Evaluator LM that is open-source, offers reproducible evaluation, and inexpensive to use. Specifically designed for fine-grained evaluation on a customized score rubric, Prometheus is a good alternative for human evaluation and GPT-4 evaluation.

Language:PythonMIT28100

Haoxiang-Wang

Haoxiang Wang's starred repositories

fms-fsdp

SciCode

PINE

Adam-mini

reward-bench

bedirt.github.io

kopytjuk.github.io

hugo-PaperMod

rlftqc

qdx

CodeUltraFeedback

2025QuantInternships

RLHF-Reward-Modeling

grok-1

Directional-Preference-Alignment

flash-attention

unified-io-2

easse

NeMo

prometheus

Otter

trl

GOAT

LLaMA-Factory

FLASK

lm-evaluation-harness

axolotl

tensor_parallel

RAFA_code

mint-bench