garrett4wade

Wei Fu's repositories

DeepSpeedExamples

Example models using DeepSpeed

Language:PythonApache-2.0000

openrlhf-vllm

Language:PythonApache-2.0100

DeepSpeed-for-dschat

Language:PythonApache-2.0000

util-scripts

Language:Python000

garrett4wade.github.io

Language:HTMLMIT000

sphinx-action

Github action that builds docs using sphinx and places errors inline

Language:PythonApache-2.0000

OpenRLHF-1

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Apache-2.0000

sphinx-pages

Build html documentation by Sphinx, and push to branch gh-pages.

Language:Shell000

sipo

Iteratively Learn Diverse Strategies with State Distance Information

Language:PythonBSD-3-Clause200

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Apache-2.0000

cugae

CUDA implementation of Generalized Advantage Estimation (GAE)

Language:Python000

flash-attention

Fast and memory-efficient exact attention

Language:PythonBSD-3-Clause000

gpu-burn

Multi-GPU CUDA stress test

Language:C++BSD-2-Clause000

scaling_marl

Language:Python000

Trust-Region-Methods-in-Multi-Agent-Reinforcement-Learning

Language:PythonMIT300

revisiting_marl

Official codebase for paper "Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning" (ICML22)

Language:Python2100

atari_dqn

Language:Python000

build_football_engine

Build script for Google Research Football on M1 Mac.

Language:Shell000

ray_rl

Language:Python200

rl-tf1

Language:Python000