Wei Fu (garrett4wade)

garrett4wade

Geek Repo

Company:Tsinghua University

Location:Beijing, China

Github PK Tool:Github PK Tool

Wei Fu's repositories

DeepSpeedExamples

Example models using DeepSpeed

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:HTMLLicense:MITStargazers:0Issues:0Issues:0

sphinx-action

Github action that builds docs using sphinx and places errors inline

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

OpenRLHF-1

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

License:Apache-2.0Stargazers:0Issues:0Issues:0

sphinx-pages

Build html documentation by Sphinx, and push to branch gh-pages.

Language:ShellStargazers:0Issues:0Issues:0

sipo

Iteratively Learn Diverse Strategies with State Distance Information

Language:PythonLicense:BSD-3-ClauseStargazers:2Issues:0Issues:0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

License:Apache-2.0Stargazers:0Issues:0Issues:0

cugae

CUDA implementation of Generalized Advantage Estimation (GAE)

Language:PythonStargazers:0Issues:0Issues:0

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

gpu-burn

Multi-GPU CUDA stress test

Language:C++License:BSD-2-ClauseStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

revisiting_marl

Official codebase for paper "Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning" (ICML22)

Language:PythonStargazers:21Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

build_football_engine

Build script for Google Research Football on M1 Mac.

Language:ShellStargazers:0Issues:0Issues:0
Language:PythonStargazers:2Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0