pickxiguapi

yifu-yuan.github.io

Yifu Yuan's repositories

Uni-RLHF-Platform

Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)

Language: Python | License: MIT | 24 20

Clean-Offline-RLHF

Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)

Language: Python | License: MIT | 23 2 1

pic2face

Takes a photo as input and returns a 3D model of the face

License: MIT | 4 30

euclid-iclr2023

Official implementation for "EUCLID: Towards efficient unsupervised reinforcement learning with multi-choice dynamics model" (ICLR2023)

Language: Python | 100

BabyAI-text

We perform functional grounding of LLMs' knowledge in BabyAI-Text

Language: Python | License: MIT | 000

ED2

The ED2 implementation

Language: Python | 000

Mini-Uni-RLHF

Minimal implementation for easy-to-use RLHF annotation

Language: Python | License: MIT | 010

Best-README-Template

An awesome README template to jumpstart your projects!

License: MIT | 000

cleanrl

High-quality single-file implementations of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language: Python | License: NOASSERTION | 000

decision-diffuser

Language: Python | 000

diffusion_policy

[RSS 2023] Diffusion Policy: Visuomotor Policy Learning via Action Diffusion

Language: Python | License: MIT | 000

diffusion_reward

Official implementation of the paper "Diffusion Reward: Learning Rewards via Conditional Video Diffusion"

Language: Python | License: MIT | 000

dreamerv2

Mastering Atari with Discrete World Models

Language: Python | License: MIT | 000

dreamerv3-torch

Implementation of Dreamer v3 in PyTorch.

Language: Python | License: MIT | 000

drqv2

DrQ-v2: Improved Data-Augmented Reinforcement Learning

Language: Python | License: MIT | 000

Everything-LLMs-And-Robotics

The world's largest GitHub Repository for LLMs + Robotics

License: BSD-3-Clause | 000

IQL

Language: Python | License: MIT | 000

learning-from-scratch

The repository for "On Pre-Training for Visuo-Motor Control: Revisiting a Learning-from-Scratch Baseline"

Language: Python | 000

MV-MWM

Language: Python | 000

pickxiguapi.github.io

Language: HTML | 000

PreferenceTransformer

Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)

Language: Python | License: MIT | 000

pytorch3d

PyTorch3D is FAIR's library of reusable components for deep learning with 3D data

Language: Python | License: NOASSERTION | 000

RLHF

RLHF

Language: Jupyter Notebook | 000

robohive

A unified framework for robot learning

Language: Python | License: Apache-2.0 | 000

robomimic

robomimic: A Modular Framework for Robot Learning from Demonstration

Language: Python | License: MIT | 000

tdmpc

Code for "Temporal Difference Learning for Model Predictive Control"

Language: Python | License: MIT | 000

tdmpc2

Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"

Language: Python | License: MIT | 000

text2reward

Code for the paper "Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning"

Language: Jupyter Notebook | 000

unstable_baselines

Re-implementations of SOTA RL algorithms.

Language: Python | 000

v-d4rl

Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations

Language: Python | License: MIT | 000