Yifu Yuan's repositories

Uni-RLHF-Platform

Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)

Language:PythonLicense:MITStargazers:24Issues:2Issues:0

Clean-Offline-RLHF

Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)

Language:PythonLicense:MITStargazers:23Issues:2Issues:1

pic2face

Enter a photo and return a 3D model of face

License:MITStargazers:4Issues:3Issues:0

euclid-iclr2023

Official implementation for "EUCLID: Towards efficient unsupervised reinforcement learning with multi-choice dynamics model" (ICLR2023)

Language:PythonStargazers:1Issues:0Issues:0

BabyAI-text

We perform functional grounding of LLMs' knowledge in BabyAI-Text

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

ED2

the ED2 implementation

Language:PythonStargazers:0Issues:0Issues:0

Mini-Uni-RLHF

Minimal implementation for easy-to-use RLHF annotation

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

Best-README-Template

An awesome README template to jumpstart your projects!

License:MITStargazers:0Issues:0Issues:0

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

diffusion_policy

[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

diffusion_reward

Official implementation of the paper "Diffusion Reward: Learning Rewards via Conditional Video Diffusion"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

dreamerv2

Mastering Atari with Discrete World Models

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

dreamerv3-torch

Implementation of Dreamer v3 in pytorch.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

drqv2

DrQ-v2: Improved Data-Augmented Reinforcement Learning

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Everything-LLMs-And-Robotics

The world's largest GitHub Repository for LLMs + Robotics

License:BSD-3-ClauseStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

learning-from-scratch

The repository of On Pre-Training for Visuo-Motor Control: Revisiting a Learning-from-Scratch Baseline

Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0

PreferenceTransformer

Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

pytorch3d

PyTorch3D is FAIR's library of reusable components for deep learning with 3D data

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

RLHF

RLHF

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

robohive

A unified framework for robot learning

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

robomimic

robomimic: A Modular Framework for Robot Learning from Demonstration

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

tdmpc

Code for "Temporal Difference Learning for Model Predictive Control"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

tdmpc2

Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

text2reward

Code for the paper "Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning"

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

unstable_baselines

Re-implementations of SOTA RL algorithms.

Language:PythonStargazers:0Issues:0Issues:0

v-d4rl

Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations

Language:PythonLicense:MITStargazers:0Issues:0Issues:0