Jiaming Ji's repositories
CUP-safe-rl
NeurIPS2022: Constrained Update Projection Approach to Safe Policy Optimization
Dev-Setup-Jiaming
Automation scripts for setting up a basic development environment.
omnisafe_zmsn
OmniSafe is a comprehensive and reliable benchmark for safe reinforcement learning.
Safe-Policy-Optimization
This is a benchmark repository for safe reinforcement learning algorithms
baichuan-7B
A large-scale 7B pretraining language model developed by Baichuan
draggable-example
vue.draggable example
functorch
functorch is JAX-like composable function transforms for PyTorch.
Gymnasium
A standard API for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
RRHF
RRHF & Wombat
safe-rlhf-zmsn
Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
safety-gymnasium-zmsn
Safety-Gymnaisum is a highly scalable and customizable safe reinforcement learning environment library.
stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
tianshou
An elegant PyTorch deep reinforcement learning library.
tldr
📚 Collaborative cheatsheets for console commands
torchopt
TorchOpt is an efficient library for differentiable optimization built upon PyTorch.