Jiaming Ji (zmsn-2077)

Location: Peking University, Beijing

Home Page: jiamg.ji@gmail.com


Organizations
PKU-Alignment
PKU-MARL

Jiaming Ji's repositories

CUP-safe-rl

NeurIPS2022: Constrained Update Projection Approach to Safe Policy Optimization

Dev-Setup-Jiaming

Automation scripts for setting up a basic development environment.

Language: Shell | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

omnisafe_zmsn

OmniSafe is a comprehensive and reliable benchmark for safe reinforcement learning.

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

Safe-Policy-Optimization

A benchmark repository for safe reinforcement learning algorithms.

Language: Python | Stargazers: 1 | Issues: 0 | Issues: 0

baichuan-7B

A large-scale 7B pretrained language model developed by Baichuan.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

draggable-example

vue.draggable example

Language: Vue | Stargazers: 0 | Issues: 0 | Issues: 0

functorch

functorch is JAX-like composable function transforms for PyTorch.

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

Gymnasium

A standard API for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

RRHF

RRHF & Wombat

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

safe-rlhf-zmsn

Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

safety-gymnasium-zmsn

Safety-Gymnasium is a highly scalable and customizable safe reinforcement learning environment library.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: TeX | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

tianshou

An elegant PyTorch deep reinforcement learning library.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

tldr

📚 Collaborative cheatsheets for console commands

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

torchopt

TorchOpt is an efficient library for differentiable optimization built upon PyTorch.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: HTML | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0