Jiayi Zhou (Gaiejj)

Gaiejj

Geek Repo

Company:Peking University

Location:Beijing

Github PK Tool:Github PK Tool


Organizations
PKU-Alignment
PKU-MARL

Jiayi Zhou's repositories

omniairl

A trustworthy benchmark for IAIR Reinforcement Learning homework

Gaiejj

A brief intro of Jiayi Zhou

Language:TeXLicense:MITStargazers:1Issues:1Issues:0

JiayiGPT

An experimental open-source attempt to make GPT-4 fully autonomous.

Language:JavaScriptLicense:MITStargazers:1Issues:0Issues:0

langchain

⚡ Building applications with LLMs through composability ⚡

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

omnisafe_zjy

OmniSafe is a comprehensive and reliable benchmark for safe reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0
Language:TeXLicense:MITStargazers:1Issues:1Issues:0
Language:PythonStargazers:1Issues:1Issues:0

safe-rlhf

Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

brain_signal_deal

brain_signal_deal in ai class in xjtu

Language:PythonStargazers:0Issues:0Issues:0

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Stargazers:0Issues:1Issues:0
Stargazers:0Issues:1Issues:0

HowToMakeDocs

A tutorial on how to make beauty documents with Sphinx.

Stargazers:0Issues:0Issues:0

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

omnisafe_benchmarks_cruve

A simple repo to store omnisafe training curves.

Stargazers:0Issues:1Issues:0

Safe-Policy-Optimization

This is a benchmark repository for safe reinforcement learning algorithms

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Safety-Gymnasium

Safety Gymnaisum is a highly scalable and customizable safe reinforcement learning environment.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

torchopt

TorchOpt is an efficient library for differentiable optimization built upon PyTorch.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0