Calico (calico-1226)

calico-1226

Geek Repo

Company:ZJU

Location:Hangzhou, Zhejiang, China

Home Page:jtd.acad@gmail.com

Github PK Tool:Github PK Tool


Organizations
PKU-Alignment

Calico's repositories

Language:PythonStargazers:1Issues:1Issues:0

DeepSpeedExamples

Example models using DeepSpeed

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

omnisafe

OmniSafe is a comprehensive and reliable benchmark for safe reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

reward-bench

RewardBench: the first evaluation tool for reward models.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

safe-rlhf-calico

Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

safety-gymnasium

Safety-Gymnaisum is a highly scalable and customizable safe reinforcement learning environment library.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0