Calico's repositories
DeepSpeedExamples
Example models using DeepSpeed
Language:PythonApache-2.0000
omnisafe
OmniSafe is a comprehensive and reliable benchmark for safe reinforcement learning.
Language:PythonApache-2.0000
Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.
Language:PythonMIT000
reward-bench
RewardBench: the first evaluation tool for reward models.
Language:PythonApache-2.0000
safe-rlhf-calico
Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Language:PythonApache-2.0000
safety-gymnasium
Safety-Gymnaisum is a highly scalable and customizable safe reinforcement learning environment library.
Language:PythonApache-2.0000