Yan Cai (Cyccyyycyc)

Company: Wuhan University

Location: Wuhan

Yan Cai's repositories

Language: HTML | Stargazers: 2 | Issues: 0

safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0