jwliao-ai

Junwei Liao's repositories

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Language: Python, License: Apache-2.0

A paper submitted to the school-level mathematical modeling competition.

Language: TeX

Implementation of FeDPO (Federated Direct Preference Optimization)

Language: Python

Junwei Liao's homepage.

Language: HTML