liumingzhu6060

liumingzhu6060

Geek Repo

Github PK Tool:Github PK Tool

liumingzhu6060's starred repositories

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language:PythonLicense:Apache-2.0Stargazers:2056Issues:19Issues:81

PaddleSlim

PaddleSlim is an open-source library for deep model compression and architecture search.

Language:PythonLicense:Apache-2.0Stargazers:1556Issues:92Issues:549

safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Language:PythonLicense:Apache-2.0Stargazers:1311Issues:18Issues:84

Safety-Prompts

Chinese safety prompts for evaluating and improving the safety of LLMs. 中文安全prompts,用于评估和提升大模型的安全性。

3dVideoFusion

将视频剪裁贴入到简单的3维盒子模型上,目前未详细整理过代码,纹理的映射坐标还不是很准确。待改进

Stargazers:7Issues:0Issues:0