shizhediao's starred repositories
RLHFlow.github.io
Webpage for RLHFlow
reward-bench
RewardBench: the first evaluation tool for reward models.
bootstrapped-preference-optimization-BPO-
Code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"
Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
sleeper-agents-paper
Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training".
MLLM-protector
The official repository for the paper "MLLM-Protector: Ensuring MLLM’s Safety without Hurting Performance"
Directional-Preference-Alignment
Directional Preference Alignment
Multi-LoRA-Composition
Repository for the Paper "Multi-LoRA Composition for Image Generation"
Automate-CoT
Findings of EMNLP 2023
ChemistryHTMLPaperParser
Convert HTML/XML Chemistry/Material Science articles into plain text
ConstraintChecker
Official code repository for the EACL 2024 paper "ConstraintChecker: A Plugin for Large Language Models to Reason on Commonsense Knowledge Bases"
Awesome-Scientific-Language-Models
A Curated List of Language Models in Scientific Domains
Contamination_For_PreTraining
The source code for the paper on contamination analysis for pre-training language models.
CoDA_NeurIPS2023
Official code for the NeurIPS 2023 paper "CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection"
promptbench
A unified evaluation framework for large language models