Yihe Deng's repositories

rlhf-summary-notes

A brief and partial summary of RLHF algorithms.

OpenVLThinker

OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement

Language: Python | Stargazers: 108 | Issues: 0 | Issues: 0

STIC

Enhancing Large Vision Language Models with Self-Training on Image Comprehension.

Language: Python | License: Apache-2.0 | Stargazers: 70 | Issues: 2 | Issues: 3

DuoGuard

DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails

Language: Python | License: Apache-2.0 | Stargazers: 18 | Issues: 1 | Issues: 2

Rephrase-and-Respond

Official repo of Rephrase-and-Respond: data, code, and evaluation

Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

Awesome-Prompt-Engineering

This repository contains hand-curated resources for Prompt Engineering, with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM, etc.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

awesome-trustworthy-deep-learning

A curated list of trustworthy deep learning papers, updated daily.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 1 | Issues: 0

SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0