ChangyuChen347's repositories

Language:PythonStargazers:0Issues:0Issues:0

MaskedThought

Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models

Language:PythonStargazers:6Issues:0Issues:0
Stargazers:0Issues:0Issues:0

semi-offline-RL

Semi-Offline Reinforcement Learning for Optimized Text Generation

Language:PythonStargazers:7Issues:0Issues:0

RL4LM

A modular RL library to fine-tune language models to human preferences

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0