ChangyuChen347

ChangyuChen347's repositories

semi-offline-RL

Semi-Offline Reinforcement Learning for Optimized Text Generation

Language: Python 7 30

MaskedThought

Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models

Language: Python 600

COMET-VAE

000

review

Language: Python 000

RL4LM

A modular RL library to fine-tune language models to human preferences

Language: Python Apache-2.0 000