WooooDyy / LLM-Reverse-Curriculum-RL

Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" by Zhiheng Xi et al.

https://arxiv.org/abs/2402.05808
