WooooDyy / LLM-Reverse-Curriculum-RL

Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" presented by Zhiheng Xi et al.

Home Page:https://arxiv.org/abs/2402.05808

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

WooooDyy/LLM-Reverse-Curriculum-RL Stargazers