ssbuild / llm_rlhf

realize the reinforcement learning training for gpt2 llama bloom and so on llm model

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

ssbuild/llm_rlhf Watchers