lucidrains / llama-qrlhf

Implementation of the Llama architecture with RLHF + Q-learning
