There are 0 repositories under the trlx topic.
ZYN: Zero-Shot Reward Models with Yes-No Questions
Implements reinforcement learning training for LLMs such as GPT-2, LLaMA, BLOOM, and others.