CarperAI / trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

ppo using GLM2-6b as a backbone?

fanxinyun1991 opened this issue

🚀 The feature, motivation, and pitch

Is there any plan to add glm2-6b as a PPO backbone?

Alternatives

No response

Additional context

No response

commented

As far as I'm aware, there is no plan to add glm2-6b right now, so anyone who wants to contribute a PR is more than welcome to do so!