lucidrains / self-rewarding-lm-pytorch

Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

What changes should I make to apply the method on Llama2?

Labmem009 opened this issue · comments

I want to apply Self-rewarding and SPIN method on llama2 with alpaca-like finetuning datasets. What changes should I make to apply the method? And what config should I use?
Thanks a lot!