What changes should I make to apply the method on Llama2?
Labmem009 opened this issue · comments
I want to apply Self-rewarding and SPIN method on llama2 with alpaca-like finetuning datasets. What changes should I make to apply the method? And what config should I use?
Thanks a lot!