zetavg / LLaMA-LoRA-Tuner

UI tool for fine-tuning and testing your own LoRA models based on LLaMA, GPT-J and more. One-click run on Google Colab. + A Gradio ChatGPT-like Chat UI to demonstrate your language models.



Support for Larger Models

scblaze opened this issue

It would be great if there was a way to use this with the 13B, 30B or 65B LLaMA model sizes.

In theory, it will work by specifying a larger LLaMA base model via the --base_model flag, e.g. --base_model=decapoda-research/llama-13b-hf, then selecting a LoRA model that's trained on top of that base model (such as chansung/alpaca-lora-13b). However, I still need to test it. If you get a chance to try it first, sharing how it goes would be appreciated! 🚀
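For reference, a minimal sketch of such a launch. Only the --base_model flag and the model names are from the discussion above; the entry-point script name (app.py) is an assumption and may differ in the repo:

```shell
# Launch the tuner with a 13B base model (app.py is assumed to be
# the entry point; adjust if the repo uses a different script name).
python app.py --base_model=decapoda-research/llama-13b-hf
# Then, in the UI, pick a LoRA adapter trained on that same base,
# e.g. chansung/alpaca-lora-13b.
```

Note that the LoRA adapter must match the base model it was trained on; mixing a 7B adapter with a 13B base will fail to load.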

BTW, I think I'll be adding the ability to switch between base models without restarting the app, and support for non-LLaMA models, in the near future.

Update 2023/4/20: The ability to switch between base models has been added.

I can confirm that it works with llama-13b-hf; it uses 93% of the 80GB of VRAM on an A100. The LoRA trained successfully and is working.