zetavg / LLaMA-LoRA-Tuner

UI tool for fine-tuning and testing your own LoRA models based on LLaMA, GPT-J and more. One-click run on Google Colab. Includes a Gradio ChatGPT-like chat UI to demonstrate your language models.

Support smaller models

paplorinc opened this issue · comments

commented

How difficult would it be to support 3- or 4-bit models, e.g. https://huggingface.co/Sosaka/Alpaca-native-4bit-ggml or https://huggingface.co/decapoda-research/llama-smallint-pt/blob/main/llama-7b-3bit.pt - or non-conversational ones that are smaller than 7B?

commented

databricks/dolly-v2-3b is pretty small and seems to work for prediction, but I can't make it work for fine-tuning (not sure which LoRA Target Modules to set)

edit: maybe they're: https://github.com/huggingface/peft/blob/2822398fbe896f25d4dac5e468624dc5fd65a51b/src/peft/utils/other.py#L220
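For what it's worth, dolly-v2-* models are GPT-NeoX based, and the linked PEFT mapping keys its defaults on the `model_type` from each model's `config.json`. A minimal sketch of that lookup (the mapping excerpt below is my assumption of what PEFT ships for these families, not copied verbatim):

```python
# Assumed excerpt of PEFT's default LoRA target-module mapping,
# keyed by the Hugging Face model_type string.
TRANSFORMERS_MODELS_TO_LORA_TARGET_MODULES_MAPPING = {
    "llama": ["q_proj", "v_proj"],
    "gptj": ["q_proj", "v_proj"],
    "gpt_neox": ["query_key_value"],  # dolly-v2-* are GPT-NeoX based
}

def default_lora_targets(model_type: str) -> list:
    """Look up the default LoRA target modules for a model family."""
    return TRANSFORMERS_MODELS_TO_LORA_TARGET_MODULES_MAPPING[model_type]

print(default_lora_targets("gpt_neox"))  # ['query_key_value']
```

So, assuming the mapping above is right, setting the LoRA Target Modules field to `query_key_value` might be what dolly-v2-3b needs.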

commented

forget it, the project name clearly states LLaMA :)