training code

Question

training code

ehartford opened this issue 2 months ago · comments

Eric Hartford commented 2 months ago

Hello, I am trying to find the training code, but it seems like there is just inference code.

Can you please point to the training code?

Nicolas Deperrois · Answer 1 · Thu Jun 06 2024 22:07:57 GMT+0800 (China Standard Time)

That would be great to get the training scripts, as it was done in the original LLaVA repo :)

carlos-havier · Answer 2 · Wed Jun 19 2024 05:38:07 GMT+0800 (China Standard Time)

I'd also love to use them for fine-tuning with several images, for few-shot image classification.

Nicolas Deperrois · Answer 3 · Fri Jun 21 2024 00:18:55 GMT+0800 (China Standard Time)

what do you guys think of this ?
https://github.com/NielsRogge/Transformers-Tutorials/blob/master/LLaVa/Fine_tune_LLaVa_on_a_custom_dataset_(with_PyTorch_Lightning).ipynb

By replacing llava by lava-next (processor and model)

chuangchuangtan · Answer 4 · Tue Jun 25 2024 21:28:50 GMT+0800 (China Standard Time)

I implement a LLava-llama3 Lora finetuning https://github.com/chuangchuangtan/LLaVA-NeXT-Image-Llama3-Lora

Nicolas Deperrois · Answer 5 · Wed Jun 26 2024 20:47:23 GMT+0800 (China Standard Time)

I implement a LLava-llama3 Lora finetuning https://github.com/chuangchuangtan/LLaVA-NeXT-Image-Llama3-Lora

Great thank you! Does it also work with Llama3 70b?
Btw, does it train only the bridger and language model, or does it also train the vision encoder (that we want to avoid)? Can we train without LoRA ?

chuangchuangtan · Answer 6 · Thu Jun 27 2024 10:04:21 GMT+0800 (China Standard Time)

I implement a LLava-llama3 Lora finetuning https://github.com/chuangchuangtan/LLaVA-NeXT-Image-Llama3-Lora

Great thank you! Does it also work with Llama3 70b? Btw, does it train only the bridger and language model, or does it also train the vision encoder (that we want to avoid)? Can we train without LoRA ?

It only trains the bridge and language model. We have set up to print the names of trainable parameters in the code, you can check them. We haven't tested it on 70b, but it should be work. You can set training commands to train without LoRA.