frank-xwang / InstanceDiffusion

[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"

Home Page: https://people.eecs.berkeley.edu/~xdwang/projects/InstDiff/

How do I fine-tune the model, and how much data is required?

grainw opened this issue

Hi, you can follow our instructions on model training to fine-tune the model on your own dataset. If you fine-tune the pretrained InstanceDiffusion model with LoRA, the amount of data required can be significantly reduced; the exact amount depends on your task.
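For intuition on why LoRA cuts the data and memory requirements: the pretrained weight `W` stays frozen and only a low-rank update `(alpha/r) * B @ A` is learned, so the number of trainable parameters drops by orders of magnitude. Below is a minimal sketch of a LoRA linear layer in plain PyTorch; it is illustrative only, not code from this repo.

```python
# Minimal LoRA linear layer sketch (illustrative, not InstanceDiffusion code).
# The frozen weight W is augmented with a trainable low-rank update
# (alpha / r) * B @ A, so only r * (in + out) parameters are learned.
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    def __init__(self, in_features, out_features, r=4, alpha=4.0):
        super().__init__()
        self.base = nn.Linear(in_features, out_features)
        self.base.weight.requires_grad_(False)  # freeze the pretrained weight
        self.base.bias.requires_grad_(False)
        self.A = nn.Parameter(torch.randn(r, in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(out_features, r))  # zero init: no change at step 0
        self.scale = alpha / r

    def forward(self, x):
        return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)

layer = LoRALinear(768, 768, r=4)
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
print(trainable)  # 6,144 trainable params vs. 589,824 in the full 768x768 layer
```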

Do you have any plans to open-source LoRA training?

Hi, this repo is mainly for reproducing the results reported in our paper, and we currently don't have plans to support additional tasks or LoRA training. However, you can check https://github.com/cloneofsimo/lora to learn how to add LoRA to Stable Diffusion (the base model used in this repo). Thanks!
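As a rough illustration of what such an integration involves: freeze the base model, wrap its attention projection layers with low-rank adapters, and train only the adapter parameters. The sketch below uses plain PyTorch; the module names `to_q`/`to_k`/`to_v` are an assumption based on the Stable Diffusion `CrossAttention` implementation, so check them against the actual model before use.

```python
# Hedged sketch of LoRA injection: freeze a model, then wrap its attention
# projections (names to_q/to_k/to_v assumed from Stable Diffusion's
# CrossAttention modules) with trainable low-rank adapters.
import torch
import torch.nn as nn

class LoRAWrapper(nn.Module):
    def __init__(self, base: nn.Linear, r=4, alpha=4.0):
        super().__init__()
        self.base = base  # the original (frozen) projection
        self.A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, r))
        self.scale = alpha / r

    def forward(self, x):
        return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)

def inject_lora(model: nn.Module, r=4):
    for p in model.parameters():
        p.requires_grad_(False)  # freeze everything in the base model
    # collect targets first, then replace, to avoid mutating during traversal
    targets = []
    for module in model.modules():
        for name in ("to_q", "to_k", "to_v"):
            child = getattr(module, name, None)
            if isinstance(child, nn.Linear):
                targets.append((module, name, child))
    for module, name, child in targets:
        setattr(module, name, LoRAWrapper(child, r=r))
    # only the freshly created A/B adapter parameters remain trainable
    return [p for p in model.parameters() if p.requires_grad]

# usage sketch: lora_params = inject_lora(unet)
#               opt = torch.optim.AdamW(lora_params, lr=1e-4)
```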

Thank you for your great work! Can I do lighter training with your model on a single RTX 3090 (24 GB)?

You may want to use FlashAttention (and/or DeepSpeed) during training if you want to train the model on an RTX 3090. FlashAttention is already implemented; you can enable it by setting the corresponding flag to True in the .yaml config file.
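For reference, the memory savings come from never materializing the full attention matrix and, optionally, recomputing activations during the backward pass. The snippet below is a hedged PyTorch (>= 2.0) sketch of both ideas; it is not this repo's mechanism — here you only flip the flash-attention flag in the training .yaml.

```python
# Illustrative PyTorch >= 2.0 sketch of two memory-saving levers;
# NOT InstanceDiffusion's config mechanism.
import torch
import torch.nn.functional as F
from torch.utils.checkpoint import checkpoint

q = torch.randn(1, 8, 4096, 64, device="cuda",
                dtype=torch.float16, requires_grad=True)
k = torch.randn_like(q)
v = torch.randn_like(q)

# scaled_dot_product_attention dispatches to a FlashAttention-style fused
# kernel on supported GPUs, so the 4096 x 4096 attention matrix is never
# stored in memory.
out = F.scaled_dot_product_attention(q, k, v)

# Activation checkpointing trades compute for memory: activations inside
# `block` are recomputed in the backward pass instead of being stored.
block = torch.nn.Sequential(
    torch.nn.Linear(64, 256), torch.nn.GELU(), torch.nn.Linear(256, 64)
).cuda().half()
out = checkpoint(block, out, use_reentrant=False)
```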

Hope this helps.

Closing this for now; please reopen it if you have more questions.