Which "Stable Diffusion Image Variations Model" did you fine-tune?

Question

BlingHe opened this issue 6 months ago

Hi! Thanks to the authors for sharing this great work!

In Sec. 5.1 of the paper you mention fine-tuning a "Stable Diffusion Image Variations Model". May I ask which one you used? Could you provide a link to the pre-trained model?

xxlong0 · Answer 1 · Fri Jan 26 2024 10:27:36 GMT+0800 (China Standard Time)

You may find details here: https://huggingface.co/lambdalabs/sd-image-variations-diffusers

Jing He · Answer 2 · Sun Jan 28 2024 15:54:39 GMT+0800 (China Standard Time)

> You may find details here: https://huggingface.co/lambdalabs/sd-image-variations-diffusers

I noticed that "in_channels" of this image variations model is 4, but your UNet needs 8 input channels to take the additional "image_latent". How was your modified UNet trained? Was it trained jointly with the domain switcher and cross-domain attention, or was it pre-trained before the other modules were trained?

Thanks in advance!
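
For readers with the same question: the thread does not say how the authors went from 4 to 8 input channels, so the following is only a sketch of one common approach, not the authors' confirmed method. It widens the first convolution, copies the pre-trained weights into the first 4 input channels, and zero-initializes the extra 4, so that the widened layer initially behaves exactly like the pre-trained one. The helper name `expand_conv_in` is hypothetical, and the stand-in layer uses the Stable Diffusion v1 `conv_in` shape (4 in, 320 out, 3x3 kernel) rather than loading the real checkpoint.

```python
import torch
import torch.nn as nn


def expand_conv_in(conv: nn.Conv2d, new_in_channels: int) -> nn.Conv2d:
    """Return a copy of `conv` that accepts `new_in_channels` input channels.

    Hypothetical helper: pre-trained weights fill the first `conv.in_channels`
    input slices, and the added slices are zero-initialized, so the new layer
    initially ignores the extra channels.
    """
    if new_in_channels < conv.in_channels:
        raise ValueError("new_in_channels must be >= conv.in_channels")

    new_conv = nn.Conv2d(
        new_in_channels,
        conv.out_channels,
        kernel_size=conv.kernel_size,
        stride=conv.stride,
        padding=conv.padding,
        bias=conv.bias is not None,
    )
    with torch.no_grad():
        new_conv.weight.zero_()
        new_conv.weight[:, : conv.in_channels] = conv.weight
        if conv.bias is not None:
            new_conv.bias.copy_(conv.bias)
    return new_conv


# Stand-in for the pre-trained UNet's first layer (assumed SD v1 shape).
pretrained_conv_in = nn.Conv2d(4, 320, kernel_size=3, padding=1)
expanded_conv_in = expand_conv_in(pretrained_conv_in, 8)

# Concatenate the noisy latent with the conditioning image latent on channels.
noisy_latent = torch.randn(1, 4, 32, 32)
image_latent = torch.randn(1, 4, 32, 32)
out = expanded_conv_in(torch.cat([noisy_latent, image_latent], dim=1))
```

With this initialization the modified UNet starts from the pre-trained model's behavior, and the new channels only gain influence during fine-tuning. Whether that fine-tuning happens jointly with the other modules or in a separate stage is exactly what the question above asks, and the sketch does not answer it.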