xxlong0 / Wonder3D

Single Image to 3D using Cross-Domain Diffusion for 3D Generation

Home Page:https://www.xxlong.site/Wonder3D/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Which "Stable Diffusion Image Variations Model" you fine-tuned?

BlingHe opened this issue · comments

Hi! Thanks the authers sharing the great work!

As you mentioned in the paper Sec. 5.1, May I ask which "Stable Diffusion Image Variations Model" you fine-tuned? Could you provide the link to this pre-trained model?
image

You may find details here: https://huggingface.co/lambdalabs/sd-image-variations-diffusers

I noticed that the "in_channels" of this image variations model is 4. But, your unet model needs 8 in_channels for additional "image_latent". How did your modified unet model be trained? Joint training with domain switcher and cross-domain attention or pre-training before training other modules?

Thanks in advance!