songweige / rich-text-to-image

Rich-Text-to-Image Generation

Home Page:https://rich-text-to-image.github.io/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Train on a custom dataset?

alelordelo opened this issue · comments

Hi, thanks for the amazing paper and code! : )

Would it be possible to train a a custom dataset with segmentation maps and nested JSON prompts?

cheers

Hi, thank you for your interest! Our method does not require any training. If you have another model, say stable diffusion finetuned on the custom dataset, you may directly load the model checkpoint and use our rich-text formats to control the generation.

thanks @songweige
Great, will try to run with with a custom model!

Does it work with image to image as well? Would be super helpful to specify prompts per masked region

For image-to-image models, I think it would be possible to integrate with models like ControlNet. I don't think it is compatible with models like InstructPix2Pix.

Sorry for missing your question. I think it is compatible with the SDEdit kind of method.