SHI-Labs / Prompt-Free-Diffusion

Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024

Home Page:https://arxiv.org/abs/2305.16223

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Do you have one SeeCoder that provides alignment with T5 rather than CLIP?

fiona-lxd opened this issue · comments

Do you have one SeeCoder that provides alignment with T5 rather than CLIP?

So far no, but a SeeCoder can be obtained following the same training we mentioned in the paper by just changing the underlying T2I model into a T5-supported T2I model.

I see. Thank you for the quick reply. BTW, how much time does it cost to train the seecoder with 16 A100?

@xingqian2018 Could you please share the training time on 16 A100? I'm thinking of replicating/finetuning the model and want some estimates of how long it would take