kakaobrain / rq-vae-transformer

The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)


Disappointed, the result is too poor; do all your demos come from the training set?

world2vec opened this issue · comments

Hi,
I just played with cc3m_cc12m_yfcc in your notebook, and the result for the simple text 'a man with black glass' is quite poor:

[attached sample: a man with black glass_temp_1.0_top_k_1024_top_p_0.95]

No, the demo samples are not from the training set; they are all generated.
You can check some examples generated by other people on Twitter, such as https://twitter.com/multimodalart/status/1513947558913187843?s=21&t=Ofu8oiHTE5_3keSX_IDAiQ.

You can adjust the top-k and top-p sampling parameters according to the text prompt, and the quality can vary with the prompt, since a 3.9B-parameter model trained on ~30M text-image pairs is still smaller than DALL-E.
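
For reference, here is a minimal, generic sketch of top-k / top-p (nucleus) filtering applied to logits before sampling. The function name `filter_logits` and the tensor shapes are illustrative assumptions, not the repository's actual API; lowering `top_k` / `top_p` makes sampling more conservative, raising them increases diversity.

```python
import torch
import torch.nn.functional as F

def filter_logits(logits: torch.Tensor, top_k: int = 1024, top_p: float = 0.95) -> torch.Tensor:
    """Mask unlikely tokens so sampling only draws from the top-k / top-p
    portion of the distribution. `logits` has shape (batch, vocab_size)."""
    if top_k > 0:
        # Keep only the k largest logits per row.
        kth = torch.topk(logits, top_k, dim=-1).values[..., -1, None]
        logits = logits.masked_fill(logits < kth, float("-inf"))
    if top_p < 1.0:
        # Nucleus filtering: keep the smallest set of tokens whose
        # cumulative probability exceeds top_p.
        sorted_logits, sorted_idx = torch.sort(logits, descending=True, dim=-1)
        cum_probs = torch.cumsum(F.softmax(sorted_logits, dim=-1), dim=-1)
        remove = cum_probs > top_p
        remove[..., 1:] = remove[..., :-1].clone()  # always keep the best token
        remove[..., 0] = False
        remove = remove.scatter(-1, sorted_idx, remove)  # back to vocab order
        logits = logits.masked_fill(remove, float("-inf"))
    return logits

# Example: tighter top_k / top_p for more conservative samples.
logits = torch.randn(1, 16384)  # dummy logits; vocab size is illustrative
probs = F.softmax(filter_logits(logits, top_k=256, top_p=0.9), dim=-1)
next_token = torch.multinomial(probs, num_samples=1)
```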