Zhendong-Wang / Prompt-Diffusion

Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"

Test

Jasonhyw opened this issue

What do the a_prompt and n_prompt arguments mean? What are they used for?

'a_prompt' is a positive prompt appended to the user's prompt to improve the generation quality of Stable Diffusion (SD); it steers SD toward high-quality images with extra descriptive words. Conversely, 'n_prompt' is a negative prompt that penalizes the generation with respect to the words it contains.
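
For concreteness, here is a minimal sketch of how these two arguments typically enter a sampling call. It uses the Hugging Face diffusers pipeline rather than this repo's own sampler, and the model id and prompt strings are illustrative placeholders: a_prompt is concatenated onto the main prompt, while n_prompt is passed as the negative prompt that classifier-free guidance pushes away from.

```python
# Minimal sketch with diffusers (not this repo's sampler); the model id and
# prompt strings below are placeholders, not values from Prompt-Diffusion.
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

prompt = "a photo of a cat"
a_prompt = "best quality, extremely detailed"            # quality-boosting suffix
n_prompt = "lowres, bad anatomy, blurry, worst quality"  # traits to penalize

image = pipe(
    prompt=f"{prompt}, {a_prompt}",  # positive condition: user prompt + a_prompt
    negative_prompt=n_prompt,        # replaces the empty unconditional prompt in CFG
    guidance_scale=9.0,
).images[0]
image.save("cat.png")
```

With classifier-free guidance, the sampler moves toward the positive embedding and away from the negative one at each denoising step, which is why the words in n_prompt get suppressed in the output.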

Thanks for your interesting work. Can you provide the evaluation code and examples for the style transfer task?

Hi Tao, I provided the evaluation code here: #11 (comment).
Performing style transfer with a model trained on only the current six tasks is not going to work well all the time. The style transfer results shown in the paper are not random generation results; this is an area that could be improved.

Thanks. Why are the style transfer results shown in the paper not random generation results?

I also ran into this problem. When I tried the style transfer task mentioned in the paper, the output looks like a reconstruction of the example image rather than a style-transferred result.
[image: generated output]
Here is my code (by the way, I found the input prompt makes no difference in this case):
[image: code screenshot]

Style transfer significantly diverges from the six tasks explicitly trained in the model, such as segmentation-to-image, depth-to-image, and others. Given the model's specific training on these tasks, its ability to generalize effectively to tasks like style transfer, which are substantially different, is limited. This limitation aligns with common understanding in the field of machine learning, where models often excel in areas closely related to their training data and struggle with tasks that are markedly distinct.

I personally tried it a few times, found that the model can work in some cases, and shared those cases in the paper.

@ZebinHe Style transfer works on my end in the two examples I shared. I am not sure why it didn't in your case.

We also provide a modified version here: https://arxiv.org/abs/2312.01408. A ViT-based encoder is used to encode the example pairs, and the model is trained on more tasks.

Thanks a lot for your reply.

Do you have a GitHub page for the modified version of the model, iPromptDiff?
