xiaodongdreams / Cones-V2

Cones 2

Official repo for Cones 2: Customizable Image Synthesis with Multiple Subjects.

See Project Page for more examples.

Cones 2 uses a simple yet effective representation to register a subject. The storage space required for each subject is approximately 5 KB. Moreover, Cones 2 allows for the flexible composition of various subjects without any model tuning.
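To make the storage and composition claims concrete, here is a minimal sketch, not the repository's code. It assumes a subject is stored as a single residual vector of 32-bit floats and that composing subjects means adding each residual to the base embedding of its token. The names `pack_residual` and `compose_prompt` and the width `EMBED_DIM = 1024` are hypothetical; the README does not state the embedding size or file format.

```python
import struct

EMBED_DIM = 1024  # assumed embedding width; not stated in this README


def pack_residual(residual):
    """Serialize one residual vector as little-endian float32 values."""
    return struct.pack(f"<{len(residual)}f", *residual)


def compose_prompt(tokens, base_embeddings, residuals):
    """Add each registered subject's residual to its token's base embedding.

    tokens:          list of prompt tokens
    base_embeddings: list of per-token vectors from the unmodified text encoder
    residuals:       dict mapping a subject token to its learned residual vector
    Tokens without a registered residual are passed through unchanged, so no
    model weights need to be touched to combine several subjects.
    """
    composed = []
    for token, base in zip(tokens, base_embeddings):
        delta = residuals.get(token)
        if delta is None:
            composed.append(list(base))
        else:
            composed.append([b + d for b, d in zip(base, delta)])
    return composed
```

Under these assumptions one residual takes `EMBED_DIM * 4` bytes before any file-format overhead, which is the same order of magnitude as the figure quoted above.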

TODO

  • Release code.
  • Release pretrained residual embeddings.
  • Release Gradio UI.

Method

[Figure: overview of the method]

(a) Given a few images of the customized subject, we fine-tune the text encoder to learn a residual embedding on top of the base embedding of the raw subject. (b) Based on the residual embeddings, we then use a layout as spatial guidance for arranging the subjects in the attention maps. This lets us strengthen the signal of the target subjects and weaken the signal of irrelevant subjects.
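Step (b) can be illustrated with a small sketch. It assumes the layout is a bounding box per subject and that guidance is applied by adding a constant to a subject token's attention scores inside its box and subtracting a constant outside it. The helper names `box_mask` and `guide_attention`, the `strengthen` and `weaken` parameters, and the additive update rule are assumptions made for illustration; the actual rule used in this repo may differ.

```python
def box_mask(height, width, box):
    """Build a 0/1 mask that is 1 inside box = (top, left, bottom, right).

    Bottom and right are exclusive, as in Python slicing.
    """
    top, left, bottom, right = box
    return [
        [1 if top <= y < bottom and left <= x < right else 0 for x in range(width)]
        for y in range(height)
    ]


def guide_attention(attn, mask, strengthen=1.0, weaken=1.0):
    """Edit one subject token's attention map using its layout mask.

    attn: 2D list (height x width) of attention scores for the subject token
    mask: 2D list of the same shape, 1 where the subject should appear
    Scores inside the region are raised, scores outside are lowered, so the
    subject is pushed toward its assigned location.
    """
    return [
        [a + strengthen if m else a - weaken for a, m in zip(attn_row, mask_row)]
        for attn_row, mask_row in zip(attn, mask)
    ]
```

For example, `guide_attention(attn, box_mask(16, 16, (0, 0, 8, 8)))` would bias a 16 x 16 map toward the top-left quadrant. With several subjects, each token would get its own mask.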

Results

[Figure: results on more challenging cases]

About

License: MIT License