mihirp1998 / AlignProp

AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods (PPO) for finetuning Stable Diffusion

Home Page:https://align-prop.github.io/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

mihirp1998/AlignProp Stargazers