Issue about PatchGAN

Question

Issue about PatchGAN

thomascong121 opened this issue 4 years ago · comments

Hi:

I am wondering why it is sufficient to restrict attention to the structure in local image patches to model high-level features?

Phillip Isola · Answer 1 · Mon Nov 09 2020 14:20:57 GMT+0800 (China Standard Time)

This is an interesting question. One thing to recognize first is that the generator has receptive fields that cover the entire image, and that's what allows the generator to model high-level features. The PatchGAN discriminator can indeed get away with relatively small receptive fields, and ultimately that's related to natural image statistics, the level of stochasticity in the mapping you are learning, and the spatial density of information in the input image to the generator.

Zijue Li · Answer 2 · Fri Jan 14 2022 15:21:37 GMT+0800 (China Standard Time)

Hello, I wonder why it is available for discriminator to just force high-frequency correctness by restricting its receptive field in local image patches?

It is my understanding that it change the whole task of determining a picture is true or false into many sub-tasks of determining a image patch is true or false, which greatly reduces the complexity of the task, and it ultimately help to improve the performance, but nothing to do with the high-frequency information.

Could you please explain it for me? Thanks a lot!