Awesome Image-to-image Translation Paper

A collection of resources on Image Translation.

Contributing

If you think I have missed out on something (or) have any suggestions (papers, implementations and other resources), feel free to pull a request

Feedback and contributions are welcome!

Tutorials
Supervised
Unsupervised
Applications
Chronological Order
Datasets

Tutorials

Unpaired Image-to-Image Translation. CVPR Tutorial on GANs (2018)

On Image-to-Image Translation. Stanford, MIT, Facebook, CUHK, SNU (2017)

Supervised

pix2pix: Image-to-Image Translation with Conditional Adversarial Networks.
Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, Alexei A. Efros.
CVPR 2017. [PDF] [Github]

BicycleGAN: Toward Multimodal Image-to-Image Translation.
Jun-Yan Zhu, Richard Zhang, Deepak Pathak, Trevor Darrell, Alexei A. Efros, Oliver Wang, Eli Shechtman.
NeurIPS 2017. [PDF] [Github]

High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs.
Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Andrew Tao, Jan Kautz, Bryan Catanzaro.
CVPR 2018. [PDF] [Github]

Geometry Guided Adversarial Facial Expression Synthesis.
Lingxiao Song, Zhihe Lu, Ran He, Zhenan Sun, Tieniu Tan.
MM 2018. [PDF]

TextureGAN: Controlling Deep Image Synthesis with Texture Patches.
Wenqi Xian, Patsorn Sangkloy, Varun Agrawal, Amit Raj, Jingwan Lu, Chen Fang, Fisher Yu, James Hays.
CVPR 2018. [PDF] [Github]

Smart, Sparse Contours to Represent and Edit Images.
Tali Dekel, Chuang Gan, Dilip Krishnan, Ce Liu, William T. Freeman.
CVPR 2018. [PDF] [[Project]

Image-to-image translation for cross-domain disentanglement.
Abel Gonzalez-Garcia, Joost van de Weijer, Yoshua Bengio.
NeurIPS 2018. [PDF]

MSGAN: Mode Seeking Generative Adversarial Networks for Diverse Image Synthesis.
Qi Mao, Hsin-Ying Lee, Hung-Yu Tseng, Siwei Ma, Ming-Hsuan Yang.
CVPR 2019. [PDF] [Github]

SPADE: Semantic Image Synthesis with Spatially-Adaptive Normalization.
Taesung Park, Ming-Yu Liu, Ting-Chun Wang, Jun-Yan Zhu.
CVPR 2019. [PDF] [Github]

C2-GAN: Cycle In Cycle Generative Adversarial Networks for Keypoint-Guided Image Generation.
Hao Tang, Dan Xu, Gaowen Liu, Wei Wang, Nicu Sebe, Yan Yan.
MM 2019. [PDF]

PI-REC: Progressive Image Reconstruction Network With Edge and Color Domain.
Sheng You, Ning You, Minxue Pan.
arxiv, 25 Mar 2019. [PDF] [Github]

Unsupervised

General

CycleGAN: Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks.
Jun-Yan Zhu, Taesung Park, Phillip Isola, Alexei A. Efros.
ICCV 2017. [PDF] [Github]

DiscoGAN: Learning to Discover Cross-Domain Relations with Generative Adversarial Networks.
Taeksoo Kim, Moonsu Cha, Hyunsoo Kim, Jung Kwon Lee, Jiwon Kim.
ICML 2017. [PDF] [Github]

DualGAN: Unsupervised Dual Learning for Image-to-Image Translation.
Zili Yi, Hao Zhang, Ping Tan, Minglun Gong.
ICCV 2017. [PDF] [Github]

DTN: Unsupervised Cross-Domain Image Generation.
Yaniv Taigman, Adam Polyak, Lior Wolf.
ICLR 2017. [PDF] Github]

UNIT: Unsupervised image-to-image translation networks.
Ming-Yu Liu, Thomas Breuel, Jan Kautz.
NeurIPS 2017. [PDF] [Github]

DistanceGAN: One-Sided Unsupervised Domain Mapping.
Sagie Benaim, Lior Wolf.
NeurIPS 2017. [PDF] [Github]

TriangleGAN: Triangle Generative Adversarial Networks.
Zhe Gan, Liqun Chen, Weiyao Wang, Yunchen Pu, Yizhe Zhang, Hao Liu, Chunyuan Li, Lawrence Carin.
NeurIPS 2017. [PDF] [Github]

NAM: Non-Adversarial Unsupervised Domain Mapping.
Yedid Hoshen, Lior Wolf.
ECCV 2018. [PDF] [Github]

SCAN: Unsupervised Image-to-Image Translation with Stacked Cycle-Consistent Adversarial Networks.
Minjun Li, Haozhi Huang, Lin Ma, Wei Liu, Tong Zhang, Yu-Gang Jiang.
ECCV 2018. [PDF]

GANimorph: Improved Shape Deformation in Unsupervised Image to Image Translation.
Aaron Gokaslan, Vivek Ramanujan, Daniel Ritchie, Kwang In Kim, James Tompkin.
ECCV 2018. [PDF] [Github]

OT-CycleGAN: Guiding the One-to-one Mapping in CycleGAN via Optimal Transport.
Guansong Lu, Zhiming Zhou, Yuxuan Song, Kan Ren, Yong Yu.
AAAI 2019. [PDF]

Art2Real: Unfolding the Reality of Artworks via Semantically-Aware Image-to-Image Translation.
Matteo Tomei, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara.
CVPR 2019. [PDF] [Github]

HarmonicGAN: Harmonic Unpaired Image-to-image Translation.
Rui Zhang, Tomas Pfister, Jia Li.
ICLR 2019. [PDF]

SDIT: Scalable and Diverse Cross-domain Image Translation.
Yaxing Wang, Abel Gonzalez-Garcia, Joost van de Weijer, Luis Herranz.
ACM MM, 2019. [PDF] [Github]

CrossNet: Latent Cross-Consistency for Unpaired Image Translation.
Omry Sendik, Dani Lischinski, Daniel Cohen-Or.
WACV 2020. [PDF]

Cross-Domain Cascaded Deep Feature Translation.
Oren Katzir, Dani Lischinski, Daniel Cohen-Or.
arxiv, 4 Jun 2019. [PDF]

Implicit Pairs for Boosting Unpaired Image-to-Image Translation.
Yiftach Ginger, Dov Danon, Hadar Averbuch-Elor, Daniel Cohen-Or.
arxiv, 15 Apr 2019. [PDF]

Unsupervised Shape Transformer for Image Translation and Cross-Domain Retrieval.
Kaili Wang, Liqian Ma, Jose Oramas M., Luc Van Gool, Tinne Tuytelaars.
arxiv, 5 Dec 2018. [PDF]

A Novel BiLevel Paradigm for Image-to-Image Translation.
Liqian Ma, Qianru Sun, Bernt Schiele, Luc Van Gool.
arxiv, 8 Apr 2019. [PDF]

AGUIT: Attribute Guided Unpaired Image-to-Image Translation with Semi-supervised Learning.
Xinyang Li, Jie Hu, Shengchuan Zhang, Xiaopeng Hong, Qixiang Ye, Chenglin Wu, Rongrong Ji.
arxiv, 29 Apr 2019. [PDF] [Github]

Attention-Examplar-Guided

ContrastGAN: Generative Semantic Manipulation with Mask-Contrasting GAN.
Xiaodan Liang, Hao Zhang, Eric P. Xing.
ECCV 2018. [PDF]

DA-GAN: Instance-level Image Translation by Deep Attention Generative Adversarial Networks.
Shuang Ma, Jianlong Fu, Chang Wen Chen, Tao Mei.
CVPR 2018. [PDF]

Attention-GAN for Object Transfiguration in Wild Images.
Xinyuan Chen, Chang Xu, Xiaokang Yang, Dacheng Tao.
ECCV 2018. [PDF]

Unsupervised Attention-guided Image to Image Translation.
Youssef A. Mejjati, Christian Richardt, James Tompkin, Darren Cosker, Kwang In Kim.
NeurIPS 2018. [PDF] [Github]

Show, Attend and Translate: Unsupervised Image Translation with Self-Regularization and Attention.
Chao Yang, Taehwan Kim, Ruizhe Wang, Hao Peng, C.-C. Jay Kuo.
TIP 2019. [PDF]

InstaGAN: Instance-aware image-to-image translation.
Zhiqiang Shen, Mingyang Huang, Jianping Shi, Xiangyang Xue, Thomas Huang.
ICLR 2019. [PDF] [Github]

INIT: Towards Instance-level Image-to-Image Translation.
Zhiqiang Shen, Mingyang Huang, Jianping Shi, Xiangyang Xue, Thomas Huang.
CVPR 2019. [PDF] [project]

Mask-Guided Portrait Editing with Conditional GANs.
Shuyang Gu, Jianmin Bao, Hao Yang, Dong Chen, Fang Wen, Lu Yuan.
CVPR 2019. [PDF] [Github]

U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation.
Junho Kim, Minjae Kim, Hyeonwoo Kang, Kwanghee Lee.
[PDF] [TensorFlow or Pytorch]

SPA-GAN: Spatial Attention GAN for Image-to-Image Translation.
Hajar Emami, Majid Moradi Aliabadi, Ming Dong, Ratna Babu Chinnam.
ArXiv 2019. [PDF]

Guided Image-to-Image Translation with Bi-Directional Feature Transformation.
Badour AlBahar, Jia-Bin Huang.
ArXiv 2019. [PDF] [Github]

Stylizing Video by Example.
Ondřej Jamriška, Šárka Sochorová, Ondřej Texler, Michal Lukáč, Jakub Fišer, Jingwan Lu, Eli Shechtman, Daniel Sýkora.
SIGGRAPH 2019. [PDF]

Disentanglement

XGAN: Unsupervised Image-to-Image Translation for Many-to-Many Mappings.
Amélie Royer, Konstantinos Bousmalis, Stephan Gouws, Fred Bertsch, Inbar Mosseri, Forrester Cole, Kevin Murphy.
ICML 2018. [PDF] [Dataset]

ELEGANT: Exchanging Latent Encodings with GAN for Transferring Multiple Face Attributes.
Taihong Xiao, Jiapeng Hong, Jinwen Ma.
ECCV 2018. [PDF] [Github]

MUNIT: Multimodal Unsupervised Image-to-Image Translation.
Xun Huang, Ming-Yu Liu, Serge Belongie, Jan Kautz.
ECCV 2018. [PDF] [Github]

Conditional Image-to-Image Translation.
Jianxin Lin, Yingce Xia, Tao Qin, Zhibo Chen, Tie-Yan Liu.
CVPR 2018. [PDF]

EGSC-IT: Exemplar Guided Unsupervised Image-to-Image Translation with Semantic Consistency.
Liqian Ma, Xu Jia, Stamatios Georgoulis, Tinne Tuytelaars, Luc Van Gool.
ICLR 2019. [PDF] [Github]

PairedCycleGAN: Asymmetric Style Transfer for Applying and Removing Makeup.
Huiwen Chang, Jingwan Lu, Fisher Yu, Adam Finkelstein.
CVPR 2018. [PDF]

DRIT: Diverse Image-to-Image Translation via Disentangled Representations.
Hsin-Ying Lee, Hung-Yu Tseng, Jia-Bin Huang, Maneesh Kumar Singh, Ming-Hsuan Yang.
ECCV 2018. [PDF] [Github]

UFDN: A Unified Feature Disentangler for Multi-Domain Image Translation and Manipulation.
Alexander H. Liu, Yen-Cheng Liu, Yu-Ying Yeh, Yu-Chiang Frank Wang.
NeurIPS 2018. [PDF] [Github]

GDWTC: Image-to-Image Translation via Group-wise Deep Whitening and Coloring Transformation.
Wonwoong Cho, Sungha Choi, David Keetae Park, Inkyu Shin, Jaegul Choo.
CVPR 2019. [PDF] [Github]

DRIT++: Diverse Image-to-Image Translation via Disentangled Representations.
Hsin-Ying Lee, Hung-Yu Tseng, Qi Mao, Jia-Bin Huang, Yu-Ding Lu, Maneesh Singh, Ming-Hsuan Yang.
IJCV 2019. [PDF] [[Project] [Github]

Multi-mapping Image-to-Image Translation via Learning Disentanglement.
Xiaoming Yu, Yuanqi Chen, Thomas Li, Shan Liu, Ge Li.
NeurIPS 2019. [PDF] [Github]

Flow-based Image-to-Image Translation with Feature Disentanglement.
Ruho Kondo, Keisuke Kawano, Satoshi Koide, Takuro Kutsuna.
NeurIPS 2019. [PDF]

Many-to-many

StarGAN v2: Diverse Image Synthesis for Multiple Domains.
Yunjey Choi, Youngjung Uh, Jaejun Yoo, Jung-Woo Ha. Clova AI Research, NAVER Corp.
arxiv, 4 Dec 2019. [PDF] [GitHub]

IcGAN: Invertible Conditional GANs for image editing.
Guim Perarnau, Joost van de Weijer, Bogdan Raducanu, Jose M. Álvarez.
NeurIPS Workshop 2016. [PDF] [Github]

Attribute-Guided Face Generation Using Conditional CycleGAN.
Yongyi Lu, Yu-Wing Tai, Chi-Keung Tang.
ECCV 2018. [PDF]

StarGAN: Uniﬁed Generative Adversarial Networks for Multi-Domain Image-to-Image Translation.
Yunjey Choi, Minje Choi, Munyoung Kim, Jung-Woo Ha, Sunghun Kim, Jaegul Choo.
CVPR 2018. [PDF] [Github]

AttGAN: Facial Attribute Editing by Only Changing What You Want.
Zhenliang He, Wangmeng Zuo, Meina Kan, Shiguang Shan, Xilin Chen.
TIP 2019. [PDF] [Github]

ComboGAN: Unrestrained Scalability for Image Domain Translation.
Asha Anoosheh, Eirikur Agustsson, Radu Timofte, Luc Van Gool.
CVPRW 2018. [PDF] [Github]

Augmented CycleGAN: Learning Many-to-Many Mappings from Unpaired Data.
Amjad Almahairi, Sai Rajeswar, Alessandro Sordoni, Philip Bachman, Aaron Courville.
ICML 2018. [PDF] [Github]

ModularGAN: Modular Generative Adversarial Networks.
Bo Zhao, Bo Chang, Zequn Jie, Leonid Sigal.
ECCV 2018. [PDF]

SG-GAN: Sparsely Grouped Multi-task Generative Adversarial Networks for Facial Attribute Manipulation.
Jichao Zhang, Yezhi Shu, Songhua Xu, Gongze Cao, Fan Zhong, Xueying Qin.
ACM MM 2018. [PDF] [Github]

GANimation: Anatomically-aware Facial Animation from a Single Image.
Albert Pumarola, Antonio Agudo, Aleix M. Martinez, Alberto Sanfeliu, Francesc Moreno-Noguer.
ECCV 2018. [PDF] [Github]

SingleGAN: Image-to-Image Translation by a Single-Generator Network using Multiple Generative Adversarial Learning.
Xiaoming Yu, Xing Cai, Zhenqiang Ying, Thomas Li, Ge Li.
ACCV 2018. [PDF] [Github]

SMIT: Stochastic Multi-Label Image-to-Image Translation.
Andrés Romero, Pablo Arbeláez, Luc Van Gool, Radu Timofte.
ICCV Workshops 2019. [PDF] [Github]
Image-to-Image Translation with Multi-Path Consistency Regularization.
Jianxin Lin, Yingce Xia, Yijun Wang, Tao Qin, Zhibo Chen.
IJCAI 2019. [PDF]

RelGAN: Multi-Domain Image-to-Image Translation via Relative Attributes.
Po-Wei Wu, Yu-Jing Lin, Che-Han Chang, Edward Y. Chang, Shih-Wei Liao.
ICCV 2019. [PDF]

DMIT: Multi-mapping Image-to-Image Translation via Learning Disentanglement.
Xiaoming Yu, Yuanqi Chen, Thomas Li, Shan Liu, Ge Li.
NeurIPS 2019. [PDF] [Github]

ADSPM: Attribute-Driven Spontaneous Motion in Unpaired Image Translation.
Ruizheng Wu, Xin Tao, Xiaodong Gu, Xiaoyong Shen, Jiaya Jia.
ICCV 2019. [PDF] [Github]

CartoonRenderer: An Instance-based Multi-Style Cartoon Image Translator.
Yugang Chen, Muchun Chen, Chaoyue Song, Bingbing Ni.
International Conference on Multimedia Modeling (MMM 2020). [PDF]

injectionGAN: Toward Learning a Unified Many-to-Many Mapping for Diverse Image Translation.
Wenju Xu, Shawn Keshmiri, Guanghui Wang.
arxiv 2019. [PDF]

Applications

Attribute-Editing

BeautyGAN: Instance-level Facial Makeup Transfer with Deep Generative Adversarial Network.
Tingting Li, Ruihe Qian, Chao Dong, Si Liu, Qiong Yan, Wenwu Zhu, Liang Lin
ACM MM 2018. [PDF] [Github] [Project]

UFDN: A Unified Feature Disentangler for Multi-Domain Image Translation and Manipulation.
Alexander H. Liu, Yen-Cheng Liu, Yu-Ying Yeh, Yu-Chiang Frank Wang.
NeurIPS 2018. [PDF] [Github]

ELEGANT: Exchanging Latent Encodings with GAN for Transferring Multiple Face Attributes.
Taihong Xiao, Jiapeng Hong, Jinwen Ma.
ECCV 2018. [PDF] [Github]

Biphasic-GAN: Biphasic Learning of GANs for High-Resolution Image-to-Image Translation.
Jie Cao, Huaibo Huang, Yi Li, Jingtuo Liu, Ran He, Zhenan Sun.
ArXiv 2019. [PDF]

High Fidelity Face Manipulation with Extreme Pose and Expression.
Chaoyou Fu, Yibo Hu, Xiang Wu, Guoli Wang, Qian Zhang, Ran He.
ArXiv 2019. [PDF]

Make a Face: Towards Arbitrary High Fidelity Face Manipulation.
Shengju Qian, Kwan-Yee Lin, Wayne Wu, Yangxiaokang Liu, Quan Wang, Fumin Shen, Chen Qian, Ran He.
ICCV 2019. [PDF]

SliderGAN: Synthesizing Expressive Face Images by Sliding 3D Blendshape Parameters.
Evangelos Ververas, Stefanos Zafeiriou.
arxiv 2019. [PDF]

Generating High-Resolution Fashion Model Images Wearing Custom Outfits.
Gökhan Yildirim, Nikolay Jetchev, Roland Vollgraf, Urs Bergmann.
Workshop on Computer Vision for Fashion, Art and Design, ICCV 2019. [PDF]

Video

Video-to-Video Synthesis.
Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Guilin Liu, Andrew Tao, Jan Kautz, Bryan Catanzaro.
NeurIPS 2018. [PDF] [Github]

Everybody Dance Now.
Caroline Chan, Shiry Ginosar, Tinghui Zhou, Alexei A. Efros.
ECCVW 2018. [PDF] [Project]

Preserving Semantic and Temporal Consistency for Unpaired Video-to-Video Translation.
Kwanyong Park, Sanghyun Woo, Dahun Kim, Donghyeon Cho, In So Kweon.
ACM MM 2019. [PDF]

Mocycle-GAN: Unpaired Video-to-Video Translation.
Yang Chen, Yingwei Pan, Ting Yao, Xinmei Tian, Tao Mei.
ACM MM 2019. [PDF]

Recycle-GAN: Unsupervised Video Retargeting.
Aayush Bansal, Shugao Ma, Deva Ramanan, Yaser Sheikh.
ECCV 2018. [PDF] [Github]

Few-shot Video-to-Video Synthesis.
Ting-Chun Wang, Ming-Yu Liu, Andrew Tao, Guilin Liu, Jan Kautz, Bryan Catanzaro.
ArXiv 2019. [PDF] [Project]

Data Augmentation

Generative Image Translation for Data Augmentation in Colorectal Histopathology Images.
NeurIPS 2019 Machine Learning for Health Workshop. [PDF] [Project]

DG-Net: Joint Discriminative and Generative Learning for Person Re-identification.
Zhedong Zheng, Xiaodong Yang, Zhiding Yu, Liang Zheng, Yi Yang, Jan Kautz.
CVPR 2019. [PDF] [Github]

Model-Compression-and-Pruning

Co-Evolutionary Compression for Unpaired Image Translation.
Han Shu, Yunhe Wang, Xu Jia, Kai Han, Hanting Chen, Chunjing Xu, Qi Tian, Chang Xu.
ICCV 2019. [PDF] [Github]

Adversarial-Examples

Adversarial Self-Defense for Cycle-Consistent GANs.
Dina Bashkirova, Ben Usman, Kate Saenko.
NeurIPS 2019. [PDF]]

Imbalanced Data

Elastic-InfoGAN: Unsupervised Disentangled Representation Learning in Imbalanced Data.
Utkarsh Ojha, Krishna Kumar Singh, Cho-Jui Hsieh, Yong Jae Lee.
arxiv, 1 Oct 2019. [PDF]

Few-Shot

FUNIT: Few-Shot Unsupervised Image-to-Image Translation.
Ming-Yu Liu, Xun Huang, Arun Mallya, Tero Karras, Timo Aila, Jaakko Lehtinen, Jan Kautz.
ICCV 2019. [PDF] [Project] [Github]

Semi Few-Shot Attribute Translation.
Ricard Durall, Franz-Josef Pfreundt, Janis Keuper.
ArXiv 2019. [PDF]

ZstGAN: An Adversarial Approach for Unsupervised Zero-Shot Image-to-Image Translation.
Jianxin Lin, Yingce Xia, Sen Liu, Tao Qin, Zhibo Chen.
ArXiv 2019. [PDF] [Github]

MetaPix: Few-Shot Video Retargeting.
Jessica Lee, Deva Ramanan, Rohit Girdhar.
ICCV 2019. [PDF]] [Project] [Github]

Few-shot Video-to-Video Synthesis.
Ting-Chun Wang, Ming-Yu Liu, Andrew Tao, Guilin Liu, Jan Kautz, Bryan Catanzaro.
ArXiv 2019. [PDF] [Project]

Image-Synthesis

SEAN: Image Synthesis with Semantic Region-Adaptive Normalization.
Peihao Zhu, Rameen Abdal, Yipeng Qin, Peter Wonka.
arxiv, 28 Nov 2019. [PDF] [Video]

LostGANs: Image Synthesis From Reconfigurable Layout and Style.
Wei Sun, Tianfu Wu.
ICCV 2019. [PDF] [Github]

Learning to Predict Layout-to-image Conditional Convolutions for Semantic Image Synthesis.
Xihui Liu, Guojun Yin, Jing Shao, Xiaogang Wang, Hongsheng Li.
NeurIPS 2019. [PDF]

Face-to-Parameter Translation for Game Character Auto-Creation.
Tianyang Shi, Yi Yuan, Changjie Fan, Zhengxia Zou, Zhenwei Shi, Yong Liu.
ICCV 2019. [PDF]

APDrawingGAN: Face-to-Parameter Translation for Game Character Auto-Creation.
Ran Yi, Yong-Jin Liu, Yu-Kun Lai, Paul L. Rosin.
CVPR 2019. [PDF Github] [Online Demo]

Cascaded Generation of High-quality Color Visible Face Images from Thermal Captures.
Naser Damer, Fadi Boutros, Khawla Mallat, Florian Kirchbuchner, Jean-Luc Dugelay, Arjan Kuijper.
ArXiv 2019. [PDF]

CartoonGAN: Generative Adversarial Networks for Photo Cartoonization.
Yang Chen, Yu-Kun Lai, Yong-Jin Liu.
CVPR 2018. [PDF] [Github] [unofficial test] [unofficial pytorch]

Retargeting-and-3D-Vision

Render4Completion: Synthesizing Multi-View Depth Maps for 3D Shape Completion.
Tao Hu, Zhizhong Han, Abhinav Shrivastava, Matthias Zwicker.
ICCV 2019 workshop on Geometry meets Deep Learning. [PDF]

Multi-Garment Net_Learning to Dress 3D people from Images.
Bharat Lal Bhatnagar, Garvita Tiwari, Christian Theobalt, Gerard Pons-Moll.
ICCV 2019. [PDF] [Github]

Tex2Shape: Detailed Full Human Body Geometry From a Single Image.
Thiemo Alldieck, Gerard Pons-Moll, Christian Theobalt, Marcus Magnor.
ICCV 2019. [arxiv] [PDF] [Github]

pix2vertex: Unrestricted facial geometry reconstruction using image-to-image translation.
Matan Sela, Elad Richardson, Ron Kimmel.
arxiv, 2017. [PDF] [Github]

Learning to Reconstruct People in Clothing from a Single RGB Camera.
Thiemo Alldieck, Marcus Magnor, Bharat Lal Bhatnagar, Christian Theobalt, Gerard Pons-Moll.
ICCV 2019. [PDF][Github]

360-Degree Textures of People in Clothing from a Single Image.
Verica Lazova, Eldar Insafutdinov, Gerard Pons-Moll.
3DV 2019. [PDF][Project]

SelectionGAN: Multi-Channel Attention Selection GAN with Cascaded Semantic Guidance for Cross-View Image Translation.
Hao Tang, Dan Xu, Nicu Sebe, Yanzhi Wang, Jason J. Corso, Yan Yan.
VPR 2019. [PDF] [Github]

VR Facial Animation via Multiview Image Translation.
Shih-En Wei, Jason Saragih, Tomas Simon, Adam W. Harley, Stephen Lombardi, Michal Perdoch, Alexander Hypes, Dawei Wang, Hernan Badino, Yaser Sheikh.
SIGGRAPH 2019. [PDF]

Conference

ICLR 2020

[accepted paper list]

AAAI 2020

[accepted paper list]

Distilling Portable Generative Adversarial Networks for Image Translation.
Hanting Chen, Yunhe Wang, Han Shu, Changyuan Wen, Chunjing Xu, Boxin Shi, Chao Xu, Chang Xu.

Fast and Robust Face-to-Parameter Translation for Game Character Auto-Creation.
Tianyang Shi, Zhengxia Zou, Yi Yuan, Changjie Fan.

Learning to Transfer: Unsupervised Domain Translation via Meta-Learning.
Jianxin Lin, Yijun Wang, Zhibo Chen, Tianyu He.

Multimodal Structure-Consistent Image-to-Image Translation.
Che-Tsung Lin, Yen-Yi Wu, Po-Hao Hsu, Shang-Hong Lai.

Go From the General to the Particular: Multi-Domain Translation with Domain Transformation Networks.
Yong Wang, Longyue Wang, Shuming Shi, Victor Li, Zhaopeng Tu.

Generating Diverse Translation by Manipulating Multi-Head Attention.[PDF]
Zewei Sun, Shujian Huang, Hao-Ran Wei, Xin-yu Dai, Jiajun Chen.

GAN-Based Unpaired Chinese Character Image Translation via Skeleton Transformation and Stroke Rendering.
Yiming Gao, Jiangqin Wu.

Benign Examples: Imperceptible Changes Can Enhance Image Translation Performance.
Vignesh Srinivasan, Klaus-Robert Müller, Wojciech Samek, Shinichi Nakajima.

Others 2020

CDGAN: Cyclic Discriminative Generative Adversarial Networks for Image-to-Image Transformation.
Kancharagunta Kishan Babu, Shiv Ram Dubey.
arxiv, 15 Jan 2020. [PDF]

NeurIPS 2019

[accepted paper list]

Multi-mapping Image-to-Image Translation via Learning Disentanglement. [PDF]
Xiaoming Yu, Yuanqi Chen, Shan Liu, Thomas Li, Ge Li.

Flow-based Image-to-Image Translation with Feature Disentanglement. [PDF]
Ruho Kondo, Keisuke Kawano, Satoshi Koide, Takuro Kutsuna.

Explicitly disentangling image content from translation and rotation with spatial-VAE. [PDF]
Tristan Bepler, Ellen Zhong, Kotaro Kelley, Edward Brignole, Bonnie Berger.*

Learning to Predict Layout-to-image Conditional Convolutions for Semantic Image Synthesis. [PDF]
Xihui Liu, Guojun Yin, Jing Shao, Xiaogang Wang, Hongsheng Li.

ICCV 2019

[accepted paper list]

Tex2Shape: Detailed Full Human Body Geometry From a Single Image. [PDF]
Thiemo Alldieck, Gerard Pons-Moll, Christian Theobalt, Marcus Magnor.

Face-to-Parameter Translation for Game Character Auto-Creation. [PDF]
Tianyang Shi, Yi Yuan, Changjie Fan, Zhengxia Zou, Zhenwei Shi, Yong Liu.

Learning Fixed Points in Generative Adversarial Networks: From Image-to-Image Translation to Disease Detection and Localization. [PDF]
Md Mahfuzur Rahman Siddiquee, Zongwei Zhou, Nima Tajbakhsh, Ruibin Feng, Michael B. Gotway, Yoshua Bengio, Jianming Liang.

Interactive Sketch & Fill: Multiclass Sketch-to-Image Translation. [PDF]
Arnab Ghosh, Richard Zhang, Puneet K. Dokania, Oliver Wang, Alexei A. Efros, Philip H. S. Torr, Eli Shechtman.

Deep CG2Real: Synthetic-to-Real Translation via Image Disentanglement. [PDF]
Sai Bi, Kalyan Sunkavalli, Federico Perazzi, Eli Shechtman, Vladimir G. Kim, Ravi Ramamoorthi.

Co-Evolutionary Compression for Unpaired Image Translation. [PDF]
Han Shu, Yunhe Wang, Xu Jia, Kai Han, Hanting Chen, Chunjing Xu, Qi Tian, Chang Xu.

Sym-Parameterized Dynamic Inference for Mixed-Domain Image Translation. [PDF]
Simyung Chang, SeongUk Park, John Yang, Nojun Kwak.

RelGAN: Multi-Domain Image-to-Image Translation via Relative Attributes. [PDF]
Po-Wei Wu, Yu-Jing Lin, Che-Han Chang, Edward Y. Chang, Shih-Wei Liao.

ADSPM: Attribute-Driven Spontaneous Motion in Unpaired Image Translation. [PDF] [Github]
Ruizheng Wu, Xin Tao, Xiaodong Gu, Xiaoyong Shen, Jiaya Jia.

Everybody Dance Now. [PDF]
Caroline Chan, Shiry Ginosar, Tinghui Zhou, Alexei A. Efros.

Multimodal Style Transfer via Graph Cuts. [PDF]
Yulun Zhang, Chen Fang, Yilin Wang, Zhaowen Wang, Zhe Lin, Yun Fu, Jimei Yang.

A Closed-Form Solution to Universal Style Transfer. [PDF]
Ming Lu, Hao Zhao, Anbang Yao, Yurong Chen, Feng Xu, Li Zhang.

Guided Image-to-Image Translation With Bi-Directional Feature Transformation. [PDF]
Badour AlBahar, Jia-Bin Huang.

Few-Shot Unsupervised Image-to-Image Translation. [PDF]
Ming-Yu Liu, Xun Huang, Arun Mallya, Tero Karras, Timo Aila, Jaakko Lehtinen, Jan Kautz.

InGAN: Capturing and Retargeting the "DNA" of a Natural Image. [PDF]
Assaf Shocher, Shai Bagon, Phillip Isola, Michal Irani.

Liquid Warping GAN: A Unified Framework for Human Motion Imitation, Appearance Transfer and Novel View Synthesis. [PDF]
Wen Liu, Zhixin Piao, Jie Min, Wenhan Luo, Lin Ma, Shenghua Gao.

CVPR 2019

[accepted paper list]

Latent Filter Scaling for Multimodal Unsupervised Image-To-Image Translation. [PDF]
Yazeed Alharbi, Neil Smith, Peter Wonka.

Attention-Aware Multi-Stroke Style Transfer. [PDF]
Yuan Yao, Jianqiang Ren, Xuansong Xie, Weidong Liu, Yong-Jin Liu, Jun Wang.

Textured Neural Avatars. [PDF]
Aliaksandra Shysheya, Egor Zakharov, Kara-Ali Aliev, Renat Bashirov, Egor Burkov, Karim Iskakov, Aleksei Ivakhnenko, Yury Malkov, Igor Pasechnik, Dmitry Ulyanov, Alexander Vakhitov, Victor Lempitsky.

Homomorphic Latent Space Interpolation for Unpaired Image-To-Image Translation. [PDF] [Github]
Ying-Cong Chen, Xiaogang Xu, Zhuotao Tian, Jiaya Jia.

Multi-Channel Attention Selection GAN With Cascaded Semantic Guidance for Cross-View Image Translation. [PDF]
Hao Tang, Dan Xu, Nicu Sebe, Yanzhi Wang, Jason J. Corso, Yan Yan.

TraVeLGAN: Image-To-Image Translation by Transformation Vector Learning. [PDF]
Matthew Amodio, Smita Krishnaswamy.

ReversibleGANs for Memory-Efficient ImageTo Image Translation. [PDF]
Tycho F.A. van der Ouderaa, Daniel E. Worrall.

Image-To-Image Translation via Group-Wise Deep Whitening-And-Coloring Transformation. [PDF] [Github]
Wonwoong Cho, Sungha Choi, David Keetae Park, Inkyu Shin, Jaegul Choo.

Towards Visual Feature Translation. [PDF]
Jie Hu, Rongrong Ji, Hong Liu, Shengchuan Zhang, Cheng Deng, Qi Tian.

Towards Instance-Level Image-To-Image Translation. [PDF]
Zhiqiang Shen, Mingyang Huang, Jianping Shi, Xiangyang Xue, Thomas S. Huang.

Art2Real: Unfolding the Reality of_Artworks via Semantically-Aware Image-To-Image Translation. [PDF]
Matteo Tomei, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara.

TransGaGa：Geometry-Aware Unsupervised Image To Image Translation. [PDF] [arxiv] [project]
Wayne Wu, Kaidi Cao, Cheng Li, Chen Qian, Chen Change Loy.

ICLR 2019

[accepted paper list]

InstaGAN: Instance-aware Image-to-Image Translation. [PDF] [Github]
Sangwoo Mo, Minsu Cho, Jinwoo Shin.

Harmonic Unpaired Image-to-image Translation. [PDF]
Rui Zhang, Tomas Pfister, Jia Li.

Local Image-to-Image Translation via Pixel-wise Highway Adaptive Instance Normalization. [PDF]
Wonwoong Cho, Seunghwan Choi, Junwoo Park, David Keetae Park, Tao Qin, Jaegul Choo.

EG-UNIT: Exemplar Guided Unsupervised Image-to-Image Translation with Semantic Consistency. [PDF]
Liqian Ma, Xu Jia, Stamatios Georgoulis, Tinne Tuytelaars, Luc Van Gool.

Unsupervised one-to-many image translation. [PDF]
Samuel Lavoie-Marchildon, Sebastien Lachapelle, Mikołaj Bińkowski, Aaron Courville, Yoshua Bengio, R Devon Hjelm.

Unsupervised Image to Sequence Translation with Canvas-Drawer Networks. [PDF]
Kevin Frans, Chin-Yi Cheng.

Unsupervised Video-to-Video Translation. [PDF]
Dina Bashkirova, Ben Usman, Kate Saenko.

AAAI 2019

Exploiting Time-Series Image-to-Image Translation to Expand the Range of Wildlife Habitat Analysis. [PDF]
Ruobing Zheng, Ze Luo, Baoping Yan.

Controllable Image-to-Video Translation: A Case Study on Facial Expression Generation. [PDF]
Lijie Fan, Wenbing Huang, Chuang Gan, Junzhou Huang, Boqing Gong.

OT-CycleGAN: Guiding the One-to-one Mapping in CycleGAN via Optimal Transport. [PDF]
Guansong Lu, Zhiming Zhou, Yuxuan Song, Kan Ren, Yong Yu.

ACM MM 2019

C2-GAN: Cycle In Cycle Generative Adversarial Networks for Keypoint-Guided Image Generation.[PDF]
Hao Tang, Dan Xu, Gaowen Liu, Wei Wang, Nicu Sebe, Yan Yan.

Towards Automatic Face-to-Face Translation.[PDF] [Github] [Project]
Prajwal Renukanand, Rudrabha Mukhopadhyay, Jerin Philip, Abhishek Jha, Vinay Namboodiri and C.V. Jawahar.

Preserving Semantic and Temporal Consistency for Unpaired Video-to-Video Translation.[PDF]
Kwanyong Park, Sanghyun Woo, Dahun Kim, Donghyeon Cho, In So Kweon.

Mocycle-GAN: Unpaired Video-to-Video Translation.[PDF]
Yang Chen, Yingwei Pan, Ting Yao, Xinmei Tian, Tao Mei.

Journal 2019

Show, Attend and Translate: Unsupervised Image Translation with Self-Regularization and Attention.
Chao Yang, Taehwan Kim, Ruizhe Wang, Hao Peng, C.-C. Jay Kuo.
TIP 2019. [PDF]

AttGAN: Facial Attribute Editing by Only Changing What You Want.
Zhenliang He, Wangmeng Zuo, Meina Kan, Shiguang Shan, Xilin Chen.
TIP 2019. [PDF] [Github]

Others 2019

RL-GAN: Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation.
Shani Gamrian, Yoav Goldberg.
ICML 2019. [accepted paper list] [PDF] [Supplementary PDF] [Github]

Stylizing Video by Example.
Ondřej Jamriška, Šárka Sochorová, Ondřej Texler, Michal Lukáč, Jakub Fišer, Jingwan Lu, Eli Shechtman, Daniel Sýkora.
SIGGRAPH 2019. [PDF]

CartoonRenderer: An Instance-based Multi-Style Cartoon Image Translator.
Yugang Chen, Muchun Chen, Chaoyue Song, Bingbing Ni.
International Conference on Multimedia Modeling (MMM2020). [PDF]

AttentionGAN: Attention-Guided Generative Adversarial Networks for Unsupervised Image-to-Image Translation.
Hao Tang, Dan Xu, Nicu Sebe, Yan Yan.
IJCNN 2019. [Github]

SMIT: Stochastic Multi-Label Image-to-Image Translation.
Andrés Romero, Pablo Arbeláez, Luc Van Gool, Radu Timofte.
ICCV Workshops 2019. [PDF] [Github]

Image-to-Image Translation with Multi-Path Consistency Regularization.
Jianxin Lin, Yingce Xia, Yijun Wang, Tao Qin, Zhibo Chen.
IJCAI 2019. [PDF]

Asymmetric Generative Adversarial Networks for Image-to-Image Translation.
Hao Tang, Dan Xu, Hong Liu, Nicu Sebe.
arxiv, 14 Dec 2019 (ACCV 2018 Extension) [PDF] [GitHub]

PPN2V: Fully Unsupervised Probabilistic Noise2Void.
Mangal Prakash, Manan Lalit, Pavel Tomancak, Alexander Krull, Florian Jug.
arxiv, 27 Nov 2019. [PDF] [GitHub] [MPI-CBG: Max-Planck Institute of Molecular Cell Biology and Genetics]

PN2V:Probabilistic Noise2Void: Unsupervised Content-Aware Denoising.
Alexander Krull, Tomas Vicar, Florian Jug.
arxiv, 3 Jun 2019. [PDF] [Github]

Unpaired Image Translation via Adaptive Convolution-based Normalization.
Wonwoong Cho, Kangyeol Kim, Eungyeup Kim, Hyunwoo J. Kim, Jaegul Choo.
arxiv, 29 Nov 2019. [PDF]

EDIT: Exemplar-Domain Aware Image-to-Image Translation.
Yuanbin Fu, Jiayi Ma, Lin Ma, Xiaojie Guo.
arxiv, 24 Nov 2019. [PDF] [GitHub]

Council-GAN: Breaking the cycle - Colleagues are all you need.
Ori Nizan, Ayellet Tal.
arxiv, 24 Nov 2019. [PDF]

injectionGAN: Toward Learning a Unified Many-to-Many Mapping for Diverse Image Translation.
Wenju Xu, Shawn Keshmiri, Guanghui Wang.
arxiv 2019. [PDF]

Cross-Domain Cascaded Deep Feature Translation.
Oren Katzir, Dani Lischinski, Daniel Cohen-Or.
arxiv 2019. [PDF]

CrossNet: Latent Cross-Consistency for Unpaired Image Translation.
Omry Sendik, Dani Lischinski, Daniel Cohen-Or.
arxiv 2019. [PDF]

Before 2018

pix2pix: [Project] [Code] [Paper]
BicycleGAN: [Code] [Tensorflow]
CycleGAN: [Project] [CycleGAN] [pytorch-CycleGAN-and-pix2pix] [Full Paper]
DualGAN: [Code] [Paper]
DiscoGAN: [Code] [Paper]
StarGAN: CVPR 2018. [Code] [Paper]
VAE-GAN: [Code] [Paper]
UNIT:[Code]
cVAE-GAN: [Paper]
DTN: [Code] [Paper]
FaderNets: [Code] [Paper]
IcGAN: [Code] [Paper]
GeneGAN: [Code] [Paper]
Face-Age-cGAN: [Paper]
DAGAN: Deep Attention GAN

Datasets

Please cite their papers if you use the data.

pix2pix Datasets

Some datasets can also be downloaded manually from the website or automatically using the following script:

python download-dataset.py datasetname

facades: 400 images from CMP Facades dataset. (31MB)
sketch: http://mmlab.ie.cuhk.edu.hk/archive/cufsf/
oil-chinese: http://www.cs.mun.ca/~yz7241/dataset/
day-night: http://www.cs.mun.ca/~yz7241/dataset/
facades: 400 images from CMP Facades dataset. [Citation]
cityscapes: 2975 images from the Cityscapes training set. [Citation]
maps: 1096 training images scraped from Google Maps
edges2shoes: 50k training images from UT Zappos50K dataset. Edges are computed by HED edge detector + post-processing. [Citation]
edges2handbags: 137K Amazon Handbag images from iGAN project. Edges are computed by HED edge detector + post-processing. [Citation]

CycleGAN Datasets

facades: 400 images from the CMP Facades dataset. [Citation]
cityscapes: 2975 images from the Cityscapes training set. [Citation]
maps: 1096 training images scraped from Google Maps.
horse2zebra: 939 horse images and 1177 zebra images downloaded from ImageNet using keywords wild horse and zebra
apple2orange: 996 apple images and 1020 orange images downloaded from ImageNet using keywords apple and navel orange.
summer2winter_yosemite: 1273 summer Yosemite images and 854 winter Yosemite images were downloaded using Flickr API. See more details in our paper.
monet2photo, vangogh2photo, ukiyoe2photo, cezanne2photo: The art images were downloaded from Wikiart. The real photos are downloaded from Flickr using the combination of the tags landscape and landscapephotography. The training set size of each class is Monet:1074, Cezanne:584, Van Gogh:401, Ukiyo-e:1433, Photographs:6853.
iphone2dslr_flower: both classes of images were downlaoded from Flickr. The training set size of each class is iPhone:1813, DSLR:3316.

Attribute Editing

CelebA. The CelebFaces Attributes (CelebA) dataset contains 202,599 face images of celebrities, each annotated with 40 binary attributes. size 178×218. hair color (black, blond, brown),gender (male/female), and age (young/old).
CelebA-HQ.
CelebAMask-HQ. It is a large-scale face image dataset that has 30,000 high-resolution face images selected from the CelebA dataset by following CelebA-HQ. Each image has segmentation mask of facial attributes corresponding to CelebA. The masks of CelebAMask-HQ were manually-annotated with the size of 512×512 and 19 classes including all facial components and acessories such as skin, nose, eyes, eyebrows, ears, mouth, lip, hair, hat, eyeglass, earring, necklace, neck, and cloth.
RaFD. The Radboud Faces Database (RaFD) consists of 4,824 images collected from 67 participants. Each participant makes eight facial expressions in three different gaze directions, which are captured from three different angles.
CMU Multi-PIE Face Database. [Multi-PIE] A large (305GB) database of images for training facial recognition software. It consists 13 poses within ±90 degrees of 337 subjects and can be used for face frontalization experiments.
AFHQ. Released in StarGAN v2. Animal FacesHQ (AFHQ) consists of 15,000 high-quality images at 512 × 512 resolution. We collected images with permissive licenses from the Flickr and Pixabay websites. All images are vertically and horizontally aligned to have the eyes at the center. The low-quality images were discarded by human effort. See the Project or Paper for more details.

Others

-Makeup Transfer. [Download]

DeepFashion. In-shop Clothes Retrieval Benchmark evaluates the performance of in-shop Clothes Retrieval. This is a large subset of DeepFashion, containing large pose and scale variations. It also has large diversities, large quantities, and rich annotations, including 7,982 number of clothing items, 52,712 number of in-shop clothes images, and ~200,000 cross-pose/scale pairs, Each image is annotated by bounding box, clothing type and pose type. Download
AI-Generated Faces: Free Resource of 100K Faces Without Copyright. [Download]
All-Age-Faces (AAF) Database - contains 13'322 face images (mostly Asian) distributed across all ages (from 2 to 80), including 7381 females and 5941 males. GitHub Paper
Celeb-DF. A New Dataset for DeepFake Forensics. [Download]
The Deepfake Detection Challenge (DFDC) Preview Dataset. Facebook AI. [PDF] [Project].
Faceforensics++. Learning to detect manipulated facial images, 2019.
AI Generated Diverse Photos. [Project]
t-less. An RGB-D- Dataset for6 D Pose Estimation of Texture-less Objects.

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Awesome Image-to-image Translation Paper

Contributing

Table of Contents

Tutorials

Supervised

Unsupervised

General

Attention-Examplar-Guided

Disentanglement

Many-to-many

Applications

Attribute-Editing

Video

Data Augmentation

Model-Compression-and-Pruning

Adversarial-Examples

Imbalanced Data

Few-Shot

Image-Synthesis

Retargeting-and-3D-Vision

Conference

ICLR 2020

AAAI 2020

Others 2020

NeurIPS 2019

ICCV 2019

CVPR 2019

ICLR 2019

AAAI 2019

ACM MM 2019

Journal 2019

Others 2019

Before 2018

Datasets

pix2pix Datasets

CycleGAN Datasets

Attribute Editing

Others

License

About