kpandey008 / comparing-gans

This repository compares the performance of DCGAN, WGAN, WGAN-GP and LSGAN in terms of sample quality and several other factors


Comparison of GAN Architectures

This repository contains code and results for comparing several GAN architectures.

Diving into the code

1. Setting up the project dependencies

  1. Install pipenv with pip install pipenv.
  2. Install the project dependencies by running pipenv install in the project directory; this creates a virtual environment and installs the dependencies into it.

2. Running the code

2.1. Using the jupyter notebook

You can find the Jupyter notebook for this project at notebooks/project_gans.ipynb. Running it should be straightforward, as the code is well commented and explained along the way.

2.2. Running the code using CLI

In the project directory use the following commands:

For training: cg train

For generating samples: cg generate_samples

Use the command cg --help for an overview of what each command does.

The file config.yaml stores the training configuration.
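For reference, a minimal sketch of loading that configuration with PyYAML (the schema itself lives in config.yaml; no particular keys are assumed here):

```python
# Minimal sketch: loading config.yaml with PyYAML.
import yaml

with open("config.yaml") as f:
    config = yaml.safe_load(f)  # parsed into plain dicts/lists

print(config)  # inspect the configured hyperparameters
```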

Note: To compute the FID scores, I used the following repo: https://github.com/mseitzer/pytorch-fid
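The usual entry point of that package is its CLI, `python -m pytorch_fid path/to/real path/to/generated`. Recent versions also expose a Python API; the sketch below uses it, though the exact signature may vary across versions and the folder paths are illustrative:

```python
# Hedged sketch: computing FID between two image folders with pytorch-fid.
import torch
from pytorch_fid.fid_score import calculate_fid_given_paths

fid = calculate_fid_given_paths(
    ["data/mnist_real", "samples/wgan"],  # real vs. generated images (paths are illustrative)
    batch_size=50,
    device="cuda" if torch.cuda.is_available() else "cpu",
    dims=2048,  # feature dimensionality of the Inception-v3 pool3 layer
)
print(f"FID: {fid:.4f}")
```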

3. Results

3.1. Train dataset samples

MNIST dataset

Fashion-MNIST dataset

KMNIST dataset

3.2. Generated Samples

LSGAN

WGAN

WGAN-GP

Note: The samples are generated randomly, not cherry-picked.

3.3. FID Score Comparison

Note: All FID scores are reported using the same generator and discriminator architectures.

For the MNIST dataset, the FID scores for the architectures are:

| Architecture | FID Score | Dataset | Number of Samples |
| --- | --- | --- | --- |
| DCGAN | 10.8589 | MNIST | 8k |
| LSGAN | 10.9903 | MNIST | 8k |
| WGAN with weight clipping | 9.4645 | MNIST | 8k |
| WGAN with gradient penalty | 17.69 | MNIST | 8k |

The generator and discriminator architectures were kept fixed, and other factors such as the learning rate and the optimizer were also held constant. All architectures were trained for 50 epochs.

In this setting, WGAN with weight clipping produces the best FID score on the MNIST dataset.
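For concreteness, here is a minimal sketch (not the repo's code) of the two critic constraints being compared: WGAN enforces the Lipschitz constraint by clipping the critic's weights, while WGAN-GP instead adds a gradient penalty term to the critic loss.

```python
# Hedged sketch of the two WGAN critic constraints compared above.
import torch

def clip_critic_weights(critic, c=0.01):
    # WGAN: enforce the Lipschitz constraint by clipping critic weights to [-c, c]
    for p in critic.parameters():
        p.data.clamp_(-c, c)

def gradient_penalty(critic, real, fake, device):
    # WGAN-GP: penalize deviation of the critic's gradient norm from 1
    # on random interpolates between real and fake samples
    eps = torch.rand(real.size(0), 1, 1, 1, device=device)
    x_hat = (eps * real + (1 - eps) * fake).requires_grad_(True)
    scores = critic(x_hat)
    grads = torch.autograd.grad(scores.sum(), x_hat, create_graph=True)[0]
    return ((grads.flatten(1).norm(2, dim=1) - 1) ** 2).mean()
```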

3.4. Effect of Finetuning on sample quality

Source Dataset: MNIST
Target Dataset: Fashion-MNIST
GAN Architecture: WGAN with gradient penalty

| Architecture | FID Score (Scratch) | FID Score (Fine-tuned) | Dataset | Number of Samples | # Epochs | Transfer |
| --- | --- | --- | --- | --- | --- | --- |
| DCGAN | 23.2419 | 23.3011 | Fashion-MNIST | 8k | 50 | Generator only |
| WGAN with gradient penalty | 36.5917 | 31.38651 | Fashion-MNIST | 8k | 20 | Both |

From the above results it can be concluded that:

  1. Fine-tuning works best when both the generator and the discriminator are transferred (a sketch of this setup follows the list).

  2. Fine-tuning can help even when the source and target datasets are not closely related, as is the case here.
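A minimal sketch of the "Both" transfer setup; the checkpoint paths and the warm_start helper are illustrative, not the repo's actual layout:

```python
# Hypothetical sketch: warm-starting a Fashion-MNIST run from MNIST checkpoints.
import torch
from torch import nn

def warm_start(model: nn.Module, checkpoint_path: str) -> nn.Module:
    # Load source-dataset (MNIST) weights before fine-tuning on the target dataset.
    model.load_state_dict(torch.load(checkpoint_path, map_location="cpu"))
    return model

# "Both" transfer: warm-start generator and discriminator, then resume the usual
# WGAN-GP training loop on the Fashion-MNIST loader. Paths are illustrative.
# generator = warm_start(generator, "checkpoints/mnist/generator.pt")
# discriminator = warm_start(discriminator, "checkpoints/mnist/discriminator.pt")
```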

3.5. Effect of using Different activation functions

  1. I experimented with the Mish and Swish activation functions in both the generator and the discriminator; in this configuration, GAN training was very unstable and produced no plausible samples.

  2. Using leaky_relu in the discriminator and swish in the generator produces some plausible samples, though not of good quality. These are shown below:
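For reference, a minimal sketch of the two activations as drop-in PyTorch modules; these are the standard definitions, not the repo's code:

```python
# Minimal sketch: Swish and Mish as nn.Module activations.
import torch
import torch.nn.functional as F
from torch import nn

class Swish(nn.Module):
    def forward(self, x):
        return x * torch.sigmoid(x)  # swish(x) = x * sigmoid(x)

class Mish(nn.Module):
    def forward(self, x):
        return x * torch.tanh(F.softplus(x))  # mish(x) = x * tanh(softplus(x))
```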

4. Author

Kushagra Pandey / @kpandey008


5. License

MIT License

