HavanaBeachDeepLearning

Members: Kiss Richárd (KAYXFT), Pogány Domonkos (D8AFH4), Sándor Dániel (F193CB)

First Assignment

For our homework assignment we chose to build a Generative Adversarial Network (GAN). Our main goal is to generate Pokémon images based on the following dataset: https://www.kaggle.com/kvpratama/pokemon-images-dataset/data. As a possible improvement, we would like to build a network that can generate Pokémon from drawings made by users.

Preprocessing:

Because we are building a GAN, we do not need to split the images into training and validation sets: all images are used for training. We also did not normalize the images. Only the discriminator sees them, to decide whether its input is a generated ("fake") image or an original one, so to generate images that resemble the originals we left the pixel values unchanged. The only preprocessing we did was resizing the images from 256x256 to 64x64, so that the network trains on smaller inputs and can converge faster.
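As an illustration of the resizing step, here is a minimal sketch that shrinks a 256x256 RGB array to 64x64 by averaging 4x4 pixel blocks. The function name, the averaging method and the random stand-in image are assumptions made for this example; the notebook may well use an image library's resize function instead.

```python
import numpy as np

def downscale_by_averaging(image: np.ndarray, factor: int = 4) -> np.ndarray:
    """Shrink an H x W x C image by averaging non-overlapping factor x factor blocks."""
    height, width, channels = image.shape
    if height % factor or width % factor:
        raise ValueError("image sides must be divisible by the factor")
    # Split each spatial axis into (number of blocks, pixels per block).
    blocks = image.reshape(height // factor, factor, width // factor, factor, channels)
    # Average over the two "pixels per block" axes, then go back to 8-bit values.
    return blocks.mean(axis=(1, 3)).round().astype(np.uint8)

# Hypothetical stand-in for one 256x256 RGB image from the dataset.
original = np.random.default_rng(0).integers(0, 256, size=(256, 256, 3), dtype=np.uint8)
small = downscale_by_averaging(original, factor=4)
print(small.shape)  # (64, 64, 3)
```

Pixel values stay in the 0..255 range, in line with the decision above not to normalize.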

To run the .ipynb file, place it in the same folder as the "pokemon" folder, which contains the images.

Second Assignment

Model architecture

We built our model based on the DCGAN architecture. The model consists of a generator and a discriminator network. The generator takes a Gaussian noise vector as input and outputs a 64x64 RGB image. The discriminator's input is a 64x64 RGB image, and its output is the probability that the input is real.
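To make the 64x64 output size concrete, the sketch below works through the feature-map arithmetic of a DCGAN-style stack. The specific numbers (a 4x4 starting map, four layers, kernel 4, stride 2, padding 1) are assumptions typical of DCGAN examples, not values taken from our notebooks.

```python
def transposed_conv_output_size(size, kernel=4, stride=2, padding=1):
    """Side length after a transposed convolution (no dilation, no output padding)."""
    return (size - 1) * stride - 2 * padding + kernel

def conv_output_size(size, kernel=4, stride=2, padding=1):
    """Side length after an ordinary strided convolution."""
    return (size + 2 * padding - kernel) // stride + 1

def generator_sizes(start=4, layers=4):
    """Feature-map sizes as the generator upsamples the reshaped noise vector."""
    sizes = [start]
    for _ in range(layers):
        sizes.append(transposed_conv_output_size(sizes[-1]))
    return sizes

def discriminator_sizes(start=64, layers=4):
    """Feature-map sizes as the discriminator downsamples the input image."""
    sizes = [start]
    for _ in range(layers):
        sizes.append(conv_output_size(sizes[-1]))
    return sizes

print(generator_sizes())      # [4, 8, 16, 32, 64]
print(discriminator_sizes())  # [64, 32, 16, 8, 4]
```

With these assumed settings every generator layer doubles the side length and every discriminator layer halves it, so the two networks mirror each other.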

Training

The discriminator's loss function is the binary cross-entropy between its output probability and the true label (1 for real, 0 for generated). To train the generator, we feed the generator's output to the discriminator (we call the combined model the adversarial model) and backpropagate the resulting loss to the generator. Optionally, the discriminator's weights can be frozen in the adversarial model, so that only the generator's weights are updated. We trained the model both with frozen and with trainable discriminator weights. We tried two training strategies:

1. In each epoch we train both the discriminator and the adversarial model.
2. In each epoch we choose which model to train based on their accuracies: we train only the "weaker" model, so each model is trained until it performs as well as the other.
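The loss and the accuracy-based strategy described above can be sketched with NumPy as follows. All function names are hypothetical, the generator is assumed to be trained with the usual "label fakes as real" target, and the tie-breaking rule in model_to_train is an arbitrary choice for this example.

```python
import numpy as np

def binary_crossentropy(labels, probabilities, eps=1e-7):
    """Mean binary cross-entropy between 0/1 labels and predicted probabilities."""
    p = np.clip(probabilities, eps, 1.0 - eps)  # avoid log(0)
    return float(-np.mean(labels * np.log(p) + (1.0 - labels) * np.log(1.0 - p)))

def discriminator_loss(p_real, p_fake):
    """Real images carry label 1, generated images label 0."""
    labels = np.concatenate([np.ones_like(p_real), np.zeros_like(p_fake)])
    return binary_crossentropy(labels, np.concatenate([p_real, p_fake]))

def generator_loss(p_fake):
    """Through the adversarial model, the generator wants its fakes scored as real (1)."""
    return binary_crossentropy(np.ones_like(p_fake), p_fake)

def model_to_train(discriminator_accuracy, adversarial_accuracy):
    """Accuracy-based strategy: train only the currently weaker model."""
    if discriminator_accuracy <= adversarial_accuracy:
        return "discriminator"
    return "adversarial"

# Hypothetical discriminator outputs for two real and two generated images.
p_real = np.array([0.9, 0.8])
p_fake = np.array([0.2, 0.1])
print(discriminator_loss(p_real, p_fake))
print(generator_loss(p_fake))
print(model_to_train(discriminator_accuracy=0.55, adversarial_accuracy=0.70))
```

In this example the discriminator separates the two groups well, so its loss is low while the generator's loss is high. The last call picks the discriminator because its accuracy is the lower of the two.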

Testing Hyperparameters

We tried two optimizers: Adam and RMSprop. Our results can be found in the train1 results folder. Naming conventions:

no train - the discriminator's weights are fixed in the adversarial model.
train - the discriminator's weights are NOT fixed in the adversarial model.
separately - we used the second strategy during training.
GXM and DXM - G stands for the generator, D for the discriminator. XM means the model has X million weights.
the name of each image file specifies the number of training epochs.

Third (final) Assignment

New models

We tried several other approaches, such as WGAN, VAE and AAE. You can read more about them in the documentation PDF.

New data

To get enough input images, we augmented the images in the "pokemon_small" folder. The new images are in the "augmented" folder.
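The README does not list which augmentations were applied, so the following is only a sketch of the general idea under assumed transformations (mirroring and small horizontal shifts). The function name and the shift size are hypothetical.

```python
import numpy as np

def augment(image, max_shift=4):
    """Return simple variants of an H x W x C image: original, mirror, two shifts."""
    variants = [image, np.fliplr(image)]  # fliplr mirrors along the width axis
    for shift in (-max_shift, max_shift):
        # np.roll wraps pixels around the edge, which is only harmless
        # when the image border is a uniform background.
        variants.append(np.roll(image, shift, axis=1))
    return variants

# Hypothetical stand-in for one 64x64 RGB image from "pokemon_small".
image = np.random.default_rng(0).integers(0, 256, size=(64, 64, 3), dtype=np.uint8)
augmented = augment(image)
print(len(augmented))  # 4
```

Each source image yields four training images here. A real pipeline would typically add rotations or colour changes as well.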

Results

VAE/AAE:

The image names follow the pattern: [model name] [imagesize]x[imagesize] [epoch size]e [sep?] [aug/small], where aug/small tells whether we trained on the pokemon_small or the augmented data, and sep means we used the second strategy during AAE training.
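For scripting over the result images, a name following this pattern could be decoded roughly as shown below. The separator (space or underscore), the handling of a file extension and the example name "AAE 64x64 100e sep aug" are assumptions for illustration; check them against the actual files before relying on this.

```python
import re
from pathlib import Path

# Assumed layout: model, NxN size, epoch count + "e", optional "sep", then aug/small.
NAME_PATTERN = re.compile(
    r"(?P<model>[A-Za-z]+)[ _](?P<size>\d+)x(?P=size)[ _](?P<epochs>\d+)e"
    r"(?:[ _](?P<sep>sep))?[ _](?P<data>aug|small)"
)

def parse_result_name(name):
    """Split a result image name into its documented fields."""
    match = NAME_PATTERN.fullmatch(Path(name).stem)  # drop an extension such as .png
    if match is None:
        raise ValueError(f"unrecognised result name: {name!r}")
    return {
        "model": match["model"],
        "image_size": int(match["size"]),
        "epochs": int(match["epochs"]),
        "second_strategy": match["sep"] is not None,
        "dataset": "augmented" if match["data"] == "aug" else "pokemon_small",
    }

# Hypothetical file name, not taken from the repository.
print(parse_result_name("AAE 64x64 100e sep aug.png"))
```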

WGAN:

The images are in two folders according to the training data (one was produced with the basic dataset, the other with the augmented one). Within each folder they are in ascending order of epochs.

Running models

The pokemon_small or augmented folder should be placed next to the .ipynb files. Upload them to Google Colab to try the models!
