Implementation and Evaluation of Variational Autoencoders for High Resolution Medical Imaging

Over the last few years deep learning models have achieved great success to improve computer aided diagnosis. Most of the deep learning models are using supervised learning methods, which have the disadvantage of elaborate preprocessing. Therefore, this work is focusing on an unsupervised learning approach, namely, on variational autoencoders (VAE)s proposed by Kingma et al. [1]. VAEs are powerful generative models, which allow analyzing and disentanglement of the latent space for a given input. In this work the standard VAE model and four enhanced models are derived, applied and discussed on a high resolution knee dataset.

Implemented and Evaluted Models

Variational Autoencoder (VAE) [1]
Spatial Variational Autoencoder (SVAE) [2]
Variational Perceptual Generative Autoencoders (VPGA) [3]
Vector Quantized Variational Autoencoder (VQ-VAE) [4]
Introspective Variational Autoencoder (IntroVAE) [5]

Structure of this repository

The directory general contains different util methods for the implementation. In the model folder the five different models are implemented using the same architecture. The figures and plots for the thesis are created with the utillity files in the thesis_util directory.

Playing around with the laten space of IntroVAE

prerequisites: Python 3.7 installed

Install required python packages

$ pip install -r requirements.txt

Start JupyterLab

$ jupyter-lab

Navigate to Jupyter Notebook

models/IntroVae/IntroVAE_latent_space_interactive.ipynb

set the path of your project in the path2project variable

References

[1] Diederik P. Kingma and Max Welling. Auto-encoding variational bayes. URL https://arxiv.org/pdf/1312.6114.pdf.

[2] ZhengyangWang, Hao Yuan, and Shuiwang Ji. Spatial variational auto-encoding via matrix-variate normal distributions. URL http://arxiv.org/pdf/1705.06821v2.

[3] Zijun Zhang, Ruixiang Zhang, Zongpeng Li, Yoshua Bengio, and Liam Paull. Perceptual generative autoencoders. URL http://arxiv.org/pdf/1906.10335v1.

[4] Aaron van den Oord, Oriol Vinyals, and Koray Kavukcuoglu. Neural discrete representation learning, . URL http://arxiv.org/pdf/1711.00937v2.

[5] Huaibo Huang, Zhihang Li, Ran He, Zhenan Sun, and Tieniu Tan. Introvae: Introspective variational autoencoders for photographic image synthesis. URL http://arxiv.org/pdf/1807.06358v2.

duennbart / masterthesis_VAE

Implementation and Evaluation of Variational Autoencoders for High Resolution Medical Imaging

Implemented and Evaluted Models

Structure of this repository

Playing around with the laten space of IntroVAE

References

About

Languages