This repository contains code for feature dimensionality reduction using an autoencoder. It will be updated with other methods to encode the input, as well as code to train the autoencoder on textual datasets.
- Python 2.7
- TensorFlow 1.2.1
- NumPy
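For orientation, the dimensionality reduction here follows the standard autoencoder pattern: an encoder compresses the input into a low-dimensional latent code and a decoder reconstructs the input from that code. The sketch below is a minimal, hypothetical TensorFlow 1.x illustration of that idea; the layer sizes, names, and training loop are assumptions and do not mirror the repository's actual model code.

```python
import numpy as np
import tensorflow as tf

INPUT_DIM = 784    # e.g. flattened 28x28 images (assumption)
LATENT_DIM = 32    # size of the compressed representation (assumption)

x = tf.placeholder(tf.float32, [None, INPUT_DIM], name='input')

# Encoder: compress the input into a low-dimensional latent code.
enc_hidden = tf.layers.dense(x, 128, activation=tf.nn.relu)
latent = tf.layers.dense(enc_hidden, LATENT_DIM, activation=tf.nn.relu, name='latent')

# Decoder: reconstruct the input from the latent code.
dec_hidden = tf.layers.dense(latent, 128, activation=tf.nn.relu)
reconstruction = tf.layers.dense(dec_hidden, INPUT_DIM, activation=tf.nn.sigmoid)

# Train by minimizing the reconstruction error.
loss = tf.losses.mean_squared_error(labels=x, predictions=reconstruction)
train_op = tf.train.AdamOptimizer(1e-3).minimize(loss)

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    batch = np.random.rand(64, INPUT_DIM)   # placeholder data
    for step in range(100):
        _, batch_loss = sess.run([train_op, loss], feed_dict={x: batch})
    # `latent` is the reduced-dimensionality feature vector used downstream.
    codes = sess.run(latent, feed_dict={x: batch})
```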
- utility_dir: storage module for data, vocab files, saved models, TensorBoard logs, and outputs.
- implementation_module: code for the model architecture, data reader, and the training and test pipelines.
- settings_module: code to set directory paths (data path, vocab path, model path, etc.), model parameters (hidden dim, attention dim, regularization, dropout, etc.), and the vocab dictionary.
- run_module: wrapper code to execute the end-to-end train and test pipelines.
- visualization_module: code to generate embedding visualizations via TensorBoard (a hedged setup sketch follows this list).
- utility_code: other utility code.
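The visualization_module entry above refers to TensorBoard's embedding projector. The snippet below is a hedged sketch of the usual TensorFlow 1.x projector setup; the log directory, tensor name, and metadata file are illustrative assumptions, not the repository's actual code.

```python
import os
import numpy as np
import tensorflow as tf
from tensorflow.contrib.tensorboard.plugins import projector

LOG_DIR = 'utility_dir/log/EMB_VIZ'   # assumed path; point --logdir at it (see the EMB_VIZ command below)
if not os.path.exists(LOG_DIR):
    os.makedirs(LOG_DIR)

# Suppose `codes` holds the latent features produced by the encoder.
codes = np.random.rand(1000, 32).astype(np.float32)   # placeholder data
embedding_var = tf.Variable(codes, name='latent_codes')

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    tf.train.Saver([embedding_var]).save(sess, os.path.join(LOG_DIR, 'embeddings.ckpt'))

    # Tell TensorBoard which tensor to project and (optionally) where its labels live.
    writer = tf.summary.FileWriter(LOG_DIR)
    config = projector.ProjectorConfig()
    embedding = config.embeddings.add()
    embedding.tensor_name = embedding_var.name
    embedding.metadata_path = 'metadata.tsv'   # optional label file, assumed to exist
    projector.visualize_embeddings(writer, config)
```

After running a script like this, the embeddings appear under the Projector tab when TensorBoard is pointed at the log directory, as in the commands below.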
- train:
  python -m global_module.run_module.run_train
- test:
  python -m global_module.run_module.run_test
- visualize TensorBoard logs:
  tensorboard --logdir=PATH-TO-LOG-DIR
- visualize embeddings:
  tensorboard --logdir=PATH-TO-LOG-DIR/EMB_VIZ
To change the model parameters, go to set_params.py.
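The parameter names below are purely illustrative: they show the kind of values set_params.py controls (hidden dim, attention dim, regularization, dropout, etc.), but the actual attribute names and defaults in the repository may differ.

```python
# Hypothetical illustration only -- the real set_params.py may use different names.
class ModelParams(object):
    def __init__(self):
        self.hidden_dim = 128       # size of the encoder's hidden layer
        self.attention_dim = 64     # attention dimension
        self.latent_dim = 32        # size of the compressed representation
        self.reg_constant = 1e-4    # L2 regularization strength
        self.keep_prob = 0.8        # dropout keep probability
        self.learning_rate = 1e-3
        self.batch_size = 64
        self.max_epochs = 50
```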
Decoded and corresponding input images from the training set at different steps:
Decoded and corresponding input images from the validation set:
Decoded and corresponding input images from the test set:
t-SNE representation of the latent features of the test-set images
Visualization of points corresponding to the latent features of image '1' in the test set (t-SNE)
Visualization of points corresponding to the latent features of image '5' in the test set (t-SNE)
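For reference, t-SNE plots like the ones described above can be generated from saved latent features with scikit-learn and matplotlib (neither is listed in the requirements, so treat this as an optional, illustrative recipe; the file names are assumptions).

```python
import numpy as np
import matplotlib.pyplot as plt
from sklearn.manifold import TSNE

# Assumed inputs: latent features of the test set and their digit labels.
codes = np.load('latent_test.npy')    # shape (num_images, latent_dim) -- hypothetical file
labels = np.load('labels_test.npy')   # shape (num_images,)            -- hypothetical file

# Project the latent features to 2-D for plotting.
points = TSNE(n_components=2, random_state=0).fit_transform(codes)

plt.figure(figsize=(8, 8))
plt.scatter(points[:, 0], points[:, 1], c=labels, s=5)
plt.colorbar(label='digit')
plt.title('t-SNE of test-set latent features')
plt.savefig('tsne_test_set.png')
```

Points for a single digit (e.g. image '1' or '5') can be highlighted by masking, for example plotting only `points[labels == 1]`.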