German-English translator

This project aims to train a German-English translator based on the attentional neural model. The dataset for training is IWSLT German-English dataset.

Algorithm Design

Basically follow the framework given in the pseudocode, which can be divided into training and translation. Both parts include:

Encoding: Using mini batch in training process, do the padding and deal with the masks.
Decoding: Feed in an endsymbol and extract an initial state. Using this state and the encoding of source sentence to compute the first context in order to get the prediction. Then compute the loss for the first word.

How to run:

Start a p2.xlarge instance on aws.
Use AMI provided by course instructor.
GPU+ at least 11G memory
run "python MT_mini.py"

Estimated Time

Each epoch takes about 30 minutes.

unique24 / mt_assignment1

German-English translator

Algorithm Design

How to run:

Estimated Time

About

Languages