hash2430 / French_EN_NMT

French to English neural machine translation trained on multi30k dataset.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

1. Examples

Sentence
French un homme portant des lunettes et un costume pointe du doigt alors qu' il parle
English G.T. a man wearing glasses and a suit points the finger as he speaks
Translated a man wearing glasses and a costume reaches while talking .

2. Score

Model BLEU
Concat attention with bucket sampler 26.14
Dot attention with bucket sampler 26.65
Dot attention without bucket sampler 27.23

3. Implementation

  • Training set: Multi30k dataset (29,000 French and English sentences, each)
  • Memory efficiency: Bucket sampler implemented for similar sentence-length within batch
  • Word representation: nn.Embedding, not transferred from Word2vec or other representations
  • Seq2seq with attention:
    • GRU-based encoder, decoder for source and target language sequence, respectively.
    • Dot product & concat attention for aligning source language and target language.

4. Pretrained models

Model Pretrained model
Concat attention with bucket sampler pretrained_model
Dot attention with bucket sampler pretrained_model

5. TODO

  • Transferring word representation from Word2vec or Glove will improve the score.

About

French to English neural machine translation trained on multi30k dataset.


Languages

Language:Python 88.8%Language:Perl 9.6%Language:Shell 1.5%