This project is originally forked from https://github.com/chqiwang/transformer.
- Pre-processing script
Create a new config file.
cp config_template.yaml your_config.yaml
Configure train.src_path, train.dst_path, scr_vocab and dst_vocab in your_config.yaml. After that, run the following command to build the vocabulary files.
python vocab.py -c your_config.yaml
Edit src_vocab_size and dst_vocab_size in your_config.yaml according to the vocabulary files generated in previous step.
Run the following command to start training loops:
python train.py -c your_config.yaml