datnth1709 / VietAI-Machine-Translation

Final project in VietAI-NLP02 course

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

VietAI-Machine-Translation

Final project in VietAI-NLP02 course

Installation

  • Create a new conda environment
  • For Pytorch, please refer to PyTorch official tutorial to find a suitable cuda version
  • Run pip install -r requirements.txt

Quick Run

  • Open the experiment.ipynb notebook and run the training/evaluating cell with pre-defined parameters

Project Structure

  • data_processor.py: contain the code to preprocess data and load it into transformers dataset
  • inference.py: contain code to run inference
  • metrics.py: contain code to compute sacrebleu
  • models.py: contain code to create model and tokenizer
  • train.py: contain code to train
  • trainers.py: contain code to create Trainer

About

Final project in VietAI-NLP02 course


Languages

Language:Jupyter Notebook 97.6%Language:Python 2.4%