avinashvarna / sanskrit_nmt

Training NMT models for Sanskrit

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

sanskrit_nmt

Training NMT models for Sanskrit

Experiments with training various Neural Machine Translation (NMT) architectures for various Sanskrit NLP tasks (sandhi splitting, tagging, etc.).

Sandhi splitting using Transformer model

sandhi_split/transformer_small_vocab has scripts + output model for sandhi splitting using a transformer model provided by the OpenNMT-tf library. A small model and vocabulary were used to ensure a small model size and fast training. The results are quite good for a small model that can be trained in < 1hr on a good GPU. See the README in that directory for more details.

About

Training NMT models for Sanskrit

License:MIT License


Languages

Language:Python 69.2%Language:Shell 30.8%