manyoso / tibetan_corpus

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Homage

I pay homage to my Spiritual Heroes: 
The Virtuous Teachers,
Practicing the Excellent Path,
Praised by the Wise.

Tibetan Language Parallel Corpus

This project is focused on creating a high quality tibetan language parallel corpus for use in automatic computer assisted machine translation. The initial language focus will be translations from standard tibetan to standard english and wylie transliteration.

Scripts are written in Python to convert existing corpus documents into dictionary format and then into TMX (Translation Memory Exchange) format for input into statistical machine translation tools like Moses.

Dedication

May whatever merit 
Gained by this task 
Accrue to the cause
For the liberation of all.

About


Languages

Language:Python 100.0%