hadrianpaulo / ULMFiT-Filipino-Tagalog

Language Model for Filipino (Tagalog) Language

This project contains relevant files for creating a language model for the Filipino language from the Filipino (Tagalog) Wikipedia corpus, Wiktionary, and Wikibooks.

The language model weights and the itos (index -> string) mapping pickle file are available for download here.
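As a rough illustration of how an itos mapping is typically used, the sketch below loads a pickled itos list and derives the reverse stoi (string -> index) lookup. The function name `load_vocab`, the file name `itos_demo.pkl`, the toy vocabulary, and the choice of index 0 for unknown tokens are all assumptions made for this example; they are not taken from the project's files.

```python
import pickle
import tempfile
from collections import defaultdict
from pathlib import Path


def load_vocab(itos_path):
    """Load a pickled itos list and build the reverse stoi mapping."""
    with open(itos_path, "rb") as f:
        itos = pickle.load(f)
    # Assumption: index 0 is the unknown-token slot, so unseen tokens map to 0.
    stoi = defaultdict(int, {tok: idx for idx, tok in enumerate(itos)})
    return itos, stoi


# Toy vocabulary written to a temporary file, standing in for the real pickle.
demo_itos = ["_unk_", "_pad_", "ang", "ng", "sa"]
with tempfile.TemporaryDirectory() as tmp:
    demo_path = Path(tmp) / "itos_demo.pkl"  # hypothetical file name
    with open(demo_path, "wb") as f:
        pickle.dump(demo_itos, f)
    itos, stoi = load_vocab(demo_path)

# Numericalize a token sequence; "wala" is not in the toy vocabulary.
ids = [stoi[tok] for tok in ["ang", "sa", "wala"]]
```

With the real download, only the path passed to `load_vocab` would change, provided the pickle holds a plain list of tokens.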

The RNN was trained in 4 iterations with the following learning rates:

  1. 8
  2. 4
  3. 1.5
  4. 0.2

The learning rates were determined using fastai's plot_lr() method.

Performance:

Perplexity: 26.1997

Accuracy: 0.4403
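For readers comparing these numbers with a training log, perplexity is conventionally the exponential of the mean cross-entropy loss per token. The sketch below assumes that convention (natural-log loss) applies to the figure above, which the project does not state explicitly; the function names are illustrative.

```python
import math


def perplexity_from_loss(mean_cross_entropy):
    """Convert a mean per-token cross-entropy (in nats) to perplexity."""
    return math.exp(mean_cross_entropy)


def loss_from_perplexity(perplexity):
    """Inverse conversion: perplexity back to mean cross-entropy in nats."""
    return math.log(perplexity)


reported_perplexity = 26.1997
# Under the assumed convention, this is the validation loss implied by the
# reported perplexity (roughly 3.27 nats per token).
implied_loss = loss_from_perplexity(reported_perplexity)
```

Accuracy here is read as the fraction of next-token predictions that match the target, so 0.4403 would mean about 44% of tokens are predicted correctly.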

TODO:

About

License: Apache License 2.0

