leticiabeeck / language-classifier

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Language Classifier

Character Level Recurrent Neural Network to predict European languages.

Based on the PyTorch tutorial found here.

The model uses data from the European Parliament Proceedings Parallel Corpus. The corpora are segmented by word, and the model uses only one instance of each unique word, per language.

Current Languages included are:

  • Bulgarian
  • Czech
  • Dutch
  • English
  • French
  • German
  • Italian
  • Polish
  • Spanish
  • Swedish

About

License:MIT License


Languages

Language:Python 100.0%