niharikabalachandra / Language-Detection-MultinomialLogisticRegression

Language Detection using the European Parliament Proceedings Parallel Corpus. European Parliament Proceedings Parallel Corpus is a text dataset used for evaluating language detection engines. The 1.5GB corpus includes 21 languages spoken in EU. This project aims to build a machine learning model trained on this dataset to predict new unseen data.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

niharikabalachandra/Language-Detection-MultinomialLogisticRegression Issues

No issues in this repository yet.