There are 0 repository under language-classification topic.
:spider: The pipeline for the OSCAR corpus
An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.
A Language Classifier powered by Recurrent Neural Network implemented in Python without AI libraries. AI from scratch.
Hyperdimensional computing explained and demonstrated
Python Binding for Rust WhatLang, a language detection library
Classifier that identifies Greek text as Cypriot Greek or Standard Modern Greek
语言识别数据集的基本数据分析方法,包括SVM算法。
ISO 639 and IETF Language Code Lookup Tool
Plain swahili dastaset. Public sourced from public repositories
NLP projects.
A single-layer neural network written from scratch that predicts the language of the text.
Suite of Python modules to recognise the language of a file
This curated collection brings together a dataset of common Swahili stopwords gathered from various sources on the internet. Stopwords are words that are frequently used in a language but typically don't contribute significant meaning to a text.
Personalized voice assistant 'Alice' with Language classification, Speech Recognition, Machine Translation, Restoring Punctuation, Conversational and Speech Synthesis.
This program identify the input text language.
Build and analyzed a language classifying pipeline in order to gain insight and familiarity to typical Natural Language Processing (NLP) tools and strategies.
Classifying English, Slovak, Czech language using Naive Bayes
Detecting hate speech in tweets using bag-of-trick models and bi-LSTM networks.
Language Identification Techniques and Analysis (5 encoding X 17 models)
Using Decision Tree and AdaBoost to classify languages(English/Dutch)
Language Identifier model : takes a sentence as input and predict its language.
Code for the live site for LangSonic, a multilingual speech classification CNN.
Streamlit web app
Implementing a Naive Bayes Classifier for multiclass classification to identify language of a given text
Language classifier based on BERT to classify Aviation Safety Reporting System (ASRS) narratives into categories that were clustered by using language embedding similarity.
Detecting the location and native language of a place from an image
Click below to checkout the website
Classified sentences into one of Slovak, Czech, and English. Implemented relevant preprocessing steps, addressed the class imbalance in training set by employing the learned theory of Naive Bayes Models, and implementing subword units.