There are 2 repositories under the language-recognition topic.
ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.
The most accurate natural language detection library for Go, suitable for short text and mixed-language text
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
Natural language detection library for Rust. Try demo online: https://whatlang.org/
The most accurate natural language detection library for Rust, suitable for short text and mixed-language text
The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
Implementation of the paper "Spoken Language Recognition using X-vectors" in PyTorch
GlotLID: Language Identification with Support for More Than 2000 Labels -- EMNLP 2023
A TensorFlow-based spoken language identification system
Collection of self-supervised models for speaker and language recognition tasks.
📚 A collection of useful Natural Language Processing tools: detecting the language of a text, splitting text into sentences, extracting the main content from an HTML document
Dialect identification using Siamese network
Knife is a Java top-down parser generator for building parsers from grammars in BNF format.
Language identification using Siamese network based on i-vector
Multi-label MFoM Framework for Speech Articulatory Attributes Detection
The LALR parser generator (LPG) is a tool for developing scanners and parsers. Supports multiple languages. Input is specified by BNF rules. LPG supports backtracking (to resolve ambiguity), automatic AST generation and grammar inheritance.
👄 Fork of the language detector Lingua, with the intention to increase detection speed and reduce memory consumption
Simple, yet fast, Python scripts to read Kaldi NNet3 models and compute bottleneck features
Source code for: Efficient Self-supervised Learning Representations for Spoken Language Identification
The todo app Select20 leverages language recognition to manage tasks more efficiently. The distraction-free and blazing fast app supports offline usage and is compatible with CalDAV.
🎏🎌 language recognition script implemented using basic algorithms and spaghetti code
NASABot is integrated with the NASA API and LUIS (Language Recognition Service). It provides access to the latest NASA APIs (such as the Space Weather Database Of Notifications and other NASA services) using plain English and Natural User Flow.
A single-layer neural network written from scratch that predicts the language of a text.
The LALR parser generator (LPG) is a tool for developing scanners and parsers written in TypeScript, C#, Java, C++ or C. Input is specified by BNF rules. LPG supports backtracking (to resolve ambiguity), automatic AST generation and grammar inheritance.
Suite of Python modules to recognise the language of a file
Grammax is a Java bottom-up SLR/CLR parser generator that builds parsers from grammars in Backus-Naur form.
Streaming version of Linguakit, a multilingual toolkit for NLP
This project creates an automated YouTube video scraper using Airflow, Selenium, and the ytb-dlp library.
Natural language detection library for .NET, suitable for long and short text alike
Implementation of a pushdown automaton that recognizes strings belonging to the language of valid arithmetic expressions over floating-point numbers
This project focuses on translating the text in images using Pytesseract. The program successfully translates 4 images, which differ in language and source, into English. It can translate more than 50 languages using Pytesseract and Google Translate.
Implementation of a parser, a compiler and an interpreter for a programming language called “SimplanPlus” which is based on ANTLR.