There are 3 repositories under language-recognition topic.
ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
The most accurate natural language detection library for Go, suitable for short text and mixed-language text
Natural language detection library for Rust. Try demo online: https://whatlang.org/
The most accurate natural language detection library for Rust, suitable for short text and mixed-language text
The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch
A TensorFlow-based spoken language identification
Natural language detection library for .NET, suitable for long and short text alike
📚 Сборник полезных штук из Natural Language Processing: Определение языка текста, Разделение текста на предложения, Получение основного содержимого из html документа
Collection of self-supervised models for speaker and language recognition tasks.
Dialect identification using Siamese network
Knife is a Java top-down parser generator for building parsers from grammars in BNF format.
The LALR parser generator (LPG) is a tool for developing scanners and parsers. Supports multi-language . Input is specified by BNF rules. LPG supports backtracking (to resolve ambiguity), automatic AST generation and grammar inheritance.
Language identification using Siamese network based on i-vector
Multi-label MFoM Framework for Speech Articulatory Attributes Detection
👄 Fork of the language detector Lingua, with the intention to increase detection speed and reduce memory consumption
Source code for: Efficient Self-supervised Learning Representations for Spoken Language Identification
Simple, yet fast, Python scripts to read Kaldi NNet3 models and compute bottleneck features
The todo app Select20 leverages language recognition to manage tasks more efficiently. The distraction-free and blazing fast app supports offline usage and compatibility to CalDav.
The NASABot integrated with NASA API and LUIS (Language Recognition Service). It provides access to the latest NASA API (like Space Weather Database Of Notifications and other NASA services) using plain English and Natural User Flow.
A short demo of the Windows ELS API
🎏🎌 language recognition script implemented using basic algorithms and spaghetti code
This is a refined and re-implemented version of the archived plugin for ElasticSearch elasticsearch-langdetect, which itself builds upon the original work by Nakatani Shuyo, found at https://github.com/shuyo/language-detection. The aforementioned implementation by Nakatani Shuyo serves as the default language detection component within Apache Solr.
A single-layer neural network written from scratch that predicts the language of the text.
The LALR parser generator (LPG) is a tool for developing scanners and parsers written in TypeScript ,C#, Java, C++ or C. Input is specified by BNF rules. LPG supports backtracking (to resolve ambiguity), automatic AST generation and grammar inheritance.
Suite of Python modules to recognise the language of a file
Grammax is a Java & C++ bottom-up SLR/CLR parser generator that builds parsers from grammars in Backus-Naur-Form.
Simple and convenient yet powerful parsing lib. No Regexes, tree walkers, (E)BNF or books necessary! No separate lexer required.
This project is about creating an automated youtube videos scraper using Airflow, Selenium, ytb-dlp library.
Modify Markdown fenced code blocks to contain the language name by detecting it from the block contents.