There are 10 repositories under the text-normalization topic.
🧹 Python package for text cleaning
Chinese text normalization for speech processing
Japanese text normalizer for mecab-neologd
Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks
Myanmar Language Script Library
Demonstration of the results in "Text Normalization using Memory Augmented Neural Networks" by Subhojeet Pramanik and Aman Hussain
Code and model files for the paper: I. Lourentzou et al., "Adapting Sequence to Sequence models for Text Normalization in Social Media", ICWSM'19
This Python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batch filtering, and new datasets for Bengali-English machine translation". It is intended for normalizing and cleaning Bengali and English text.
Convert English text from written expressions into spoken forms
JS / Python3 / PHP library for working with UTF-8 polytonic Greek and Latin
PyTorch implementation for the Text Normalization Challenge
Proper categorization of e-commerce products enhances the user experience and achieves better results with external search engines. The objective of the project is to classify a product into four given categories, based on its description available on an e-commerce platform.
An online text normalization tool for Chinese-English mixed text-to-speech system
Useful String extensions to save you time in production.
Repository for text normalization research.
My work during internship at FPT.AI 2020
Our source code for the paper "Transformer-based Joint Learning Approach for Text Normalization in Vietnamese ASR"
An ASR recipe and speech corpus of Icelandic parliamentary speeches
Phonetic normalization using Recurrent Neural Networks
Implementation of text normalization for the Farsi (Persian) language.
Small Python wrapper class for the CAB webservice.
Implementation of the paper on Text normalization by Choudhury et al.
Library that supports converting numbers to Vietnamese for .NET C#.
A web app for Spam classification using Natural Language Processing.
Twitter Sentiment Analysis using Natural Language Processing (NLP)
Accurate categorization of eCommerce products improves user experience and boosts search engine visibility. The project goal is to classify products into 14 predefined categories using their descriptions sourced from an eCommerce platform.
Predict emotions (happiness, anger, sadness) from WhatsApp chat data using machine learning and deep learning models. Includes text normalization, vectorization (TF-IDF, BoW, Word2Vec, GloVe), and model evaluation.