There are 4 repositories under punctuation topic.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.
A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text
A sentence segmenter that actually works!
Punctuation restoration and spell correction experiments.
A TensorFlow implementation of Neural Sequence Labeling model, which is able to tackle sequence labeling tasks such as POS Tagging, Chunking, NER, Punctuation Restoration and etc.
Text normalization library for Python
Text and Punctuation correction with Deep Learning
äøęę ē¹ē¬¦å·ęØ”åļ¼åÆ仄ē»ęę¬ę·»å ę ē¹ē¬¦å·ć
Pre-process arabic text (remove diacritics, punctuations and repeating characters)
A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text without punctuation.
Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules for several languages and can be easily extended to suit other languages. It has been incorporated for tokenizing Dutch text in Frog, our Dutch morpho-syntactic processor. http://ilk.uvt.nl/ucto --
Apache OpenNLP wrapper for Nodejs
A small seq2seq punctuator tool based on DistilBERT
ŠŠµŠ¹ŃŠ¾Š½Š½Š°Ń ŃŠµŃŃ Š“Š»Ń Š²Š¾ŃŃŃŠ°Š½Š¾Š²Š»ŠµŠ½ŠøŃ ŠæŃŠ½ŠŗŃŃŠ°ŃŠøŠø Š½Š° ŃŃŃŃŠŗŠ¾Š¼ ŃŠ·ŃŠŗŠµ.
Sequence to sequence model for Arabic punctuation prediction.
#Sentimental Analytics
Regular expression for matching punctuation characters.
Regular Expressions for finding wrong punctuation before publishing.
A blazingly fast tool for converting to English punctuations
LinTO Platform punctuation service.
Created a Python library specifically for Traditional Chinese stopwords and punctuations removal
Pyspark WordCount
Armenian mnemonic keyboard layout
Russian mnemonic keyboard layout
š¤ Tiny & versatile š„ Node.js library for in-depth text analysis, manipulation and data extraction.
A curated list of awesome punctuator
Armenian Mnemonic R keyman keyboard layout
A small library for getting stats on punctuation in files. - Node Module
ā®Forced evolution for unicellular entitesā®
simple regex for correcting punctuations
A fast sentence/word tokenizer, and punctuation remover.
Remove Punctuation is a tool that help you to strip all punctuation marks and symbols from a text document or input string.
Remove punctuation characters from a string.