ngrams

There are 1 repository under ngrams topic.

neuspell / neuspell
NeuSpell: A Neural Spelling Correction Toolkit
spelling-correction spell-checkers spellcheck neural-models neural-spell-check spell-checker nlp spell-correction dataset spell-correction-datasets ngrams
Language:Python 697
bakwc / JamSpell
Modern spell checking library - accurate, fast, multi-language
spellcheck spellchecker ngrams nlp cpp python spelling-correction java ruby csharp
Language:C++ 652
thepanacealab / covid19_twitter
Covid-19 Twitter dataset for non-commercial research use and pre-processing scripts - under active development
tweets dataset retweets tweets-acquired frequent-terms twitter-stream dissemination ngrams
Language:Jupyter Notebook 480
bennyschmidt / next-token-prediction
Next-token prediction in JavaScript — build fast language and diffusion models.
ai autocomplete autocompletion diffusion-models language-models llm markov-chain next-token-prediction ngram-language-model ngrams pixel-prediction word-prediction embeddings vector-embeddings
Language:JavaScript 143
jermp / tongrams
A C++ library providing fast language model queries in compressed space.
trie elias-fano ngrams store-frequency-counts gram-counts-files language-model
Language:C++ 132
winkjs / wink-nlp-utils
NLP Functions for amplifying negations, managing elisions, creating ngrams, stems, phonetic codes to tokens and more.
tokenize stem ngrams bag-of-words phonetize stop-words sentence-boundary-detection nlp natural-language-processing
Language:JavaScript 132
landrok / language-detector
A fast and reliable PHP library for detecting languages
language-detector ngrams
Language:PHP 131
proycon / colibri-core
Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` whi ch allows you to build, view, manipulate and query pattern models.
c-plus-plus python nlp ngrams skipgram ngram corpus linguistics library text-processing computational-linguistics pattern-recognition
Language:C++ 129
orgtre / google-books-ngram-frequency
Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code
google language-learning linguistics ngrams wordlist
Language:Python 92
joshualoehr / ngram-language-model
Python implementation of an N-gram language model with Laplace smoothing and sentence generation.
ngram-language-model ngram perplexity nlp language-model python laplace-smoothing ngrams language-models smoothing-methods
Language:Python 88
shantanu1109 / Coursera-DeepLearning.AI-Natural-Language-Processing-Specialization
This Repository Contains Solution to the Assignments of the Natural Language Processing Specialization from Deeplearning.ai on Coursera Taught by Younes Bensouda Mourri, Łukasz Kaiser, Eddy Shyu
coursera natural-language-processing hashing knearest-neighbor-algorithm logistic-regression naive-bayes pca vector-spaces autocorrect bag-of-words cbow markov-chain minimum-edit-distance ngrams pos-tagging tokenization
Language:Jupyter Notebook 80
postmodern / raingrams
A flexible and general-purpose ngrams library written in Ruby. Raingrams supports ngram sizes greater than 1, text/non-text grams, multiple parsing styles and open/closed vocabulary models.
ruby ngrams
Language:Ruby 69
anfederico / poesy
Poetry generation via natural language markov models
poetry nlp markov modeling ngrams
Language:Python 54
starlordvk / Typing-Assistant
Typing Assistant provides the ability to autocomplete words and suggests predictions for the next word. This makes typing faster, more intelligent and reduces effort.
nlp javascript python ngrams autocompletion text-prediction prediction corpus typing-assistant keyboard ngram-model trigram-model bigrams natural-language-processing
Language:CSS 53
OpenPecha / pybo
🦜 NLP for Tibetan, in Python.
nlp computational-linguistics search ngrams language-models linguistics toolkit tibetan tibetan-nlp
Language:Python 36
susantabiswas / Word-Prediction-Ngram
Next Word Prediction using n-gram Probabilistic Model with various Smoothing Techniques
ngrams nlp trigrams quadgrams bigrams unigram language-model prediction-ngram ngram-probabilistic-model prediction knesser-ney-smoothing good-turing backoff interpolated-knesser-ney
Language:Jupyter Notebook 36
indranil143 / Mental-Health-Sentiment-Analysis-using-Deep-Learning
A deep learning project using fine-tuned RoBERTa to classify mental health sentiments from text, aiming to provide early insights and support. ⚕️❤️
deep-learning eda emotion-detection logistic-regression machine-learning mental-health mental-health-awareness mentalsupport ngrams nlp python pytorch roberta sentiment-analysis transformermodel nltk spacy l2-regularization sentiment-classification
Language:Jupyter Notebook 27
DasariJayanth / Malware-Detection-in-PE-files-using-Machine-Learning
Detecting Malware in PE files
machine-learning python-3 pefile malware-analysis malware-detection hybridstaticanalysis staticanalysis ngrams opcodengrams bytengrams opcode asm byte dll exe peheader classifier-model
Language:Jupyter Notebook 26
kampersanda / tongrams-rs
Rust library providing fast language model queries in compressed space
compression elias-fano language-model ngrams nlp trie
Language:Rust 25
madhurima-nath / nlp_fuzzy_match_algorithms
fuzzy-matching-algorithms levenshtein machine-learning natural-language-processing fuzzy-string-matching data-science nlp-machine-learning nlp algorithms bitap ngrams fuzzy-matching
Language:Jupyter Notebook 22
ngrams-dev / general
NGRAMS is a search engine for the Google Books Ngram Dataset. This repository contains documentation, discussions, announcements, and issues.
ngram-analysis ngram-model ngrams nlp natural-language-processing ngram
22
gustavecortal / natural-language-processing
Slides, exercises, and exams for my course "Natural Language Processing" (École Pour l'Informatique et les Techniques Avancées, 2024 and 2025)
feedforward-neural-networks logistic-regression naive-bayes naive-bayes-classifier ngram-language-model ngrams nlp recurrent-neural-networks slides tf-idf tokenization transformer tutorial vector-semantic-models word2vec
Language:Jupyter Notebook 19
slowikj / seqR
fast and comprehensive k-mer counting package
k-mer-counting kmer kmer-counting kmer-frequency-count kmers ngrams ngram rcpp rcppparallel rpackage bioinformatics bioinformatics-tool genomics protein-sequences hashing feature-engineering feature-extraction dna-processing k-mer hashing-algorithms
Language:C++ 19
ggerganov / ggwords
Generate language n-gram statistics
language ngrams statistics
Language:C++ 17
jermp / tongrams_estimation
A C++ library implementing fast language models estimation using the 1-Sort algorithm.
ngrams ngram-language-model
Language:C++ 17
kchapelier / ngram-word-generator
Word generation based on n-gram models, and a cli utility to generate said models.
javascript procedural-generation ngrams text
Language:JavaScript 17
dtinth / bangkokipsum
Random Thai text generator
netlify ngrams
Language:HTML 16
KhaledAshrafH / Auto-Filling-Text
This project is an auto-filling text program implemented in Python using N-gram models. The program suggests the next word based on the input given by the user. It utilizes N-gram models, specifically Trigrams and Bigrams, to generate predictions.
auto-complete auto-complete-text auto-filling bigram-model bigrams n-gram n-grams natural-language-processing news-articles ngram ngram-analysis ngram-language-model ngram-model ngrams nlp tkinter tkinter-gui trigram trigram-model trigrams
Language:Python 16
go-generalize / volcago
Model Generator for Firestore
go golang firestore firestore-database firebase generator code-generation n-grams ngrams
Language:Go 15
jaytimm / google-ngrams-and-r
An R-based guide to sampling Google n-gram data, building historical term-feature matrices & investigating lexical semantic change historically.
ngrams lexical-semantics word-embeddings vector-space-model
15
StephanGeorg / trigram-similarity
Determining the similarity of alphanumeric text based on trigram matching.
trigram trigrams similarity text-similarity postgres ngrams
Language:JavaScript 15
DanielJohnBenton / Ngrams.java
:cake: A library for creating n-grams, skip-grams, bag of words, bag of n-grams, bag of skip-grams.
java-library java n-grams skip-grams bagofwords bag-of-words ngrams ngram remove-duplicates duplicates-removed creating-ngrams
Language:Java 14
thuynh323 / Natural-language-processing
text mining, regex, N-grams, fuzzy matching
data-cleaning fuzzy-matching ngrams regex text-mining
Language:Jupyter Notebook 13
dayyass / language-modeling
Pipeline for training Language Models using PyTorch.
deep-learning natural-language-processing nlp language-modeling text-generation python pytorch decoding ngrams rnn sampling gpt-2 lstm
Language:Python 12
loginn / ngrams_graphs
ngram graphs library
graph ngram ngrams ngrams-graphs nlp
Language:Python 12
shenxiangzhuang / bleuscore
BLEU Score in Rust
bleu bleu-score deep-learning maturin ngrams nlp pyo3 python rust tokenizer
Language:Rust 11

ngrams

neuspell / neuspell

bakwc / JamSpell

thepanacealab / covid19_twitter

bennyschmidt / next-token-prediction

jermp / tongrams

winkjs / wink-nlp-utils

landrok / language-detector

proycon / colibri-core

orgtre / google-books-ngram-frequency

joshualoehr / ngram-language-model

shantanu1109 / Coursera-DeepLearning.AI-Natural-Language-Processing-Specialization

postmodern / raingrams

anfederico / poesy

starlordvk / Typing-Assistant

OpenPecha / pybo

susantabiswas / Word-Prediction-Ngram

indranil143 / Mental-Health-Sentiment-Analysis-using-Deep-Learning

DasariJayanth / Malware-Detection-in-PE-files-using-Machine-Learning

kampersanda / tongrams-rs

madhurima-nath / nlp_fuzzy_match_algorithms

ngrams-dev / general

gustavecortal / natural-language-processing

slowikj / seqR

ggerganov / ggwords

jermp / tongrams_estimation

kchapelier / ngram-word-generator

dtinth / bangkokipsum

KhaledAshrafH / Auto-Filling-Text

go-generalize / volcago

jaytimm / google-ngrams-and-r

StephanGeorg / trigram-similarity

DanielJohnBenton / Ngrams.java

thuynh323 / Natural-language-processing

dayyass / language-modeling

loginn / ngrams_graphs

shenxiangzhuang / bleuscore