Andreas Wagner's repositories
big-phoney
Get phonetic spellings and syllable counts for any english word. Works with made-up and non-dictionary words
browser-core
Cliqz features, shared across products including Cliqz browsers for Windows, Mac, Android and iOS
elasticsearch-record-linkage
ElasticSearch plugin to expose scoring metrics useful for record linkage and deduplication
grobid-quantities
GROBID extension for identifying and normalizing physical quantities.
Interactive-Dictionary
In this program, the user interacts with a dictionary. The user can input a word, part of speech, and filter the dictionary by part of speech. The Java program interacts with an enum to pull data from. There are still a bit of fixes to make, but the program overall works.
liblevenshtein-java
Various utilities regarding Levenshtein transducers. (Java)
query-suggestions
Produces meaningful completions for partial queries given by the user. Semester project for the course "Information Retrieval" at the University of Tübingen in the winter semester 2016/17.
Recommendation-System
Hybrid RecSys, CF-based RecSys, Model-based RecSys, Content-based RecSys, Finding similar items using Jaccard similarity
SCStemmers
A collection of stemmers for Serbian and Croatian
seldon-core
An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models
spaczz
Fuzzy matching and more functionality for spaCy.
spell-correction-gingerit-demo
Tutorial on creating a spelling correction Python application using Gingerit and Streamlit
tinspin-indexes
Spatial index library with R*Tree, STR-Tree, Quadtree, CritBit, KD-Tree, CoverTree
universal-recommender
Java™ Programming Language™ library for recommendation engine implementation and scientific evaluation (2009–2010)
wildcard-trie
String trie that supports wildcard search
words-grouping
tool for listing most common words from a file with given tolerance for each group (using Levenshtein distance)