Johannes's repositories
RussianTransliterator
Russian transliterator, focus is on learning to set the stress correctly.
command-line-args
A mature, feature-complete library to parse command-line options.
Configs
Config files for e.g. VIM, VSC, etc. https://realpython.com/vim-and-python-a-match-made-in-heaven/
dependency_parser
Assignment_2 from SNLP Course
got
A simple tool for golang unit tests
HCI
Assignments and Experiments from the Human-Computer-Interaction course.
I-O-B-Tokenizer
Tokenizer using: logistic regression, NNs
Information_Retrieval_Assignments
University Assignments for the course Information Retrieval
ISMLA_NLP_UIMA
Course: Industrial strength multi language analysis: segmentor, tokenizer and other nlp tasks for demonstration purposes of UIMA
Mercedes-Benz-Connected-Vehicle-Chatbot
Houndify based Chatbot with self-defined Merzedes-Benz Connected Vehicle usecases
MovieSubtitlesInvestigation
Quantitative measure of Chinese & Japanese similarity using parallel movie subtitle corpora (using uimaFit).
ngram_language_model
Assignment_1 from SNLP Course
order_creation_tool
Group project of "Introduction to Software Engineering" university class
Parrot_Paraphraser
A practical and feature-rich paraphrasing framework to augment human intents in text form to build robust NLU models for conversational engines. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.
Perceptron
This is a simple perceptron implementation based on literatur from Sebastian Raschka.
PRONCH
The idea is to compare product and brand names against taboo words and to detect critical similarities
rename_files
rename file collections
Shazamify
Setup description for the Shazamify Siri shortcut.
Sortings
Playing around with basic sorting algorithms in Go
SpellChecker
SpellChecker with GWT (3rd semester project)
Tokenizer4Chinese
A Greedy Tokenizer For Chinese using UIMA