Sebastian Strub's starred repositories
ChatterBot
ChatterBot is a machine learning, conversational dialog engine for creating chat bots
tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
scattertext
Beautiful visualizations of how language differs among document types.
longformer
Longformer: The Long-Document Transformer
bert-extractive-summarizer
Easy to use extractive text summarization with BERT
nlp-in-practice
Starter code to solve real-world text data problems. Includes: Gensim Word2Vec, phrase embeddings, text classification with logistic regression, word count with PySpark, simple text preprocessing, pre-trained embeddings and more.
clinicalBERT
Repository for Publicly Available Clinical BERT Embeddings
holmes-extractor
Information extraction from English and German texts based on predicate logic
text-summarizer
Understand text summarization and create your own summarizer in Python
nlp_profiler
A simple NLP library for profiling datasets with one or more text columns. Given a dataset and the name of a column containing text data, NLP Profiler returns either high-level insights or low-level, granular statistics about the text in that column.
python-knowledge-graph
A Python implementation of a basic Knowledge Graph
bitcoinVend
Offline bitcoin vending machine
Intelligent_Document_Finder
Document Search Engine Tool
Generate_True_or_False_OpenAI_GPT2_Sentence_BERT
Generate True or False questions from any content with OpenAI GPT2 text generation, Sentence-BERT semantic search and the Berkeley constituency parser.
KONVENS2019_and_LREC2020
Code for our GermEval@KONVENS 2019 and TRAC@LREC 2020 papers on Offensive Language Identification using BERT
eaternity-api
The repository for the Eaternity REST API Documentation.
lightning-rfc
Lightning Network Specifications
nlp-nonsense
Sentence-level nonsense detector