Sebastian Strub (Sebastenhauer)

Sebastenhauer

Geek Repo

Location:Zurich, Switzerland

Github PK Tool:Github PK Tool

Sebastian Strub's starred repositories

ChatterBot

ChatterBot is a machine learning, conversational dialog engine for creating chat bots

Language:PythonLicense:BSD-3-ClauseStargazers:14056Issues:545Issues:1646

tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Language:RustLicense:Apache-2.0Stargazers:8992Issues:119Issues:993

texthero

Text preprocessing, representation and visualization from zero to hero.

Language:PythonLicense:MITStargazers:2883Issues:42Issues:120

scattertext

Beautiful visualizations of how language differs among document types.

Language:PythonLicense:Apache-2.0Stargazers:2237Issues:55Issues:101

longformer

Longformer: The Long-Document Transformer

Language:PythonLicense:Apache-2.0Stargazers:2041Issues:42Issues:228

FARM

:house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

Language:PythonLicense:Apache-2.0Stargazers:1736Issues:53Issues:406

cryptos

Pure Python from-scratch zero-dependency implementation of Bitcoin for educational purposes

Language:Jupyter NotebookStargazers:1592Issues:38Issues:3

finBERT

Financial Sentiment Analysis with BERT

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1432Issues:35Issues:62

bert-extractive-summarizer

Easy to use extractive text summarization with BERT

Language:PythonLicense:MITStargazers:1393Issues:25Issues:111

nlp-in-practice

Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.

Language:Jupyter NotebookStargazers:1144Issues:51Issues:8
Language:PythonLicense:Apache-2.0Stargazers:748Issues:26Issues:15

clinicalBERT

repository for Publicly Available Clinical BERT Embeddings

Language:PythonLicense:MITStargazers:665Issues:25Issues:41

xtreme

XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 typologically diverse languages and includes nine tasks.

Language:PythonLicense:Apache-2.0Stargazers:631Issues:20Issues:69

holmes-extractor

Information extraction from English and German texts based on predicate logic

Language:PythonLicense:MITStargazers:387Issues:20Issues:9

text-summarizer

Understand Text Summarization and create your own summarizer in python

nlp_profiler

A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.

Language:PythonLicense:NOASSERTIONStargazers:243Issues:12Issues:25

LNPBPs

LNP/BP standards for bitcoin layer 2 & 3 protocols

python-knowledge-graph

A Python implementation of a basic Knowledge Graph

bitcoinVend

Offline bitcoin vending machine

Language:CLicense:MITStargazers:82Issues:6Issues:3

10kGNAD

Ten Thousand German News Articles Dataset for Topic Classification

Language:PythonLicense:MITStargazers:81Issues:2Issues:4

Intelligent_Document_Finder

Document Search Engine Tool

Language:PythonLicense:MITStargazers:71Issues:5Issues:3

lnbook

Mastering the Lightning Network (LN)

Language:ShellLicense:NOASSERTIONStargazers:53Issues:6Issues:0

Generate_True_or_False_OpenAI_GPT2_Sentence_BERT

Generate True or False questions from any content with OpenAI GPT2 text generation, Sentence-BERT semantic search and Berkley constituency parser.

Language:Jupyter NotebookStargazers:34Issues:5Issues:1
Language:Jupyter NotebookStargazers:11Issues:2Issues:0

KONVENS2019_and_LREC2020

Code for our GermEval@KONVENS 2019 and TRAC@LREC 2020 papers on Offensive Language Identification using BERT

Language:PythonLicense:Apache-2.0Stargazers:6Issues:0Issues:0

eaternity-api

The repository for the Eaternity REST API Documentation.

lightning-rfc

Lightning Network Specifications

nlp-nonsense

Sentence-level nonsense detector

Language:Jupyter NotebookStargazers:1Issues:1Issues:0