Wee Tee, Soh's starred repositories
timesearch
The subreddit archiver
KeyphraseVectorizers
Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a document-keyphrase matrix.
docker-selenium-lambda
The simplest demo of Chrome automation with Python and Selenium in AWS Lambda
python-telegram-bot
We have made you a wrapper you can't refuse
profanity-check
A fast, robust Python library to check for offensive language in strings.
wiki-detox
See https://meta.wikimedia.org/wiki/Research:Modeling_Talk_Page_Abuse
hatespeechdata
Catalog of abusive language data (PLoS 2020)
Abusive-Language-Detection-Categorization
Abusive Language Detection and Categorization
sota-extractor
The SOTA extractor pipeline
paperswithcode-data
The full dataset behind paperswithcode.com
AutoPhrase
AutoPhrase: Automated Phrase Mining from Massive Text Corpora
Stylized_Dialog
Code for the paper "Stylized Dialogue Response Generation Using Stylized Unpaired Texts"