Chan's repositories
BLM-emotions
Official code and data of the paper "An Analysis of Emotions and the Prominence of Positivity in #BlackLivesMatter Tweets"
glove-python
Toy Python implementation of http://www-nlp.stanford.edu/projects/glove/
automatic-emailing-md
Built to automatically send markdown file to email in HTML format
chan0park.github.io
Personal Webpage
hindi-tokenizer
This is a package in Python which implements a tokenizer, stemmer for Hindi language
LightGBM
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
long-summarization-1
Resources for the NAACL 2018 paper "A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents"
lowresource-nlp-bootcamp-2020
The website for the CMU Language Technologies Institute low resource NLP bootcamp 2020
mosesdecoder
Moses, the machine translation system
OpenNMT-py
Open Source Neural Machine Translation in PyTorch
seq2seq-con
Implementation of "Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs"
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
vaderSentiment
VADER Sentiment Analysis. VADER (Valence Aware Dictionary and sEntiment Reasoner) is a lexicon and rule-based sentiment analysis tool that is specifically attuned to sentiments expressed in social media, and works well on texts from other domains.
wikipron
Massively multilingual pronunciation mining
WiktionaryParser
A Python Wiktionary Parser