Constantine Lignos's repositories
Codeswitchador
Fast, simple identification of codeswitching in Tweets and other short messages.
nyt-corpus-reader
A parser and MongoDB backed store for searching the New York Times Annotated Corpus (LDC2008T19)
StateoftheUnion
A repository for teaching simple text analysis and web scraping using the SOTU address.
ArtificialLangLearning
Tools and data related to artificial language learning experiments.
mt-clir-emnlp-2019
Experiments for the EMNLP 2019 paper "The Challenges of Optimizing Machine Translation for Low Resource Cross-Language Information Retrieval"
WordSegmentation
Experiments in infant word segmentation.
DetectorMorse
Fast supervised sentence boundary detection using the averaged perceptron
nlp-in-ling
Natural Language Processing Research in North American Linguistics Departments
py-flac2mp3
flac2mp3 implementation using the Mutagen ID3 library: Can operate incrementally, converts album art.
python-sutime
Python wrapper for Stanford CoreNLP's SUTime
word2vec
This tool provides an efficient implementation of the continuous bag-of-words and skip-gram architectures for computing vector representations of words. These representations can be subsequently used in many natural language processing applications and for further research.