Sandeep Soni's repositories
mobility-books
Mobility of characters in fiction
comparing-word2vec-models
Code, data, and notes for tutorial on using word embeddings to find variation and change
geoSGLM
Code for learning geographically-informed word embeddings
Language:Java000
gt-nlp-class
Course materials for Georgia Tech CS 4650 and 7650, "Natural Language"
000
QTM340
Data and code to support "Practical Approaches to Data Science with Text" class (QTM 340, Fall 2023)
GPL-3.0000
Language:CSSGPL-3.0000
Language:Java000
whitespace-normalizer
Segment a non-standard token formed by dropping white space between two adjacent tokens in text.
Language:Python000