Devendra Singh Sachan's repositories
anchor-baggage
Experimentation code for the article "Building Topic Models Based on Anchor Words" based on the paper "Learning Topic Models: Going beyond SVD" by Sanjeev Arora, Rong Ge, and Ankur Moitra.
courserecommender
Course recommendation using k-means clustering.
gradient_optimizers
Python package that wraps gradient optimizers for Theano models.
hat-trie-1
HAT-Trie for Python
information-retrieval
Stanford CS276 Solutions
pybloomfiltermmap
Fast Python Bloom Filter using Mmap
sim-shootout
Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neighbours-intro
SimpleLDA-R
A simple R implementation of variational inference for LDA.
tan-clustering
Hierarchical word clustering, following "Brown clustering" (Brown et al., 1992)
tweetokenize
Tokenization and pre-processing for Twitter data used to train classifiers.
wikipedia_parser
Parses Wikipedia dumps and extracts links and page types.