mat kelcey's repositories

resemblance

trying shingling / resemblance / simhash / sketching to do some data deduping

Language:RubyLicense:MITStargazers:98Issues:10Issues:0

common-crawl

playing around with the common crawl dataset

Language:JavaStargazers:70Issues:9Issues:0

cartpoleplusplus

3d cartpole gym env using bullet physics trained from pixels with tensorflow LRPG, DDPG & NAF

Language:PythonLicense:MITStargazers:57Issues:2Issues:3

rnn_lm

various simple RNNs trained on synthetic grammars

Language:PythonStargazers:30Issues:3Issues:0

ros-mpu6050-node

raspberry pi c++ ROS mpu6050 IMU node

Language:C++License:MITStargazers:23Issues:4Issues:3

collocations

bigram / trigram analysis of wikipedia; mainly mutual info

Language:PythonStargazers:22Issues:3Issues:0

wikipediaPhilosophy

do all first links on wikipedia _really_ lead to philosophy?

common-crawl-quick-hacks

common crawl quick hack examples

Language:JavaStargazers:19Issues:5Issues:0

snli_nn

hacking on the stanford natural language inference (SNLI) corpus (in theano)

Language:PythonLicense:MITStargazers:17Issues:0Issues:0

snli_nn_tf

hacking on the stanford natural language inference (SNLI) corpus (tensorflow)

Language:PythonLicense:MITStargazers:10Issues:2Issues:1

yahoo_lda_utils

some simple pre/post processing utils for yahoo lda

Language:PythonStargazers:10Issues:4Issues:0

diy_twitter_client

a simple twitter client that learns what i like to read

Language:RubyStargazers:9Issues:0Issues:0

neural_prob_lang_model

hacky exploratory variants on NN language models

Language:PythonStargazers:9Issues:2Issues:0

trending

testing out some trending algorithms, mostly written in hadoop pig

Language:RubyStargazers:9Issues:2Issues:0

gardenhose-microslurp

bootstrap scripts for getting a micro ec2 instance piping gardenhose to s3

Language:ShellStargazers:7Issues:3Issues:0

ros-motorhat-node

raspberry pi c++ ROS wrapper for adafruit motorhat

Language:PythonLicense:MITStargazers:7Issues:3Issues:1

named-entity-extraction

proof of concept using NLTK for named entity extraction

Language:RubyStargazers:5Issues:0Issues:0

consistent_hash

afternoon hackery visualising the consistent hash method

Language:RubyStargazers:4Issues:2Issues:0

mislabelled-training-data

quick experiment on an approach to correct mislabelled data

Language:PythonStargazers:4Issues:2Issues:0

text-utils

simple c++ corpus munging apps

Language:C++Stargazers:3Issues:2Issues:0

connected-components

test of iterative python embedded pig

Language:RubyStargazers:2Issues:0Issues:0

poi-dups

near duplicate detection (with ngram frequency weighting)

Language:RubyStargazers:2Issues:2Issues:0

pseudocounts

comparing pseudocount methods

Language:CStargazers:2Issues:0Issues:0

tf-icf-experiment

code for tf/icf experiment (term freq, inverse corpus freq) a system for calculating tf/icf on a stream of data

Language:RubyStargazers:2Issues:2Issues:0
Language:PythonStargazers:1Issues:0Issues:0

random-projection

simple random projection experiments

Language:RubyStargazers:1Issues:2Issues:0

1e6_sentences

simple parsing / embedding utils for the 1e6 sentences dataset

Language:PythonStargazers:0Issues:0Issues:0

robot0

launch files for ros nodes on robot0

Stargazers:0Issues:2Issues:0

vw_experiments

vowpal experiments

Language:PythonStargazers:0Issues:2Issues:0