Justin Li's repositories
html-table-extractor
extract data from html table
django-script-runner
A simple django-based interactive website to execute any python scripts
quora-crawler
use selenium, beautiful soup and mongodb to crawl and store data from quora
address-matching
Python script for matching a list of messy addresses against a gazetteer using dedupe.
graph_algorithms
classical graph algorithms
graph_generator
implement some random graph generators with input parameters
data_analytics_project
Projects for Data Analytics Nanodegree on Udacity
datasciencecoursera
For testing
dedupe
:id: A python library for accurate and scaleable fuzzy matching, record deduplication and entity-resolution.
docker-elastalert
Docker image with Yelp's ElastAlert
elastalert
Easy & Flexible Alerting With ElasticSearch
ig-crawler
crawl the h1b update data from immigration girl
learning-spark
Example code from Learning Spark book
paasta
An open, distributed platform as a service
ProgrammingAssignment2
Repository for Programming Assignment 2 for R Programming on Coursera