Alex Rutherford's repositories
arabic_nlp
Tools to normalise and derive sentiment from Arabic text
ds_efficient
Set of scripts to efficiently parse large streaming DataSift corpora out of memory and incrementally
label_maker
Python script to make a nice legend using ImageMagick
synthetic_towers
Notebook describing a naive recursive algorithm to place cell towers according to input population map
unthesaurus_scraping
Scripts to scrape hierarchical taxonomy from UNBIS Thesaurus
youtube_scraping_v3
Scripts to grab vides, meta-data and comments matching a keyword query
caida
Analysis of CAIDA AS data
geolocation
Scripts to grab, parse and store database of place names and set up API for querying
api-samples
Code samples for YouTube APIs, including the YouTube Data API, YouTube Analytics API, and YouTube Live Streaming API. The repo contains language-specific directories that contain the samples.
arabic_lang_classifier
Project to collect training Arabic and Farsi corpora and learn classification based on 3 character sequences
constitution_site
Microsite for constitutions analysis
country_programme_documents
UNICEF Country Program Documents
evo_dynamics
Evolutionary Dyamics Tutorial
facebook_page_scraping
Notebook to robustly query public Facebook pages based on keywords
hausa_detection
A IPython notebook to grab content in the Hausa language (dominant in Nigeria) from the BBC Hausa account using the Twitter API. This in turn can be used for language detection based on the distribution of n-character sequences
jax_explorations
Playing with JAX for end to end differentiable sims
liben_nowell
C++ routines to calculate rank-based probability distribution and to load into MySQL database
nairaland_scraping
Scripts to scrape Nairalanf content
name_gender_scraping
Notebook to scrape Indian names and genders
plagiarism
Fun web app to spoof plagiarism detectors
voicesofyouth
Data from Voices of Youth blog for LWT hackathon New York October 2015
weather_ingestion
Scripts to ingest open weather map data and store in MongoDB
youtube_monitor
Proto-type web app in Python to search and monitor topics of interest, based on keywords, on YouTube