Andra's repositories
hdfs-spark-hive-dev-setup
This repository contains makescript and instruction on how to setup local hdfs+spark+hive setup.
blog-old
A Jekyll blog theme with just the right amount of style
blog1
:muscle: My Stack Problems - Jekyll Theme
ckan
CKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN makes it easy to publish, share and use data. It powers datahub.io, catalog.data.gov and data.gov.uk among many other sites.
datanesia.github.io
Simple pages for Datanesia project
dataproc-initialization-actions
Run in all nodes of your cluster before the cluster starts - let's you customize your cluster
docker-hadoop-spark-workbench
[EXPERIMENTAL] This repo includes deployment instructions for running HDFS/Spark inside docker containers. Also includes spark-notebook and HDFS FileBrowser.
dockerfile-dit4c-container-openrefine
DIT4C container for OpenRefine
GetOldTweets-java
A project written in Java to get old tweets, it bypass some limitations of Twitter Official API.
git-novice
Software Carpentry introduction to Git for novices.
Guardian-comment-scraper
Scrapes comments from guardian articles and outputs them to JSON or CSV
malaysian_parliament_hansard_url
Malaysian Parliament Hansard URL
play-bootstrap
A Play Framework library for Bootstrap
resbaz-cookbook
ResBaz Cookbook
schoolofdata-ext
School of Data extensions
scribble
A really clean and minimal Jekyll theme. ♥
spreadsheet-ecology-lesson
This repository contains Data Carpentry lessons on using spreadsheets for data wrangling.
tes_scraper_house
house
workshop-resbaz-d3-20150818
@isakiko workshop d3
world-universities-csv
List of all universities in the world in CSV