Jose's repositories
louvain-modularity
A GraphX implementation of Louvain method for community detection. This project also showcases the fact that you don't need to setup a cluster to run spark jobs.
spark-zeppelin
A very lightweight box with spark and zeppelin.
DSbox
Data Science box: Spark, Jupyter, R+RStudio, Zeppelin, Python 2 & 3, Java, Scala.
ambari-flink-service
Ambari service for Apache Flink
JART
Just Another Recommendation Tool
Titanic
Estadísiticos del Titanic
datasets
Handy datasets
mbitschool-bigdata-neo4j
Demos y fuentes del módulo de Neo4j en Big Data
National-Emissions-Inventory
National Emissions Inventory fine particulate matter pollution in the United states over the 10-year period 1999–2008
courses
Course materials for the Data Science Specialization: https://www.coursera.org/specialization/jhudatascience/1
s3-bucket-list
List a S3 bucket contents à la Apache directory listing (kinda)
Human-Activity-Recognition-Using-Smartphones
Human Activity Recognition Using Smartphones Data Set
ProgrammingAssignment2
Repository for Programming Assignment 2 for R Programming on Coursera
ExData_Plotting1
Plotting Assignment 1 for Exploratory Data Analysis