obins's repositories
awesome-datascience
:memo: An awesome Data Science repository to learn and apply for real world problems.
deeppy
Deep learning in Python
Data-Science-45min-Intros
Ipython notebook presentations for getting starting with basic programming, statistics and machine learning techniques
aas
Code to accompany Advanced Analytics with Spark from O'Reilly Media
spark-kernel
A kernel that enables applications to interact with Apache Spark.
DataAnalytics
A repository with different graph processing tehnologies
parallel_ml_tutorial
Tutorial on scikit-learn and IPython for parallel machine learning
vagrant-projects
Vagrant projects for various use-cases
awesome-scala
A community driven list of useful Scala libraries, frameworks and software.
DataScienceResources
Open Source Data Science Resources.
scalacaster
Purely Functional Algorithms and Data Structures in Scala
data-science-from-scratch
code for Data Science From Scratch book
learning-spark
Example code from Learning Spark book
vagrant-spark-zeppelin
Vagrant, Apache Spark and Apache Zeppelin VM for teaching
Sparkaggle_1
This is a group project for people who have taken edX cs190x Scalable Machine Learning with Spark to work together on a Kaggle challenge in Spark
ipython-notebooks
A collection of IPython notebooks covering various topics.
free-data-science-books
Free resources for learning data science
reference-apps
Spark reference applications
BerkeleyX-CS100.1x-Big-Data-with-Apache-Spark
This repository contains code files specifically IPython notebooks for the assignments in the course "Introduction to Big Data with Apache Spark" by UC Berkeley and Databricks on edX
LearnDataScience
Open Content for self-directed learning in data science
mooc-setup
Information for setting up for the BerkeleyX Spark Intro MOOC, and lab assignments for the course
spark
Mirror of Apache Spark
spark-csv
CSV data source for Spark SQL and DataFrames
pyspark-csv
An external PySpark module that works like R's read.csv or Panda's read_csv, with automatic type inference and null value handling. Parses csv data into SchemaRDD. No installation required, simply include pyspark_csv.py via SparkContext.
spark-ml-streaming
Visualize streaming machine learning in Spark
SparkStreamingApps
A spark sbt blueprint to build your own spark apps off of.
LearningScalaMaterials
Supplementary materials for the "Learning Scala" book from O'Reilly Media
DataScienceSpCourseNotes
Compiled Notes for all 9 courses in the Coursera Data Science Specialization