Qian Xie's repositories
DataScienceEngineeringApacheSpark
Data Science and Engineering with Apache Spark
Notebook-TeachingTips
A Place for Posting Resources for Teachers, TAs and Students in courses using Jupyter Notebooks
drunken-data-quality
Spark package for checking data quality
fullstackpython.com
Full Stack Python source with Pelican, Bootstrap and Markdown.
hackondata
Toronto Apache Spark HackOn(Data) 1st Place Winner
imbalanced-learn
Python module to perform under-sampling and over-sampling with various techniques.
ipython-notebooks
A collection of IPython notebooks covering various topics.
jupyter-presentation-template
Cloud Native Presentation Slides with Jupyter Notebook + Reveal.js
LectureNotes
Lecture content for UW Software Engineering for Data Scientists
nyc-taxi-data
Import public NYC taxi and Uber trip data into PostgreSQL / PostGIS database, analyze with R
pandas-profiling
Create HTML profiling reports from pandas DataFrame objects
py-viz-blog
Code for Pythonic visualization blog post
PySpark-Boilerplate
A boilerplate for writing PySpark Jobs
pyspark-jupyter-cdh
PySpark Jupyter Notebook on Cloudera CDH
scala_school
Lessons in the Fundamentals of Scala
scalable-data-science
Course in scalable data science using Apache Spark over Databricks.
scientific_python_cheat_sheet
Simple overview of Python, NumPy, SciPy, and Matplotlib functions that are useful for scientific work
solid-jekyll
A Jekyll port of the Solid theme (by blacktie.co).
spark-df-profiling
Create HTML profiling reports from Apache Spark DataFrames
spark-etl
Apache Spark based ETL Engine
spark-etl-demo
Demo of an ETL Spark Job
twitter-sentiment-analysis
Streaming tweets with Spark, language detection and sentiment analysis, dashboard with Kibana
Udacity_Data_Wrangling_With_MongoDB
Content and my work for Udacity course Data Wrangling with MongoDB
Udacity_fullstack
Course materials and my work for Udacity fullstack nanodegree
Udacity_Intro_to_Data_Analysis
Content and my work for Udacity course Intro to Data Analysis
uwseds.github.io
UW Software Engineering for Data Science Website