Qian Xie's repositories

DataScienceEngineeringApacheSpark

Data Science and Engineering with Apache Spark

Language: HTML, Stargazers: 1, Issues: 0, Issues: 0

Notebook-TeachingTips

A Place for Posting Resources for Teachers, TAs and Students in courses using Jupyter Notebooks

Language: Jupyter Notebook, Stargazers: 1, Issues: 0, Issues: 0

adv-r

Advanced R programming: a book

Language: TeX, Stargazers: 0, Issues: 2, Issues: 0
Language: Scala, Stargazers: 0, Issues: 0, Issues: 0

drunken-data-quality

Spark package for checking data quality

Language: Scala, License: Apache-2.0, Stargazers: 0, Issues: 2, Issues: 0

fullstackpython.com

Full Stack Python source with Pelican, Bootstrap and Markdown.

Language: HTML, License: MIT, Stargazers: 0, Issues: 2, Issues: 0

hackondata

Toronto Apache Spark HackOn(Data) 1st Place Winner

Language: Jupyter Notebook, Stargazers: 0, Issues: 0, Issues: 0

imbalanced-learn

Python module to perform under-sampling and over-sampling with various techniques.

Language: Python, License: NOASSERTION, Stargazers: 0, Issues: 2, Issues: 0

ipython-notebooks

A collection of IPython notebooks covering various topics.

Language: Jupyter Notebook, Stargazers: 0, Issues: 0, Issues: 0

jupyter-presentation-template

Cloud Native Presentation Slides with Jupyter Notebook + Reveal.js

Language: HTML, License: MIT, Stargazers: 0, Issues: 2, Issues: 0

LectureNotes

Lecture content for UW Software Engineering for Data Scientists

Language: Jupyter Notebook, License: BSD-2-Clause, Stargazers: 0, Issues: 0, Issues: 0

MySQL

Lab Notebooks for Coursera course Manage Big Data with MySQL

Language: Jupyter Notebook, Stargazers: 0, Issues: 2, Issues: 0

nyc-taxi-data

Import public NYC taxi and Uber trip data into PostgreSQL / PostGIS database, analyze with R

Language: R, Stargazers: 0, Issues: 0, Issues: 0

pandas-profiling

Create HTML profiling reports from pandas DataFrame objects

Language: Python, License: MIT, Stargazers: 0, Issues: 2, Issues: 0

py-viz-blog

Code for Pythonic visualization blog post

Language: HTML, Stargazers: 0, Issues: 0, Issues: 0

PySpark-Boilerplate

A boilerplate for writing PySpark Jobs

Language: Python, Stargazers: 0, Issues: 2, Issues: 0

pyspark-jupyter-cdh

PySpark Jupyter Notebook on Cloudera CDH

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 2, Issues: 0

scala_school

Lessons in the Fundamentals of Scala

Language: HTML, Stargazers: 0, Issues: 2, Issues: 0

scalable-data-science

Course in scalable data science using Apache Spark over Databricks.

Language: HTML, License: Unlicense, Stargazers: 0, Issues: 2, Issues: 0

scientific_python_cheat_sheet

Simple overview of Python, NumPy, SciPy, and Matplotlib functions that are useful for scientific work

Language: HTML, License: CC-BY-4.0, Stargazers: 0, Issues: 0, Issues: 0

solid-jekyll

A Jekyll port of the Solid theme (by blacktie.co).

Language: CSS, Stargazers: 0, Issues: 0, Issues: 0

spark-dev

Apache Spark development

Language: Scala, License: GPL-3.0, Stargazers: 0, Issues: 2, Issues: 0

spark-df-profiling

Create HTML profiling reports from Apache Spark DataFrames

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

spark-etl

Apache Spark based ETL Engine

Language: Scala, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

spark-etl-demo

Demo of an ETL Spark Job

Language: Scala, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

twitter-sentiment-analysis

Streaming tweets with Spark, language detection and sentiment analysis, dashboard with Kibana

Language: Scala, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

Udacity_Data_Wrangling_With_MongoDB

Content and my work for Udacity course Data Wrangling with MongoDB

Language: Jupyter Notebook, Stargazers: 0, Issues: 0, Issues: 0

Udacity_fullstack

Course materials and my work for Udacity fullstack nanodegree

Language: Python, Stargazers: 0, Issues: 0, Issues: 0

Udacity_Intro_to_Data_Analysis

Content and my work for Udacity course Intro to Data Analysis

Language: Jupyter Notebook, Stargazers: 0, Issues: 0, Issues: 0

uwseds.github.io

UW Software Engineering for Data Science Website

Language: CSS, Stargazers: 0, Issues: 0, Issues: 0