Qian Xie's repositories

albedo

A recommender system for discovering GitHub repos, built with Apache Spark

Language:ScalaLicense:MITStargazers:0Issues:2Issues:0
Language:HTMLStargazers:0Issues:2Issues:0

databricks-training-data-science-spark

TRAINING: DATA SCIENCE WITH APACHE SPARK 2.X

Language:HTMLStargazers:0Issues:2Issues:0

databricks-training-deeplearning-spark

Deep Learning, Keras, Tensorflow & Spark Training by Databricks

Language:HTMLStargazers:0Issues:2Issues:0

databricks-training-spark-tuning

TRAINING: APACHE SPARK TUNING AND BEST PRACTICES

Language:HTMLStargazers:0Issues:2Issues:0

datasets

CSV datasets used in Plotly API examples

Stargazers:0Issues:2Issues:0

drunken-data-quality

Spark package for checking data quality

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:2Issues:0
Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:2Issues:0

git-flight-rules

Flight rules for git

License:CC-BY-SA-4.0Stargazers:0Issues:2Issues:0

imbalanced-learn

Python module to perform under sampling and over sampling with various techniques.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:2Issues:0

mxnet-the-straight-dope

An interactive book on deep learning. Much easy, so MXNet. Wow.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:2Issues:0

MySQL

Lab Notebooks for Coursera course Manage Big Data with MySQL

Language:Jupyter NotebookStargazers:0Issues:2Issues:0

pandas-profiling

Create HTML profiling reports from pandas DataFrame objects

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

PySpark-Boilerplate

A boilerplate for writing PySpark Jobs

Language:PythonStargazers:0Issues:2Issues:0

pyspark-example-project

Example project and best practices for Python-based Spark ETL jobs and applications.

Language:PythonStargazers:0Issues:2Issues:0

pyspark-jupyter-cdh

Pyspark Jupyter Notebook on Cloudera CDH

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

pyspark-pictures

Learn the pyspark API through pictures and simple examples

Language:Jupyter NotebookLicense:MITStargazers:0Issues:2Issues:0

pysparkling

A pure Python implementation of Apache Spark's RDD and DStream interfaces.

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

pytest-spark

pytest plugin to run the tests with support of pyspark

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

python-project-template

A template Python project with a focus on best practices.

Language:SmartyLicense:NOASSERTIONStargazers:0Issues:2Issues:0

scala_school

Lessons in the Fundamentals of Scala

Language:HTMLStargazers:0Issues:2Issues:0

scalable-data-science

Course in scalabe data science using Apache Spark over Databricks.

Language:HTMLLicense:UnlicenseStargazers:0Issues:2Issues:0

shablona

A template for small scientific python projects

Language:Jupyter NotebookLicense:MITStargazers:0Issues:2Issues:0

spark-config-and-tuning

spark性能调优总结 spark config and tuning

Stargazers:0Issues:2Issues:0

spark-dev

Apache Spark development

Language:ScalaLicense:GPL-3.0Stargazers:0Issues:2Issues:0
Stargazers:0Issues:2Issues:0

spark2-etl-examples

A project with examples of using few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0

Language:ScalaStargazers:0Issues:0Issues:0

spark_training

Sample Spark Code

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:2Issues:0

workshops

All materials for workshops - HackOn(Data) - Toronto

License:Apache-2.0Stargazers:0Issues:2Issues:0

xmas-tweets

An Apache Spark case study: Gathering Tweets about Christmas with Apache Spark Streaming. Sentiment Analysis with Spark Core NLP.

Language:ScalaStargazers:0Issues:2Issues:0