Qian Xie's starred repositories
datashader
Quickly and accurately render even the largest data.
advanced-swc
Intermediate and Advanced Software Carpentry tutorial material
pyspark.test
Example unit tests for Apache Spark Python scripts using the py.test framework
awesome-python
An opinionated list of awesome Python frameworks, libraries, software and resources.
interactive-coding-challenges
120+ interactive Python coding interview challenges (algorithms and data structures). Includes Anki flashcards.
datasciencetoolbox
A complete environment for busy polyglot data scientists
Learning-Python-Application-Development
Code repository for Learning Python Application Development, published by Packt
pysparkling
A pure Python implementation of Apache Spark's RDD and DStream interfaces.
PySpark-Boilerplate
A boilerplate for writing PySpark Jobs
pyspark-testing
Unit and integration testing with PySpark can be tough to figure out, let's make that easier.
hypertools-paper-notebooks
Supporting notebooks and data from hypertools paper
post--misread-tsne
How to Use t-SNE Effectively
hypertools
A Python toolbox for gaining geometric insights into high-dimensional data
effectivescala
Twitter's Effective Scala Guide
52-technologies-in-2016
Let's learn a new technology every week. A new technology blog every Sunday in 2016.
spark-infotheoretic-feature-selection
This package contains a generic implementation of greedy Information Theoretic Feature Selection (FS) methods. The implementation is based on the common theoretic framework presented by Gavin Brown. Implementations of mRMR, InfoGain, JMI and other commonly used FS filters are provided.
anaconda-project
Tool for encapsulating, running, and reproducing data science projects
dashboards
Responsive dashboard templates 📊✨
Talk_Demystifying_Machine_Learning
Pycon.co lightning talk "Demystifying Machine Learning"
CostSensitiveClassification
CostSensitiveClassification Library in Python