lilorck's starred repositories

spark-programming-guide-zh-cn

Spark Programming Guide, Simplified Chinese edition

License: NOASSERTION | Stargazers: 187 | Issues: 0

spark

Apache Spark - A unified analytics engine for large-scale data processing

Language: Scala | License: Apache-2.0 | Stargazers: 38948 | Issues: 0

data_processing_course

Some class materials for a data processing course using PySpark

Language: Python | License: NOASSERTION | Stargazers: 52 | Issues: 0

spark_python_ml_examples

Spark 2.0 Python Machine Learning examples

Language: Python | Stargazers: 95 | Issues: 0

Spark-ML-Intro

PySpark Machine Learning Examples

Stargazers: 44 | Issues: 0

spark-exercises

Coding exercises for Apache Spark

Language: Python | License: NOASSERTION | Stargazers: 103 | Issues: 0

Spark-practice

Apache Spark (PySpark) Practice on Real Data

Language: Jupyter Notebook | Stargazers: 271 | Issues: 0

pandas

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

Language: Python | License: BSD-3-Clause | Stargazers: 42725 | Issues: 0

sparklingpandas

Sparkling Pandas

Language: Python | License: Apache-2.0 | Stargazers: 364 | Issues: 0

pyspark-tutorial

PySpark-Tutorial provides basic algorithms using PySpark

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 1145 | Issues: 0

sparkit-learn

PySpark + Scikit-learn = Sparkit-learn

Language: Python | License: Apache-2.0 | Stargazers: 1153 | Issues: 0

PyHive

Python interface to Hive and Presto. 🐝

Language: Python | License: NOASSERTION | Stargazers: 1668 | Issues: 0

matplotlib

matplotlib: plotting with Python

Language: Python | Stargazers: 19681 | Issues: 0

scikit-learn

scikit-learn: machine learning in Python

Language: Python | License: BSD-3-Clause | Stargazers: 58922 | Issues: 0