edgardeng / python-data-science-days

Days for Python Study in Data Science

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

python-data-science-days

Days for Python Study in Data Science

Menu

  1. python basic data type

  2. python general function

  3. python list & array

  4. python i/o

  5. Numpy Array

  6. Numpy Array Function

  7. Numpy Array Aggregate

  8. Numpy Array Broadcast

  9. Numpy Comparison

  10. Numpy Fancy Index

  11. Numpy Sort

  12. Numpy Structured Array

  13. Matplotlib Introduction

  14. Matplotlib Chart

  15. Introduction to Pandas

  16. Pandas Data Selection

  17. Pandas Data Operation

  18. Pandas Aggregation

  19. Pandas I/O

  20. Pandas String

  21. Pandas Time Series

  22. High-Performance Pandas: eval() and query()

  23. Matplot More

  24. Matplot Custom

TODO

  • NumPy

  • Pandas

How to Build GitBook

## install gitbook
npm install gitbook -g  
npm install gitbook-cli -g 
gitbook --version

## build boook
gitbook build
gitbook pdf ./ ./a.pdf  # 生成 pdf

## build a serve
gitbook serve

Others

Reference

Algorithms

  • Data Science Algorithms: algorithms such as Linear Regression, Logistic Regression, K-Mean Clustering, Random Forest.

  • Statistics and ML: tutorials to python pandas, machine learning algorithms, statistics and data visualization

Machine Learning

  • Scikit Learn: Scikit learn is a python library for machine learning. ( Hint: Dig Deeper )

  • Awesome Machine Learning: tutorials, resources, guides for machine learning, data analysis, natural language processing, data visualization in all the programming languages like Python, R, Java, Go, C++, Swift. Choose accordingly.

  • Complete Machine Learning: collection of tutorials and examples for solving problems using machine learning. It consist of beginning to end steps of ML covering stages such as model evaluation, implementation of ML algorithms, data visualization etc.

  • Parallel Machine Learning: This tutorial is on using scikit learn and ipython for parallel machine learning. Here you’ll find a 2 hours long video from Pycon 2013 with lecture notes and other useful resources.

  • Machine Learning Courses: a list of Best Machine Learning Courses in the world.

Deep Learning

  • Caffe: a deep learning framework made with expression, speed, and modularity in mind.

  • Awesome Deep Learning: tutorials on Deep Learning which includes deep learning courses, free books, videos and lectures, papers

  • Deep Learning in Python: tutorial on implementation of Deep Learning in Python

  • Recurrent Neural Networks: dedicated resources for RNN. codes, lectures, books and resources on multiple applications of RNN.

Data Visualization

  • matplotlib: plotting with Python

  • seaborn: a library for making statistical graphics in Python. It is built on top of matplotlib and closely integrated with pandas data structures.

  • Bokeh: JavaScript visualization library with a Python frontend that creates highly interactive visualizations capable of handling very large and/or streaming datasets.

  • Plotly is the eponymous open source product of the Plotly company, and is similar in spirit to Bokeh. Because Plotly is the main product of a startup, it is receiving a high level of development effort. Use of the library is entirely free.

  • Vispy is an actively developed project focused on dynamic visualizations of very large datasets. Because it is built to target OpenGL and make use of efficient graphics processors in your computer, it is able to render some quite large and stunning visualizations.

  • Vega and Vega-Lite are declarative graphics representations, and are the product of years of research into the fundamental language of data visualization. The reference rendering implementation is JavaScript, but the API is language agnostic. There is a Python API under development in the Altair package. Though as of summer 2016 it's not yet fully mature, I'm quite excited for the possibilities of this project to provide a common reference point for visualization in Python and other languages.

About

Days for Python Study in Data Science

License:MIT License


Languages

Language:Python 100.0%