jddbucknole / Awesome-Data-Science-Materials

Summary of some data science materials

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Awesome-Data-Science-Materials

This repository contains a summary of some data science materials: papers, useful packages, MOOC, career development info, etc.

Data Science Education & Career

  • Machine Learning Mastery: Less math and more tutorials with working code.
  • Deep Learning: Deep Learning Specialization. It teaches the most important and foundational principles of Deep Learning
  • Introduction to Deep Learning (MIT): It is a high-level introduction course. If you want to learn more details about the building blocks of deep learning, refer to the previous course from deeplearning.ai
  • TensorFlow in Practice: This Specialization will teach you best practices for using TensorFlow. It is code heavy but doesn't introduce the theoretical background of the models. It is a great companion to Deep Learning Specialization.

Data Science Paper

SQL

https://www.w3schools.com/sql/

  1. SQL for Data Scientist Learning Notes: http://scientistcafe.com/notes/SQL
  2. Select * SQL: https://selectstarsql.com/
  3. Leetcode: https://lnkd.in/g3c5JGC
  4. LinkedIn Learning: https://lnkd.in/gQXFc4n
  5. Window Functions: https://lnkd.in/g3RtPCJ
  6. HackerRank: https://lnkd.in/grv_9sB
  7. W3 Schools: https://lnkd.in/gJPfrrv
  8. CodeAcademy: https://lnkd.in/gT5xmpN
  9. SQLZoo: https://sqlzoo.net/
  10. SQL Bolt: https://sqlbolt.com/
    1. Zachary Thomas' SQL Questions https://lnkd.in/g-JJzuD
  11. Effective SQL for Data Science https://ploomber.io/posts/sql/

Deep Learning

Effective CNNs

Different Architecture Search Algorithms:

  • NASnet: 1800 GPU days (5 yrs on 1 GPU)

  • AmoebaNet: 3150 GPU days

  • DARTS: 4 GPU days

  • ENAS: 1000 x cheaper than standard NAS

Understanding Neural Networks

RNN

Others

Natural Language Processing

Experimentation

Tidy Data

  • broom package: takes the messy output of built-in functions in R, such as lm, nls, or t.test, and turns them into tidy data frames

Causal Inference

  • CausalTree

Others

  • Unix Learning Notes: http://scientistcafe.com/notes/Unix/
  • scijava-jupyter-kernel aims to be a polyglot Jupyter kernel. It uses the Scijava scripting languages to execute the code in Jupyter client and it's possible to use different languages in the same notebook.

Some of the supported languages are Groovy (default), Python, Beanshell, Clojure, Java, Javascript, Ruby and Scala.

https://github.com/scijava/scijava-jupyter-kernel/tree/afd8c1c7be5b92a734e0fac9d78bcc0216162340

Reproducible Report

R Markdown Theme Gallery: http://www.datadreaming.org/post/r-markdown-theme-gallery/

Web Development

About

Summary of some data science materials