ducle2502 / Python-for-Data-Science

This is the repository for the course Python for Data Science, covering basic python, scientific libraries (e.g., numpy, pandas, sklearn), ML from scratch, PyTorch, and NLP.

Home Page:http://chaklam.com

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Python for DSAI

This is the repository for the course Python for DSAI at Asian Institute of Technology.

Some resource worth mentioning:

  1. Prerequisites/0 - Reading Roadmap
  • For those who wants to know what papers to read. I have listed ONLY the most important papers you need to read in the field of machine learning
  1. Prerequisities/0 - Installation
  • For newbies who have trouble installing Python and other tools
  1. Prerequisities/0 - Course Notations
  • Understanding notations is the first step towards conquering math, so take a look and familiarized with it
  1. Syllabus/0. Course Introduction.ipynb
  • Contains how I run the course. This course is a 15 weeks course, each week having two labs of 3 hours each. Each lab always end with the assessment and solution.

I would also like to give credits to several githubs that I have revised to create this:

I would also like to thank students who have contributed:

  • Akraradet Sinsamersuk
  • Pranisaa Charnparttarvanit
  • Chanapa Pananookooln

The course is structured into 3 big components, mostly focusing on preprocessing and modeling perspectives:

1. Python Basics

Focus on getting started.

  • Python
  • Numpy
  • Pandas
  • Matplotlib
  • Sklearn

2. Traditional Machine Learning

Focus on understanding the math + coding via coding from scratch

2.1 Regression

  • Linear regression
  • Polynomial regression
  • Regularization

2.2 Classification

  • Logistic regression
  • Naive Gaussian
  • Support Vector Machines
  • Decision Trees
  • K-Nearest Neighbors
  • Bagging
  • Random Forests
  • Boosting - AdaBoost, Gradient Boosting

2.3 Clustering

  • K-means
  • Gaussian Mixture Models

2.4 Dimensionality Reduction

  • Principal Component Analysis

3. Deep Learning

3.1 PyTorch

  • Linear Regression
  • Logistic Regression
  • Convolutional Neural Network

3.2 NLP

  • Recurrent Neural Network
  • Word Embeddings
  • Padded Sequence
  • Convolutional Neural Network for NLP
  • Bidirectonal Stacked LSTM

About

This is the repository for the course Python for Data Science, covering basic python, scientific libraries (e.g., numpy, pandas, sklearn), ML from scratch, PyTorch, and NLP.

http://chaklam.com


Languages

Language:Jupyter Notebook 100.0%