techtenant / Basic-Data-Science-Projects

Basic Data Science Projects - Collaborative Book Project

authors: Przemek Chojecki, Przemek Jarzynski

We are writing a collaborative, open-source data science book so that anyone can join and help us!

10 projects, 10 chapters, 1 book, with the goal of making it easy for anyone to start with data science by focusing on its practical side.

If you want to be mentioned in the book and receive a free copy of the final ebook, please help us make the code better.

Table of contents

We plan the following chapters in the book:

Your Data Science setup

Anaconda (installation, updating anaconda, installing new libraries)

Virtualenv (why you need it, creating new environments)

Jupyter Notebook (starting, popular shortcuts)

GitHub (creating an account, basic commands)

Google Colab (data science in the cloud, basic usage)
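Once an environment is set up, it can help to confirm that the libraries used in the projects are importable. The snippet below is a minimal sketch, not part of the book's code: the function name `check_libraries` is hypothetical, and the module names `pandas`, `matplotlib`, and `xgboost` are assumed to be the usual import names for the libraries mentioned in the project list.

```python
from importlib.util import find_spec

# Assumption: these are the usual import names for libraries named in the
# project list (Pandas, Matplotlib, XGBoost). Adjust to your own needs.
LIBRARIES = ["pandas", "matplotlib", "xgboost"]


def check_libraries(names):
    """Return a dict mapping each top-level module name to True if it is importable."""
    return {name: find_spec(name) is not None for name in names}


if __name__ == "__main__":
    # Print one status line per library in the current environment.
    for name, available in check_libraries(LIBRARIES).items():
        status = "installed" if available else "missing"
        print(f"{name}: {status}")
```

Running this inside an activated Anaconda or virtualenv environment shows which libraries still need to be installed there, since each environment has its own set of packages.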

Data Science Projects

  1. Analysing pharmaceutical sales data (Pandas, Matplotlib, Clustering, Regression)

  2. Predicting House Pricing (KNN, XGBoost)

  3. Introduction to Computer Vision with MNIST (Neural Networks)

  4. Face recognition (Computer Vision)

  5. Titanic Challenge (regression)

  6. Clustering wine dataset with k-means and DBSCAN (Clustering)

  7. PGA Tour 2010-2019 clustering (Clustering)

  8. Sentiment Analysis with Twitter (NLP)

  9. Cats and Dogs (Neural Networks)

  10. IMDB Database (regression, data visualisation, advanced NLP)

Practice what you have learnt with these projects (more project ideas, without solutions)

Real-world examples of Data Science in business (how some of the algorithms are currently used in practice)

Next steps (what to learn next, books to read, etc.)
