student-IH-labs-and-stuff / BER-DAFT-OCT20-Sian

Main Official Repo for the Oct 2020 Berlin Data Analytics Cohort, will contain lesson notes and solutions.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

BER-DAFT-OCT20

This repo contains lecture material, lab solutions and more for the Oct. 2020 Data Analytics course at Ironhack Berlin (LT: Sian).

Table of Contents

  1. Lecture Notes
  1. Lab Solutions
  1. Workshop Material

1. Lecture Notes

Unit 1 (Week 1)

TODO: fix broken links

Day Notes Files
2 teaching2010.ipynb
3 teaching2110.ipynb
4 teaching2110.ipynb, lambda.ipynb (python data cleaning utils)

Unit 2 (Week 2)

TODO: fix broken links

Day Notes Files
1 python_and_sql.ipynb, W2D1 - Data Structures.pdf bank_data_class_example.sql
2 day2sql.sql
3 day3.sql
4 sql day 4.sql, sql day 4 cont.sql
5

Unit 3 (Week 3)

TODO: fix broken links

Day Notes Files
1 Lesson_1_Key_Concepts.pptx , Lesson_2_Key_Concepts.pptx , Lesson_3_Key_Concepts.pptx , Lesson_4_Key_Concepts.pptx, 3.02 joins 3 or more tables.pptx, 3.02 why order of tables in joins matter.pptx, Unit 3 schedule.pptx
2 cross join 2.pptx, cross join.pptx, sub queries 1.pptx, subqueries with where.pptx
3 CTE.pptx, Ranks.pptx,Views check option.pptx, Views.pptx, nested subqueries.pptx, rules of subqueries.pptx, self join 1.pptx, self join 2.pptx, useful subq with IN.pptx
4
5 customer_churn.csv

Unit 4 (Week 4)

Day Notes Files
1 "Healthcare for all" - Case study, instructions (.md), column definitions (.docx), imputation_methods.pptx, numerical columns distribution.pptx, imbalance and mitigation.pptx Healthcare for all, data (.csv)
2
3
4
5

2. Lab solutions

Unit 1 (Week 1)

Lab Link to lab instruction (external repo) Link to lab solution
Lab | Bash link link
Lab | Git link N/A
Lab | Customer Analysis Round 1 link link
Lab | Customer Analysis Round 2 link link
Lab | Customer Analysis Round 3 link link
Lab | Customer Analysis Round 4 link link
Lab | Customer Analysis Round 5 link link
Lab | Customer Analysis Round 6 link link
Lab | Customer Analysis Round 7 link link
Lab | Customer Analysis Final Round link link

Notebooks which we created in Lab sessions

Notebook Files we used in this notebook
QA_2020_10_22_data_cleaning_and_linear_regression.ipynb Usa_Housing.csv
2020_10_23_customer_analysis_round_1-6_with_open_questions.ipynb (WIP - work in progress, will be improved over the day) marketing_customer_analysis.csv

Unit 2 (Week 2)

solutions folder

Lab Link to lab instruction (external repo) Link to lab solution
Lab | SQL Intro link link
Lab | SQl Queries 2 link link
Lab | SQl Queries 3 link link
Lab | SQl Queries 4 link link
Lab | SQl Queries 5 link link
Lab | SQl Queries 6 link link
Lab | SQl Queries 7 link link
Lab | SQl Queries 8 link link
Lab | SQl Queries 9 link link

Unit 3 (Week 3)

solutions folder

Lab Link to lab instruction (external repo) Link to lab solution
Lab | SQL Join link link
Lab | SQL SQL Joins on multiple tables link link
Lab | SQL Self and cross join link link
Lab | Database normalization link link
Lab | SQL SQL subqueries link link
Lab | SQL SQL Advanced Queries link link

Unit 4 (Week 4)

solutions folder

Lab Link to lab instruction (external repo) Link to lab solution
Lab | Cleaning numerical data, Lab | Cleaning Categorical data Lab cleaning numerical data, Lab cleaning categorical data 4.1_4.2_Labs_Cleaning_numerical_and_categorical_data.ipynb (one notebook)

Miscellaneous

Setup instructions & help to get an SQL environment ready

Workshop material

Map-Filter-Reduce

webscraping

About

Main Official Repo for the Oct 2020 Berlin Data Analytics Cohort, will contain lesson notes and solutions.


Languages

Language:Jupyter Notebook 99.8%Language:Python 0.1%Language:Rich Text Format 0.0%