DataAnalysisPath / 7.2-lab_supervised_learning_feature_extraction

Feature Extraction and Introduction to Supervised Learning

Home Page:https://github.com

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Ironhack logo

Lab | Feature Extraction and Introduction to Supervised Learning

Introduction

As data analysts or data scientists, we find that we don't always get the data we need, but the data that we deserve. Many times it is up to us to extract meaningful information from our data. It could be done by transforming the data using derived columns, grouping the data and using aggregated information, or cleaning and reformatting the data. We will explore these techniques in this lab.

Getting Started

Before working on today's main lab, we'd like you to review some Pandas functions. Open the Pandas-concat-merge-join.ipynb file in the your-code directory, review the information there, and warm up yourself by doing some coding.

Then open the main.ipynb file in the your-code directory. Follow the instructions and add your code and explanations as necessary. By the end of this lab, you will have learned how to prepare a dataset for most scikit-learn algorithms.

Deliverables

  • Pandas-concat-merge-join.ipynb with your responses.
  • main.ipynb with your responses.

Submission

Upon completion, add your deliverables to git. Then commit git and push your branch to the remote.

Resources

Joining and Merging in Pandas

pandas.concat

pandas.DataFrame.merge

pandas.DataFrame.join

SQL Joins

pandas.Series.unique

pandas.DataFrame.dropna

pandas.to_datetime

numpy.where

pandas.Series.value_counts

pandas.core.groupby.DataFrameGroupBy.agg

About

Feature Extraction and Introduction to Supervised Learning

https://github.com


Languages

Language:Jupyter Notebook 100.0%