data2al / Intro_To_Datascience_Training_2018

Springboard Intro To Data Science Training Projects

Home Page:https://www.springboard.com/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Data sets

A Data-Driven Approach to Bank Telemarketing

Keywords

consumer data, banking, telemarketing, wealth, status

Summary

A collection of datasets that provides data that is related to a direct marketing campaign launched by the Portuguese Banking Institute. Within the data set contains customer data and whether the product was successfully subscribed or not. It may be possible to try and determine which factors are in play here with the willingness of the consumer to accept the product based on their age, income or maritial status.

Why pick this topic

This topic is related to business and consumer research. Once a conclusion as to how to connect with the customer is found, it may be applied to a different field. The data processing method is also important in this case. The perks of this topic is the data set is quite complete and only requires one click to download.

How do I get it?

The documentation for the first dataset is here And is downloadable under data folder.

Researching the Opiod Crisis - Accidental Drug Related Deaths in Connecticut from 2012 to 2017

Keywords

medical, drug usage, opioids

Summary

The dataset contains information regarding each accidental death associated with drug overdose in Connecticut from 2012 to June 2017. The data was derived from an investigation by the Office of the Chief Medical Examiner which includes the toxicity report, death certificate, as well as a scene investigation. The set includes information such as Date, Sex, Race, Age, Location with the type of drug used either in single usage or multiple combinations.

Why pick this topic

The topic may provide important insight as to what drug or what combination of drugs are the leading cause of accidental death in connecticut and how age or gender may play in part in the cause of death. The data set contains over 4000 records, but does not contain any numerical variables apart from age, which might be a shortcoming in gaining better insight.

How do I get it?

The documentation for the dataset is here and the link to the data download is under "Downloads & Resources".

Crop Yields and Weather - State of NY

Keywords

Weather, Climate, Crop Yields, Principal Crops

Summary

This dataset requires the combination of two different data sources. The first set being crop yields which highlights the annual production of crops dating back to the 1990s. The second data set contains weather data for each state. But we will mostly be focusing our effort on NY.

Why pick this topic

This data will help us understand how weather affects crops and show if there is a declining or increasing trend on crop production in correlation with the climate change.

Downloading this data set will be a challenge as there are no visible API for both data sources at the moment and the data is spread out. Some web scraping might make it easier to obtain data from the second site, but more likely building the entire data set will require a lot of manual copy and pasting.

How do I get it?

The documentation for the first dataset is here and the link to the data download is under "Latest Releases:".

The second dataset can be obtained here

About

Springboard Intro To Data Science Training Projects

https://www.springboard.com/


Languages

Language:R 100.0%