BiancaTaisePommerening / Neural_Network_Charity_Analysis

Neural Networks and Deep Learning Models

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Neural_Network_Charity_Analysis

Neural Networks and Deep Learning Models

Analysis Overview

Using a CSV file containing more than 34,000 organizations that have received funding over the years from an organization called Alphabet Soup, create a binary classifier that is capable of predicting whether applicants will be successful if funded by this organization.

Within this dataset are a number of columns that capture metadata about each organization, such as the following:

  • EIN and NAME—Identification columns
  • APPLICATION_TYPE—Alphabet Soup application type
  • AFFILIATION—Affiliated sector of industry
  • CLASSIFICATION—Government organization classification
  • USE_CASE—Use case for funding
  • ORGANIZATION—Organization type
  • STATUS—Active status
  • INCOME_AMT—Income classification
  • SPECIAL_CONSIDERATIONS—Special consideration for application
  • ASK_AMT—Funding amount requested
  • IS_SUCCESSFUL—Was the money used effectively

Results

Data Preprocessing

What variable(s) are considered the target(s) for your model?

The column "IS_SUCCESSFUL" is the target for the model which is represented by "y", where the goal is to predict if a charity is going to succeed after the donation.

What variable(s) are considered to be the features for your model?

The following columns are the features of the model and they are represented by X:

  • NAME
  • APPLICATION_TYPE
  • AFFILIATION CLASSIFICATION
  • USE_CASE
  • ORGANIZATION
  • STATUS
  • INCOME_AMT
  • SPECIAL_CONSIDERATIONS
  • ASK_AMT

What variable(s) are neither targets nor features, and should be removed from the input data?

The column "EIN" was removed from the input data because it contains only unique identification values.

Compiling, Training, and Evaluating the Model

How many neurons, layers, and activation functions did you select for your neural network model, and why?

model.png

  • Hidden layers: 3;

  • Neurons: 100, 30, and 10 neurons respectively.

  • Activation functions: Relu for the first hidden layer, Sigmoid for the two other hidden layers and also for the output layer.

Were you able to achieve the target model performance?

  • Yes, the performance achieved was 80%, which is a considerable improvement from the previous 73%.

accuracy.png

What steps did you take to try and increase model performance?

  • Multiple tests were performed trying different combinations of hidden layers, neurons, and activation functions. But one of the steps that actually made the difference was to keep the column NAMES, which at first may seem like it contains only unique values since they are identifications, but when investigating this column, it actually does not have only unique values meaning that it should be included on the prediction analysis.

Summary

In summary, it is important to look at all the features before dropping any that may seem irrelevant since that can cause low performance on the model.

The Neural Model with the chosen hidden layers, neurons, and activation functions is 80% accurate at predicting if a charity would be successful after a donation.

As a suggestion, a model that could help to increase the performance even more could be the Random Forest Classifier, since it is good with classification problems.

About

Neural Networks and Deep Learning Models


Languages

Language:Jupyter Notebook 100.0%