deep-neural-networks deeplearning neural-network optimization preprocessing tensorflow

Neural_Network_Charity_Analysis

Neural Networks and Deep Learning Models

Analysis Overview

Using a CSV file containing more than 34,000 organizations that have received funding over the years from an organization called Alphabet Soup, create a binary classifier that is capable of predicting whether applicants will be successful if funded by this organization.

Within this dataset are a number of columns that capture metadata about each organization, such as the following:

EIN and NAME—Identification columns
APPLICATION_TYPE—Alphabet Soup application type
AFFILIATION—Affiliated sector of industry
CLASSIFICATION—Government organization classification
USE_CASE—Use case for funding
ORGANIZATION—Organization type
STATUS—Active status
INCOME_AMT—Income classification
SPECIAL_CONSIDERATIONS—Special consideration for application
ASK_AMT—Funding amount requested
IS_SUCCESSFUL—Was the money used effectively

Results

Data Preprocessing

What variable(s) are considered the target(s) for your model?

The column "IS_SUCCESSFUL" is the target for the model which is represented by "y", where the goal is to predict if a charity is going to succeed after the donation.

What variable(s) are considered to be the features for your model?

The following columns are the features of the model and they are represented by X:

NAME
APPLICATION_TYPE
AFFILIATION CLASSIFICATION
USE_CASE
ORGANIZATION
STATUS
INCOME_AMT
SPECIAL_CONSIDERATIONS
ASK_AMT

What variable(s) are neither targets nor features, and should be removed from the input data?

The column "EIN" was removed from the input data because it contains only unique identification values.

Compiling, Training, and Evaluating the Model

How many neurons, layers, and activation functions did you select for your neural network model, and why?

Hidden layers: 3;
Neurons: 100, 30, and 10 neurons respectively.
Activation functions: Relu for the first hidden layer, Sigmoid for the two other hidden layers and also for the output layer.

Were you able to achieve the target model performance?

Yes, the performance achieved was 80%, which is a considerable improvement from the previous 73%.

What steps did you take to try and increase model performance?

Multiple tests were performed trying different combinations of hidden layers, neurons, and activation functions. But one of the steps that actually made the difference was to keep the column NAMES, which at first may seem like it contains only unique values since they are identifications, but when investigating this column, it actually does not have only unique values meaning that it should be included on the prediction analysis.

Summary

In summary, it is important to look at all the features before dropping any that may seem irrelevant since that can cause low performance on the model.

The Neural Model with the chosen hidden layers, neurons, and activation functions is 80% accurate at predicting if a charity would be successful after a donation.

As a suggestion, a model that could help to increase the performance even more could be the Random Forest Classifier, since it is good with classification problems.

About

Neural Networks and Deep Learning Models

deep-neural-networks deeplearning neural-network optimization preprocessing tensorflow

Languages

Language:Jupyter Notebook 100.0%