bwayvs / GlobalGDP_DataExploration

Using Excel for data exploration of global GDP dataset with ANOVA hypothesis testing and regression analysis

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Global GPD Data Exploration

This project explores global Gross Domestic Product (GDP) data by nation and continent from 1999 through 2019 in Excel using ANOVA testing and regression analysis. Using this data, we can see that there is sufficient evidence that there are significant differences in mean of national and continental GDP.

Source

GDP Table in the U.S. Department of Agriculture, Economic Research Service, Macroeconomic Data Set, GDP table. https://www.ers.usda.gov/webdocs/DataFiles/51832/HistoricalGDPSharesValues.xlsx?v=1906.2

Screenshots

Aggregated Continental GDP Data

Raw Data - Continental GDP

I pulled national GDP data from USDA website, then aggregated the continental GDP data from individual countries as part of the preparation for further analysis. The numbers have been condensed to show GDP in millions ($).

Numerical Summary of Data

ANOVA Numerical Table - Continental GDP

I calculated the means, standard deviation and variances of the sample.

Checking for Constant Variance and Normality

Variance and Normality - Continental GDP

I created a dynamic Q-Q plot that can be updated for each of the 6 continents. Here, we see that North America's sample data has a normal distribution. I also checked for constant variance to ensure the spread was roughly equal for all groups. With this information, I was able to move on to hypothesis testing.

ANOVA Hypothesis Testing

Hypothesis Testing Results - Continental GDP

Having met all the requirements, the data was ready for further analysis and hypothesis testing. Using an f-test, I was able to determine the P-value was very small. This tells us that at least one mean continental GDP was different than the others.

Means Plot of Continental GDP Data

ANOVA Summary Graph - Continental GDP

Using a means plot, I was able to illustrate that Group 4 (Asia) had a much higher GDP mean than other continents.

Authors

πŸš€ About Me

I'm Veronica, a results-driven Data Analyst with expertise in SAP and process improvement. With a background in translating complex requirements into actionable insights, I leverage SQL, data visualization tools, and Agile methodologies to optimize supply chains and drive business decisions. My passion lies in turning data into meaningful business strategies, ensuring organizational alignment, and fostering cross-functional collaboration.

πŸ”— LinkedIn Profile

linkedin

About

Using Excel for data exploration of global GDP dataset with ANOVA hypothesis testing and regression analysis