cydneyghamel / pandas-manipulation-analysis

Education data analysis using data manipulation tools

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Pandas-Challenge

Module 4 Challenge Requirements

District Summary (20 points) Calculate the total number of unique schools (2 points) Calculate the total number of students (2 points) Calculate the total budget (2 points) Calculate the average (mean) math score (2 points) Calculate the average (mean) reading score (2 points) Use the code provided to calculate the percentage of students who passed math (2 points) Calculate the percentage of students who passed reading (2 points) Use the code provided to calculate the percentage of students that passed both math and reading (2 points) Create a new DataFrame for the above calculations called district_summary (4 points)

School Summary (20 points) Use the code provided to select the school type (2 points) Calculate the total student count (2 points) Use the code provided to calculate the per capita spending (2 points) Calculate the average test scores (2 points) Calculate the number of schools with math scores of 70 or higher (2 points) Calculate the number of schools with reading scores of 70 or higher (2 points) Use the provided code to calculate the schools that passed both math and reading with scores of 70 or higher (2 points) Use the provided code to calculate the passing rates (2 points) Create a new DataFrame for the above calculations called per_school_summary (4 points)

Highest-Performing Schools by Percentage of Overall Passing (5 points) Sort the schools by % Overall Passing in descending order (2 points) Save the results to a DataFrame called top_schools (2 points) Display the first 5 rows (1 point)

Lowest-Performing Schools by Percentage of Overall Passing (5 points) Sort the schools by % Overall Passing in ascending order (2 points) Save the results to a DataFrame called bottom_schools (2 points) Display the first 5 rows (1 point)

Math Scores by Grade (10 points) Use the code provided to separate the data by grade (1 points) Group by "school_name" and take the mean of each (4 points) Use the code to select only the math_score (1 points) Combine each of the scores above into single DataFrame called math_scores_by_grade (4 points)

Reading Scores by Grade (10 points) Use the code provided to separate the data by grade (1 points) Group by "school_name" and take the mean of each (4 points) Use the code to select only the reading_score (1 points) Combine each of the scores above into single DataFrame called reading_scores_by_grade (4 points)

Scores by School Spending (5 points) Use pd.cut with the provided code to bin the data by the spending ranges (2 points) Use the code provided to calculate the averages (1 points) Create the spending_summary DataFrame using the binned and averaged spending data (2 points)

Scores by School Size (5 points) Use pd.cut with the provided code to bin the data by the school sizes (2 points) Use the code provided to calculate the averages (1 points) Create the size_summary DataFrame using the binned and averaged size data (2 points)

Scores by School Type (5 points) Group the per_school_summary DataFrame by "School Type" and average the results (2 points) Use the code provided to select the new column data (1 point) Create a new DataFrame called type_summary that uses the new column data (2 points)

Written Report (15 points) To receive all points, the written report presents a cohesive written analysis that:

Summarizes the analysis (5 points)

Draws two correct conclusions or comparisons from the calculations (10 points)

About

Education data analysis using data manipulation tools


Languages

Language:Jupyter Notebook 100.0%