liliansteven / Employee-Salaries-analysis-project

This GitHub repository contains a comprehensive analysis of an employee salaries dataset.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Employee Salaries Dataset Analysis

Overview

This GitHub repository contains a comprehensive analysis of an employee salaries dataset. The dataset provides information about the employees of an organization, including their compensation, job titles, and other relevant details.

Dataset Features

  • 'Id': Employee identification number
  • 'EmployeeName': Name of the employee
  • 'JobTitle': Job title of the employee
  • 'BasePay': Base salary of the employee
  • 'OvertimePay': Overtime pay received by the employee
  • 'OtherPay': Additional pay or bonuses
  • 'Benefits': Employee benefits
  • 'TotalPay': Total salary (sum of BasePay, OvertimePay, and OtherPay)
  • 'TotalPayBenefits': Total compensation including benefits
  • 'Year': Year of the recorded data
  • 'Notes': Additional notes (if any)
  • 'Agency': Organization or agency name
  • 'Status': Employment status

Project Tasks

1. Basic Data Exploration

  1. Identify the number of rows and columns in the dataset.
  2. Determine the data types of each column.
  3. Check for missing values in each column.

2. Descriptive Statistics

  1. Calculate basic statistics such as mean, median, mode, minimum, and maximum salary.
  2. Determine the range of salaries.
  3. Find the standard deviation of salaries.

3. Data Cleaning

  1. Handle missing data using suitable methods, with an explanation of the chosen approach.

4. Basic Data Visualization

  1. Create histograms or bar charts to visualize the distribution of salaries.
  2. Use pie charts to represent the proportion of employees in different departments.

5. Grouped Analysis

  1. Group the data by one or more columns.
  2. Calculate summary statistics for each group.
  3. Compare average salaries across different groups.

6. Simple Correlation Analysis

  1. Identify any correlation between salary and another numerical column.
  2. Plot a scatter plot to visualize the relationship.

7. Summary of Insights

  1. Write a brief report summarizing the findings and insights from the analyses.

About

This GitHub repository contains a comprehensive analysis of an employee salaries dataset.


Languages

Language:Jupyter Notebook 100.0%