NoahPlage / The-GitHub-History-of-the-Scala-Language

Find the true Scala experts by exploring its development history in Git and GitHub.

The GitHub History of the Scala Language (DataCamp Project)

Project Description

Open source projects carry their entire development history: who made changes, the changes themselves, and the code reviews. In this project, you'll be challenged to read in, clean up, and visualize the real-world project repository of Scala, which combines data from a version control system (Git) and a project hosting site (GitHub). With almost 30,000 commits and a history spanning over ten years, Scala is a mature language. You will find out who has had the most influence on its development and who the experts are.

The dataset includes the project history of Scala retrieved from Git and GitHub as a set of CSV files.
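Reading such CSV data and cleaning the timestamps might look roughly like the sketch below. The inline CSV text and the column names (`pid`, `user`, `date`) are assumptions made for illustration; they are not taken from the dataset itself.

```python
import io

import pandas as pd

# Hypothetical stand-in for one of the CSV files. The column names
# (pid, user, date) are assumptions, not confirmed by the dataset.
PULLS_CSV = io.StringIO(
    "pid,user,date\n"
    "101,alice,2013-12-31T23:10:55Z\n"
    "102,bob,2014-01-02T08:30:00Z\n"
    "103,alice,2014-01-05T12:00:00Z\n"
)

# Read the raw pull request records into a DataFrame.
pulls = pd.read_csv(PULLS_CSV)

# Convert the timestamp strings into timezone-aware (UTC) datetimes
# so that they can be sorted, grouped, and compared reliably.
pulls["date"] = pd.to_datetime(pulls["date"], utc=True)
```

With real files, `pd.read_csv` would be given a file path instead of the in-memory buffer.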

Technology

  • Python

Topics

  • Performed data manipulation and visualisation with the Pandas and Matplotlib libraries.
  • Created DataFrames and applied filtering techniques.
  • Built bar plots and histograms.
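The techniques listed above can be sketched on a small, made-up DataFrame. The sample rows and the column names (`pid`, `user`, `year`) are hypothetical and only serve to show the calls; the Agg backend is selected so the sketch also runs without a display.

```python
import matplotlib

matplotlib.use("Agg")  # render off-screen; drop this line in a notebook
import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical sample; real data would come from the project's CSV files.
data = pd.DataFrame({
    "pid": [1, 2, 3, 4, 5],
    "user": ["alice", "bob", "alice", "carol", "alice"],
    "year": [2012, 2012, 2013, 2013, 2013],
})

# Filtering with a boolean mask: keep only the rows from 2013.
recent = data[data["year"] == 2013]

# Count the pull requests submitted by each user.
per_user = data.groupby("user")["pid"].count()

fig, (ax_bar, ax_hist) = plt.subplots(1, 2, figsize=(8, 3))

# Bar plot: one bar per user.
per_user.plot(kind="bar", ax=ax_bar, title="Pull requests per user")

# Histogram: how many users fall into each pull request count range.
per_user.plot(kind="hist", ax=ax_hist, title="Distribution of counts")

fig.tight_layout()
```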

Outline

  1. Scala's real-world project repository data
  2. Preparing and cleaning the data
  3. Merging the DataFrames
  4. Is the project still actively maintained?
  5. Is there camaraderie in the project?
  6. What files were changed in the last ten pull requests?
  7. Who made the most pull requests to a given file?
  8. Who made the last ten pull requests on a given file?
  9. The pull requests of two special developers
  10. Visualizing the contributions of each developer
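Several of the outline steps (merging, checking activity, and finding who touched a given file) could be approached along the following lines. This is a sketch under stated assumptions rather than the project's actual solution: the two DataFrames, their column names (`pid`, `user`, `date`, `file`), and the file paths are invented for illustration.

```python
import pandas as pd

# Hypothetical pull request metadata (one row per pull request).
pulls = pd.DataFrame({
    "pid": [1, 2, 3, 4],
    "user": ["alice", "bob", "alice", "carol"],
    "date": pd.to_datetime(
        ["2017-01-10", "2017-03-05", "2018-02-01", "2018-02-20"], utc=True
    ),
})

# Hypothetical file changes (one row per file touched by a pull request).
pull_files = pd.DataFrame({
    "pid": [1, 1, 2, 3, 4],
    "file": ["src/A.scala", "src/B.scala", "src/A.scala",
             "src/A.scala", "src/B.scala"],
})

# Merging the DataFrames on the shared pull request id.
data = pulls.merge(pull_files, on="pid")

# Activity check: number of pull requests per year.
per_year = pulls.groupby(pulls["date"].dt.year)["pid"].count()

# Files changed in the most recent pull requests
# (the last 2 here; the outline uses the last 10).
last_pulls = pulls.sort_values("date").tail(2)
recent_files = set(data[data["pid"].isin(last_pulls["pid"])]["file"])

# Who made the most pull requests to a given file.
file_of_interest = "src/A.scala"  # hypothetical path
top_authors = (
    data[data["file"] == file_of_interest]
    .groupby("user")["pid"]
    .count()
    .sort_values(ascending=False)
)
```

Sorting by date before taking the tail keeps the "most recent" selection independent of the row order in the source files.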

Some visualisations from the project

[Two images of project visualisations]

Languages

  • Jupyter Notebook 100.0%