a-paxton / oss-community-health

Exploration of community health in open-source software communities developing on GitHub

Home Page:https://osf.io/6ncwt/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Community Health of Open-Source Software

This repository contains the code for our quantitative exploration of community health in open-source software communities developing on GitHub.

Data

Data for this project were obtained through the GitHub API. For reproducibility, we provide access to the data as they existed at the time of our analysis. Data can be found in a zipped folder in our OSF project. The files are organized in that folder as follows:

  • project_name: Each project included in our experiment includes its own top-level directory, named according to the repository.
    • project_name/issues.tsv: Includes tickets, issues, and pull requests made on the project. In addition to the columns provided automatically by GitHub, we have also created two additional columns:
      • num_PR_created: the number of PR created before that issue/PR by that user.
      • num_issue_created: the number of issues created before that issue/PR by the corresponding author.
    • project_name/comments.tsv: Includes all comments made on the issues for the project.

Sample

Currently, our data include 8 projects:

  • matplotlib
  • mayavi
  • numpy
  • pandas
  • scikit-image
  • scikit-learn
  • scipy
  • sphinx-gallery

About

Exploration of community health in open-source software communities developing on GitHub

https://osf.io/6ncwt/

License:MIT License


Languages

Language:Python 49.6%Language:Jupyter Notebook 39.1%Language:R 8.6%Language:Makefile 2.7%