aronlindberg / github-analysis

Analize github activity.

Home Page:www.matrobot.com

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

GitHub-Analysis

This project contains code for parsing and analysis data availabe from githubarchive.org

Installation

Run application

Run the application to extract events for particular repository with command:

java -jar gha.jar params.yaml

Sample params.yaml:

# Extract events from repository rails/rails into events.csv where data in range from 2011-11 to 2012-1

command: find_events

repository: rails/rails

date:
  from: 2011-11
  to: 2012-1
  
datapath: /home/klangner/datasets/github

output: events.csv  

See more

Packages

  • com.matrobot.gha.archive - Contains code to parse githubarchive.org files and create intermediate csv and json files which can be processed by other packages or external programs like weka.
  • com.matrobot.gha.insights - Contains code for analyzing data and find patterns in it.
  • com.matrobot.gha.ghapi - Get data from GitHub API.

Important links

Dependencies

This project uses Maven to resolve dependencies

About

Analize github activity.

www.matrobot.com

License:MIT License