InfuseAI / git-repo-analytics

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Introduction

This is a demo project for PipeRider. It shows how you can use dbt+duckdb+PipeRider to transform and query data efficiently. With these tools, you can generate a report that gives you insights into your git projects.

Usage

  1. Clone this repo

    git clone git@github.com:InfuseAI/git-repo-analytics.git   
    
  2. Prepare git repositories repo for analysis by editing the repos.csv

    The format of the repos.csv file is as follows:

    name,repo
    dbt-core,dbt-labs/dbt-core
    duckdb,duckdb/duckdb
    PipeRider,InfuseAI/piperider
    
  3. Install python package

    python -m venv ./venv
    source ./venv/bin/activate
    pip install -r requirements.txt     
    
  4. Generate the db file from the repo.

    make fetch
    make load
    

    The file git_repo.duckdb is generated under ./data

  5. Run dbt

    make transform
    
  6. Generate report

    make report
    
  7. Run piperider

    make piperider
    

You can either run step-by-step from step4 to step7 or simply do one command

make

Screenshots

About


Languages

Language:Python 92.7%Language:Makefile 7.3%