kelly-sovacool / lang-stats

Collect & plot personal programming language statistics from GitHub repos

lang-stats

Collect personal programming language statistics from GitHub repos using PyGithub.

This project is now archived and the functionality has been merged into my meta-repository.

Usage

  1. Clone this repo.

    git clone https://github.com/kelly-sovacool/lang-stats
    
  2. Create a file credentials.yaml with your GitHub login info:

    login: your_login
    password: your_password
    

    Warning: do not add the credentials file to git. Be careful not to accidentally share it with anyone.

  3. Install dependencies:

    conda env create -n lang-stats -f environment.simple.yaml
    conda activate lang-stats
    
  4. Create plots of your programming languages from your GitHub repos:

    ./langstats.py
    

    They will be located in figures/.

    Figure: language_all_bytes

    Figure: language_top_repos
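The aggregation behind plots like these can be sketched roughly as follows. This is not the code in langstats.py. It assumes each repo's languages arrive as a mapping of language name to byte count, which is the shape PyGithub's `Repository.get_languages()` returns. The function names are made up for illustration, and reading language_top_repos as "top language per repo" is a guess. The fetch helper takes a personal access token rather than the login/password pair above, since GitHub's API no longer accepts password authentication.

```python
from collections import Counter


def total_bytes_by_language(repo_languages):
    """Sum byte counts per language across repos.

    repo_languages maps repo name -> {language: bytes}.
    """
    totals = Counter()
    for languages in repo_languages.values():
        totals.update(languages)  # adds byte counts language by language
    return totals


def top_language_counts(repo_languages):
    """Count how many repos have each language as their largest by bytes."""
    counts = Counter()
    for languages in repo_languages.values():
        if languages:  # skip repos with no detected language
            counts[max(languages, key=languages.get)] += 1
    return counts


def fetch_repo_languages(token):
    """Fetch per-repo language byte counts with PyGithub (assumed installed)."""
    from github import Github  # imported lazily so the helpers above work without it

    gh = Github(token)
    return {repo.full_name: repo.get_languages() for repo in gh.get_user().get_repos()}
```

Passing the result of `fetch_repo_languages(...)` to either helper gives a `Counter` that can be handed to a plotting library; `totals.most_common()` returns the languages sorted by size.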

Notes

Jupyter is way over-represented because .ipynb files are JSON with lots of metadata (and cell outputs), all of which count towards the byte count.

  • It'd be cool to parse out the code chunks and only let code count towards the byte count.
  • Does linguist count comments in regular text files (e.g. .py or .R) towards the code byte count?
  • Try counting bytes in regular .py files to make sure my method matches that of linguist, then apply it to Jupyter code chunks stripped of metadata.
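The first idea, counting only the code chunks of a notebook, could look something like the sketch below. It assumes the nbformat 4 layout, where the notebook has a top-level `cells` list and each cell's `source` is either a string or a list of strings. `notebook_code_bytes` is a hypothetical helper, and whether its counts line up with linguist's is exactly the open question in the notes above.

```python
import json


def notebook_code_bytes(notebook_text):
    """Return the UTF-8 byte count of code-cell sources in a notebook.

    Outputs, metadata, and markdown/raw cells are ignored.
    """
    notebook = json.loads(notebook_text)
    total = 0
    for cell in notebook.get("cells", []):
        if cell.get("cell_type") != "code":
            continue  # skip markdown and raw cells
        source = cell.get("source", "")
        if isinstance(source, list):
            source = "".join(source)  # nbformat may store source line by line
        total += len(source.encode("utf-8"))
    return total
```

For a notebook on disk, read the file as text and pass it in, e.g. `notebook_code_bytes(open("analysis.ipynb", encoding="utf-8").read())`, where the file name is a placeholder.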

License: MIT License


Languages

Language: Python 100.0%