ShingiraiBhengesa / ycombinator_githubs

A web scraping / data mining script for extracting beginner-friendly github repos from Y Combinator's company database: https://www.ycombinator.com/companies/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

ycombinator_githubs

A web scraping/data mining script for extracting beginner-friendly github repos from Y Combinator's company database: https://www.ycombinator.com/companies/

This scraper is designed with Microverse students and graduates in mind. It extracts the most recent and active Y Combinator github repos, written in JavaScript and Ruby, which are the two languages Micronauts use the most!

The resulting list as of July 2022 for YC batches S21, W21, S22, and W22, is in 05_final_repos.csv. Be sure to check it out to find beginner-friendly repos written in JS and Ruby.

Installation

  1. Clone the repository
  2. $ cd ycombinator_githubs
  3. $ npm install

Usage

  1. Generate your github Personal Access Token as detailed in .env.sample
  2. Change the name of .env.sample to simply .env
  3. $ ./scrape.sh

For details on how the scraping works, or what the .csv files contain, read scrape.sh

About

A web scraping / data mining script for extracting beginner-friendly github repos from Y Combinator's company database: https://www.ycombinator.com/companies/

License:MIT License


Languages

Language:JavaScript 56.6%Language:Shell 43.4%