dirkjbreeuwer / yc-scraper

YC Company Scraper is a Python script that fetches information about Y Combinator companies from specified batches, extracts GitHub repository links from their websites, and prints the data.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

YC Company Scraper

YC Company Scraper is a Python script that fetches information about Y Combinator companies from specified batches, extracts GitHub repository links from their websites, and prints the data.

Features

  • Retrieve company information from various Y Combinator batches
  • Scrape GitHub repository links from company websites
  • Print the extracted data

Requirements

  • Python 3.6 or higher
  • requests library

Installation

  1. Clone this repository:
git clone git@github.com:dirkjbreeuwer/yc-scraper.git
cd yc-company-scraper
  1. Install the requests library:
pip install requests

Usage

  1. Open yc_company_scraper.py and modify the batches list to include the desired Y Combinator batch codes.

  2. Run the script:

python yc_company_scraper.py
  1. The script will print the company name, website, YC URL, and any GitHub repository links found on their website.

Disclaimer

This script is for educational purposes only. Always respect the websites' terms of service and robots.txt when scraping data.

About

YC Company Scraper is a Python script that fetches information about Y Combinator companies from specified batches, extracts GitHub repository links from their websites, and prints the data.


Languages

Language:Python 99.6%Language:PowerShell 0.3%Language:Shell 0.1%