Smartproxy / reddit-python-scraper

Scrape Reddit utilising Smartproxy's Web Scraping API

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Reddit Scraper

Scrape Reddit utilising Smartproxy's Web Scraping API

Dependencies

BeautifulSoup

Authentication

Once you have an active Web Scraping API subscription, you can try sending a request right from the dashboard Web Scraping API > API playground method tab simply by clicking on Send Request. You will also see an example of curl request generated on the right.

This Pyhton code example uses Base64 encoded user:pass authentication.

Parser type Example location Download
HTML to JSON reddit_python_scraper.py curl https://raw.githubusercontent.com/Smartproxy/reddit-python-scraper/blob/main/reddit_python_scraper.py > reddit_python_scraper.py

HTML to JSON

This Python script extracts Subreddit details, post data and comments straight from the HTML of Reddit post page and saves them to a JSON file.

About

Scrape Reddit utilising Smartproxy's Web Scraping API

License:MIT License


Languages

Language:Python 100.0%