core116 / webscraping-template

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Webscraping Template

A Python script that converts website url and targeted elements into a readable csv table, customizable for any website. The template blog/gpu csv files should have an identical output as the manual blog/gpu csv files in the manual-scrapes folder. simple.html is a short HTML file you can practice webscraping on.

Make sure to install BeautifulSoup with pip install bs4 and requests with pip install requests

See Wiki for a more thorough walkthrough.

Beautiful Soup Documentation (bs4):

https://www.crummy.com/software/BeautifulSoup/bs4/doc/

Useful videos for introduction to web scraping (Code in manual-scrapes are based off/inspired by these):

Web Scraping with BeautifulSoup and Requests | Corey Schafer: https://www.youtube.com/watch?v=ng2o98k983k

Web Scraping with Python and BeautifulSoup | Data Science Dojo: https://www.youtube.com/watch?v=XQgXKtPSzUI

About


Languages

Language:Python 88.4%Language:HTML 11.6%