HarvardOpenData / Bootcamp-Scraping-Old

Repository for the scraping bootcamp.

Scraping Bootcamp

This is the repository for HODP's Data Scraping Bootcamp.

Here's what you need to do:

  1. Ignore demo_scraping.ipynb. We use it to demonstrate what scraping looks like, so you don't need to change anything there!
  2. Open regex101.com and copy the text from sample_text.txt into the test string box.
  3. Make sure your flavour (on the left-hand side of the page) is set to Python.
  4. Feel free to refer to the useful resources listed below during the bootcamp!
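
Once a pattern works on regex101 with the Python flavour selected, it should behave the same way in Python's built-in re module. Here is a minimal sketch of that step. The sample text and both patterns are made-up stand-ins for illustration, not the actual contents of sample_text.txt or the bootcamp exercises.

```python
import re

# Hypothetical stand-in for the contents of sample_text.txt
sample_text = """Contact alice@example.com by 2019-10-05.
Backup contact: bob@example.org, available from 2019-11-12."""

# The same patterns you would type into the regex101 pattern box.
# Raw strings (r"...") keep backslashes from being treated as escapes.
date_pattern = re.compile(r"\d{4}-\d{2}-\d{2}")
email_pattern = re.compile(r"[\w.]+@[\w.]+\.\w+")

# findall returns every non-overlapping match as a list of strings
dates = date_pattern.findall(sample_text)
emails = email_pattern.findall(sample_text)

print(dates)
print(emails)
```

The email pattern here is deliberately simple and will miss many valid addresses. It is only meant to show how a tested pattern moves from regex101 into code.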

Helpful resources

  1. Regex reference sheet: https://www.regular-expressions.info/quickstart.html or http://www.rexegg.com/regex-quickstart.html#ref
  2. Most of the regex lessons were taken from here. Definitely return to them if you need a refresher or want more lessons.
  3. Great tutorial on how to use BeautifulSoup
