Retrieves links, tags and text from any given webpage
Prerequisites:
- You need BeautifulSoup installed on your machine. You can install it with either of the following commands:
$ easy_install beautifulsoup4
$ pip install beautifulsoup4
Further reading: https://www.crummy.com/software/BeautifulSoup/bs4/doc/
- The script also uses urllib2. It is part of the Python 2 standard library and is already imported in the code, so there is nothing extra to install.
webRetriever offers a few options: scrape all links on the page, all tags, or the visible text.
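For reference, here is a minimal sketch of how the three options could be implemented with BeautifulSoup. It is not the code from webRetriever.py: the function names (get_links, get_tags, get_visible_text) and the SAMPLE_HTML string are made up for illustration, and the page is parsed from a string rather than fetched from a URL so the example runs offline.

```python
from bs4 import BeautifulSoup

# Hypothetical sample page, used here instead of a live URL.
SAMPLE_HTML = """
<html><head><title>Demo</title><style>p { color: red; }</style></head>
<body><p>Hello <a href="https://example.com/a">first</a></p>
<a href="/b">second</a><script>var x = 1;</script></body></html>
"""


def get_links(soup):
    """Return the href of every <a> tag that has one."""
    return [a["href"] for a in soup.find_all("a", href=True)]


def get_tags(soup):
    """Return the name of every tag, in document order."""
    return [tag.name for tag in soup.find_all(True)]


def get_visible_text(soup):
    """Return non-empty text that is not inside script, style, head or title."""
    hidden = {"script", "style", "head", "title", "[document]"}
    chunks = [
        s.strip()
        for s in soup.find_all(string=True)
        if s.parent.name not in hidden
    ]
    return [c for c in chunks if c]


# "html.parser" is Python's built-in parser, so no extra parser is needed.
soup = BeautifulSoup(SAMPLE_HTML, "html.parser")

if __name__ == "__main__":
    print(get_links(soup))
    print(get_tags(soup))
    print(get_visible_text(soup))
```

The visible-text filter here simply skips strings whose parent tag is normally not rendered. The real script may decide visibility differently.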
Instructions:
- Run the script:
$ python webRetriever.py
- Enter your URL when prompted
- Select one of the options