amyzhang / webCrawler

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

inURLs.txt: text file of URLs to be crawled
getURL.py: reads inURLs.txt and prints at least 10 URLs with activities/events from the same root domain to outURLs.txt
outURLs.txt: text file of URLs with activities found

About


Languages

Language:Python 100.0%