Steps
-
Save the html file of http://wenzhang.baidu.com/ as "wenzhang_full.html"
-
Edit Crawler.py
You can choose :
-
Output to txt file
-
Output to Markdown file
- Run Crawler.py
python Crawler.py
The cookie part is modified from https://github.com/n8henrie/pycookiecheat