English | 中文
A Python basic program based on Selenium for crawling recruitment position information, supporting CSV and SQLite data storage
- 51job
- Boss zhipin
Tip
Due to the limitation of 51job's API, the maximum number of entries obtained per search term is limited to 1000
Python 3.11.2
Edge
beautifulsoup4==4.12.2
colorlog==6.8.0
fake-useragent==1.4.0
pandas==2.1.3
selenium==4.15.2
requests==2.31.0
pip install requirements.txt
run test/spider_test.py
├─README.md
├─LICENSE
├─.gitignore
├─requirements.txt
├─log
│ ├─handler_logger.py
│ └─__init__.py
├─output
│ ├─area
│ │ ├─51area.csv
│ │ └─51area.db
│ └─job
│ ├─51job.csv
│ └─51job.db
├─spider
│ ├─jobspider51.py
│ ├─__init__.py
│ └─area
│ ├─areaspider51.py
│ └─__init__.py
└─test
└─spider_test.py
Warning
This program is only for learning and research purposes. Please do not use it for any business or illegal purpose. If you violate the regulations, please be responsible for yourself.。