Case study
Clone the repository using git and install the dependencies:
git clone https://github.com/cvngur/scraper
pip install -r requirements.txt
- Python
- BeautifulSoup
- OpenPyXL
- I have used BeautifulSoup and OpenPyXL in earlier projects, so neither posed a challenge.
- How to use the concurrent.futures library
- Differences between ThreadPoolExecutor and ProcessPoolExecutor
- We can use multiple threads to reduce the total time needed to scrape a list of URLs, because the requests wait on the network concurrently.
- An API is an interface that lets applications communicate with each other. You send a request to the API for data you do not have, and the API returns a response that contains the data you need.
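
To illustrate the point about threads, here is a minimal sketch of fetching several URLs with `ThreadPoolExecutor` from `concurrent.futures`. It is not the code from this repository: `fetch`, `scrape_all`, and the `example.com` URLs are hypothetical names chosen for the example, and `fetch` only sleeps to imitate network latency instead of making a real HTTP request.

```python
import time
from concurrent.futures import ThreadPoolExecutor, as_completed


def fetch(url):
    """Hypothetical stand-in for an HTTP request: sleeps to mimic network latency."""
    time.sleep(0.1)
    return f"<html>{url}</html>"


def scrape_all(urls, max_workers=8):
    """Fetch every URL on a thread pool and return a dict mapping url -> page."""
    results = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Submit one task per URL and remember which future belongs to which URL.
        futures = {pool.submit(fetch, url): url for url in urls}
        # Collect results as soon as each task finishes, in completion order.
        for future in as_completed(futures):
            results[futures[future]] = future.result()
    return results


if __name__ == "__main__":
    # Placeholder URLs, assumed for illustration only.
    urls = [f"https://example.com/page/{i}" for i in range(8)]
    start = time.perf_counter()
    pages = scrape_all(urls)
    elapsed = time.perf_counter() - start
    print(f"fetched {len(pages)} pages in {elapsed:.2f}s")
```

Threads fit this kind of work because scraping is I/O-bound: each thread spends most of its time waiting for a response, so the waits overlap. `ProcessPoolExecutor` offers the same interface but runs tasks in separate processes, which tends to pay off for CPU-bound work rather than for network requests.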