guoxiaolu / scrapy

google image crawler

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

This project is modified from https://gist.github.com/kekeblom/204a609ee295c81c3cc202ecbe68752c.

selenium and corresponding webDriver, a proxy (whose port is 1080) are needed.

This project is currently used for downloading image from google image without using google api.

Sometimes, the final searching number is small, eg. count=500, but only return 28. This may be related to the webdriver, it's a bug, but I don't know how to fix it. Run it again, it returns correctly.

google image and soso image are supported, just change the "is_google"

About

google image crawler


Languages

Language:Python 100.0%