This gem scrapes Google using any operators specified.
- Download the gems 'generalscraper' and 'requestmanager'
- Make a new request manager: requests = RequestManager.new("path/to/proxielist", [min request wait time, max request wait time], # of browsers)
- Make a new GeneralScraper object: l = GeneralScraper.new("site:site.com inurl:.pdf and other operators", "search terms", requests, nil or captcha hash, nil or cm_hash)
- Get the list or resulting pages (l.getURLs) or get full text of results (l.getData)
The proxy list must be a list of proxies in a textfile with each IP on its own line.
The hash to have CAPTCHAs solved is as follows- { captcha_key: "TwoCaptcha key" } If you don't want CAPTCHA's solved, just pass nil.
To translate pages- requests_google = RequestManager.new(nil, [1, 3], 1) t = TranslatePage.new([link, array], requests_google)