mabiesen / google-scraper

A Google Search Results Page Scraper written in Ruby

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Google Scraper

Google Scraper returns all Google Search results in a object oriented fashion. It wraps the boilerplate logic by parsing HTML from Google Search Results Page. All results are obtained directly from Google's SERP and not from deprecated Google Web Search API.

Installing

Add the line in your gemfile. Version 0.0.1 is not released yet, you have to add a reference to this Github until the release.

gem 'google-scraper', github: 'ghonorato/google-scraper'

Because Google Scraper uses Capybara and Poltergeist for scraping, you must have PhantonJS installed on your system.

Simple example

#!/usr/bin/env ruby

require 'google-scraper'

engine = GoogleScraper::Engine.new
results = engine.query(ARGV[0])

results.each do |r| 
  puts "#{r.rank} (Position #{r.page_position} for page #{r.page_number})"
  puts r.title
  puts "-> #{r.url}"
  puts r.friendly_url
  puts r.meta_description
  puts ""
end

Future

  • Add tests
  • Improve this README

About

A Google Search Results Page Scraper written in Ruby

License:MIT License


Languages

Language:Ruby 100.0%