mwgalloway / InfoSearch

A search engine, including web crawler, written in Ruby

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Summary

InfoSearch is an opensource internet search engine built in Ruby.

It works by building an index of webpages utilizing concurrent web crawlers. Given a few seed URLs it will index relevant search terms into a MongoDB database. Prioritizing title and header tags above paragraphs, it will store up to 300 words per page. The web page view acts as a front end to a multi-word text score search on the database and returns a collection of links sorted in order of relevancy.

Dependencies

InfoSearch is known to work with the following environment:

Ruby 2.3.1
MongoDB 3.2.1
Redis 3.2.5

About

A search engine, including web crawler, written in Ruby

License:MIT License


Languages

Language:Ruby 66.6%Language:HTML 14.8%Language:JavaScript 10.2%Language:CSS 8.4%