PallaviTS / web_crawler

Crawler to scrape data from Events/Calendar Sites

Home Page:https://secure-castle-95821.herokuapp.com/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Events Filter

Crawler to scrape data from Events/Calendar Sites

Collecting information for my website from other web sources

  1. https://www.co-berlin.org/en/calender
  2. https://gorki.de/en/programme/2018/08/all

Events filter: User can filter the events based on different criteria:

  1. Web Source.
  2. Dates.
  3. Simple partial text search on title.

The events are updated frequently so visitors can always access fresh information.

Running a cron job at midnight to get the latest information

Deployed on HEROKU

https://secure-castle-95821.herokuapp.com/

I have added comments in code section to explain my thought process and share toolset used.

About

Crawler to scrape data from Events/Calendar Sites

https://secure-castle-95821.herokuapp.com/


Languages

Language:Ruby 81.2%Language:HTML 13.0%Language:JavaScript 3.2%Language:CSS 1.9%Language:CoffeeScript 0.6%