WebCrawler

A web crawler using C# and ASP.NET

Credits

Description

The program is essentially a News Aggregator for Tempo.co. The RSSParser would work for other sites' RSS Feed, but unfortunately the HTMLParser is site-specific. Thus, the HTMLParser may not work for other websites.

The crawler has three string-matching algorithm for the matching process:

BoyerMoore
Knuth-Morris-Pratt
Regular Expression (uses C# library)

We do not release any artifacts for this program. Open the project in Visual Studio and run it to get it to work.

About

Web Crawling using C# and ASP.NET. Filtering news wanted by user using Brute force, KMP, and Boyer Moore algorithm

Languages

Language:JavaScript 88.1%Language:C# 10.5%Language:ASP 1.1%Language:CSS 0.3%