A large database with over a million rows is explored by building complex SQL queries to draw business conclusions for the data. The project mimics building an internal reporting tool for a newpaper site to discover what kind of articles the site's readers like. The database contains newspaper articles, as well as the web server log for the site.
What are the most popular three articles?
Who are the most popular article authors?
On which days did more than 1% of requests lead to errors?