ilhan-mstf / awesome-list-of-system-incidents

Curated list of news related with system incidents.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

List Of System Incidents

This is a curated list of news related with serious system incidents.

Nowadays, I hear more news related with system crashes that last hours (even days). These crashes sometimes effect millions of users and many apps. To purpose is to learn from their problem to build more reliable systems that enpowers community. Maybe, this will turn to a book that covers the history of system failures and outages.

Table of Contents

Table of contents generated with markdown-toc

Incidents

Feb 1, 2017 - GitLab.com Database Incident

By mistake production database is deleted and after nearly 19 hours hard work broadcasted live on YouTube (yes, they live streamed all system recovery effort) system back online with the lost of six hours database data. Read details

Feb 9, 2017 - Instapaper Database Incident

Instapaper hits file size limit of ext3 file system with its huge 2TB database. System starts to reject saving new articles. Until moving database to another instance, users suffered to reach saved articles. Read details

Tools

TODOs

Contribution

Contributions to the list is welcomed and encouraged. Embrace the system incidents ;). You can follow this guide.

License

Content is licensed under Creative Commons Attribution 4.0 License.

About

Curated list of news related with system incidents.

License:Creative Commons Attribution 4.0 International