muthu-cs/system-design

System Design Basics

Key Characteristics and Fundamentals of Distributed Systems
Monolithic VS Microservice (Service Discovery, Resiliency)
Vertical vs horizontal scaling Watch1
Load Balancing / Application Delivery Controller (ADC) Read1 Read2 Watch1
Consistent Hashing Watch1 Read1 Read2 Read3
Throughput, Latency
CAP theorem
ACID vs BASE
Redundancy and Replication
Partitioning/Sharding
Optimistic vs pessimistic locking
Strong vs eventual consistency
SQL vs NoSQL
Types of NoSQL (Key value, Wide column, Document-based, Graph-based)
Caching
Data center/racks/hosts
CPU/memory/Hard drives/Network bandwidth
Random vs sequential read/writes to disk
DNS lookup
HTTP, HTTPS, HTTP2
- HTTP
- HTTPS Read1
- HTTP & SSL/TLS
- Public key infrastructure and certificate authority(CA)
- Symmetric vs asymmetric encryption
WebSockets
Long-Polling vs WebSockets vs Server-Sent Events
TCP/IP model
IPv4 vs IPv6
TCP vs UDP
Consistent Hashing
CDNs & Edges
Data Partitioning
Indexes
Master-Slave, Master-Master
Active-Passive, Active-Active
Leader election
Design patterns and Object-oriented design
Virtual machines and containers
Pub-sub architecture
REST, GraphQL
MapReduce
Bloom filters and Count-Min sketch
Paxos
Multithreading, locks, synchronization, CAS(compare and set)
Proxies

Building Blocks of Any Frequently Asked System Design Question

Authentication
- JWT
- OAUTH2
File / Media Upload
- S3, Multiple Quality Files
WIP...

Tools and Technologies

Databases Comparison
Cassandra
MongoDB/Couchbase
- Mongo: Read1, Read2, Read3, Read4, Read5, Read6, IQ's
RabbitMQ / Kafka / Pub-Sub comparison Comparison
- RabbitMQ: Watch1, Watch2
- Google PubSub: Watch Playlist
Mysql / PostgreSQL
- Scalability in Postgres
Redis / Memcached
InfluxDB [Suitable for TimeSeries, IoT data]
Zookeeper
NGINX
HAProxy
Solr, Elastic search
Amazon, EC2, S3
Docker, Kubernetes
Hadoop/Spark and HDFS
Eureka, Hysterix
Heroku / Azure DevOps
Jenkins CI/CD

System Design Problems (HLD + LLD)

TinyURL
Instagram | Photo hosting platform
Timeline | Newsfeed | Twitter
Dropbox | Google Drive
Whatsapp | Facebook Messenger NL GS Ref
MakeMyTrip | BookMyShow
Amazon | Flipkart
Youtube | Netflix NL
Uber | IRCTC
Swiggy | Zomato
Yelp | Nearby
Twitter Search
Google Search
SplitWise
Zerodha
API Rate Limiter
Web Crawler
Rate limiting system
Distributed cache
Typeahead Suggestion | Auto-complete system
Recommendation System
Design a tagging system like tags used in LinkedIn

Low Level Design Problems (Machine Coding Round) Reference

Elevator system
Snake and Ladder game
Tic Tac Toe
ATM machine - https://medium.com/swlh/atm-an-object-oriented-design-e3a2435a0830
Traffic Control System
Vehicle Parking System
Online Coding Platform problem-statement
File Sharing System
Object Oriented Design Prerations [https://www.oodesign.com/]
SOLID Principles
Design Patterns [https://refactoring.guru/design-patterns]
More Problems List
More Good Resources:
- https://refactoring.guru/design-patterns/what-is-pattern
- http://www.cs.unibo.it/~cianca/wwwpages/ids/esempi/coffee.pdf Recomended by - sudoCode
- https://cseweb.ucsd.edu//~wgg/CSE210/ecoop93-patterns.pdf Recomended by - sudoCode

Engineering Blogs Ref

Airbnb-http://nerds.airbnb.com/
AirPair-https://www.airpair.com/posts
Artsy-http://artsy.github.io/
Asana-https://eng.asana.com/
Bandcamp-http://bandcamptech.wordpress.com/
BenefitFocus-http://engineering.benefitfocus.com/
Bitly-http://word.bitly.com/
Bittorrent-http://engineering.bittorrent.com/
Cerner-http://engineering.cerner.com/
Chartbeat-http://engineering.chartbeat.com/
Cloudera-http://blog.cloudera.com/blog/
Cloudflare-http://blog.cloudflare.com/
Docker-http://blog.docker.com/category/engineering/
Dropbox-https://blogs.dropbox.com/tech/
Ebay-http://www.ebaytechblog.com/
Etsy-https://codeascraft.com/
Eventbrite-https://engineering.eventbrite.com/
Facebook-https://code.facebook.com/posts/
Flickr-http://code.flickr.net/
Fiftythree-http://making.fiftythree.com/
Flipboard-http://engineering.flipboard.com/
Foursquare-http://engineering.foursquare.com/
Github-http://githubengineering.com/
Gnip-https://engineering.gnip.com/
GoSquared-https://engineering.gosquared.com/
Grouper-http://eng.joingrouper.com/
Groupon-https://engineering.groupon.com/
Harry's-http://engineering.harrys.com/
Heroku-http://engineering.heroku.com/
Honeybadger-http://blog.honeybadger.io/
Indeed-http://engineering.indeed.com/blog/
Instagram-http://instagram-engineering.tumblr.com/
Intent-http://engineering.intenthq.com/
Linkedin-https://engineering.linkedin.com/blog
Livechat-http://developers.livechatinc.com/blog/
Medallia-http://engineering.medallia.com/blog/
Monetate-http://engineering.monetate.com/
Netflix-http://techblog.netflix.com/
Oyster-http://tech.oyster.com/
Paypal-https://www.paypal-engineering.com/
Pinterest-http://engineering.pinterest.com/
Prezi-https://medium.com/prezi-engineering
Quora-http://engineering.quora.com/
Rightscale-http://eng.rightscale.com/
Salesforce-https://developer.salesforce.com/blogs/engineering/
Shopify-http://www.shopify.com/technology
Simple-https://www.simple.com/engineering
Slideshare-http://engineering.slideshare.net/
Songkick-http://devblog.songkick.com/
Soundcloud-https://developers.soundcloud.com/blog/
Spotify-https://labs.spotify.com/
Square-https://corner.squareup.com/
Strava-http://engineering.strava.com/
Tumblr-http://engineering.tumblr.com/
Twitter-https://blog.twitter.com/engineering
Twilio-https://www.twilio.com/engineering/
Thumbtack-https://www.thumbtack.com/engineering/
Wayfair-http://engineering.wayfair.com/
Wealthfront-http://eng.wealthfront.com/
Webengage-http://engineering.webengage.com/
Yahoo-http://yahooeng.tumblr.com/
Yammer-http://engineeringblog.yelp.com/
Yelp-http://engineeringblog.yelp.com/
Zenpayroll-http://engineering.zenpayroll.com/
Zillow-https://engineering.zillow.com/

Other Useful Resources:

HOW TO ACE A SYSTEMS DESIGN INTERVIEW-https://www.palantir.com/2011/10/how-to-ace-a-systems-design-interview/
HighScalability Blog-http://highscalability.com/
Distributed Systems-http://book.mixu.net/distsys/single-page.html
Distributed Deep Dive - https://ably.com/blog/introducing-distributed-deep-dive-interview-series-by-ably-realtime
Architecture for microservice by Microsoft - https://docs.microsoft.com/en-us/dotnet/architecture/microservices/

System Design Interview Approach Template

THINGS TO CONSIDER [5 min]

    (1) Features
    (2) API
    (3) Availability
    (4) Latency
    (5) Scalability
    (6) Durability
    (7) Class Diagram
    (8) Security and Privacy
    (9) Cost-effective

FEATURE EXPECTATIONS [5 min]

    (1) Use cases
    (2) Scenarios that will not be covered
    (3) Who will use
    (4) How many will use
    (5) Usage patterns

ESTIMATIONS [5 min]

    (1) Throughput (QPS for read and write queries)
    (2) Latency expected from the system (for read and write queries)
    (3) Read/Write ratio
    (4) Traffic estimates
            - Write (QPS, Volume of data)
            - Read  (QPS, Volume of data)
    (5) Storage estimates
    (6) Memory estimates
            - If we are using a cache, what is the kind of data we want to store in cache
            - How much RAM and how many machines do we need for us to achieve this ?
            - Amount of data you want to store in disk/ssd

DESIGN GOALS [5 min]

    (1) Latency and Throughput requirements
    (2) Consistency vs Availability  [Weak/strong/eventual => consistency | Failover/replication => availability]

HIGH LEVEL DESIGN [5-10 min]

    (1) APIs for Read/Write scenarios for crucial components
    (2) Database schema
    (3) Basic algorithm
    (4) High level design for Read/Write scenario

DEEP DIVE [15-20 min]

    (1) Scaling the algorithm
    (2) Scaling individual components: 
            -> Availability, Consistency and Scale story for each component
            -> Consistency and availability patterns
    #### Think about the following components, how they would fit in and how it would help
            a) DNS
            b) CDN [Push vs Pull]
            c) Load Balancers [Active-Passive, Active-Active, Layer 4, Layer 7]
            d) Reverse Proxy
            e) Application layer scaling [Microservices, Service Discovery]
            f) DB [RDBMS, NoSQL]
                    > RDBMS 
                        >> Master-slave, Master-master, Federation, Sharding, Denormalization, SQL Tuning
                    > NoSQL
                        >> Key-Value, Wide-Column, Graph, Document
                            Fast-lookups:
                            -------------
                                >>> RAM  [Bounded size] => Redis, Memcached
                                >>> AP [Unbounded size] => Cassandra, RIAK, Voldemort
                                >>> CP [Unbounded size] => HBase, MongoDB, Couchbase, DynamoDB
            g) Caches
                    > Client caching, CDN caching, Webserver caching, Database caching, Application caching, Cache @Query level, Cache @Object level
                    > Eviction policies:
                            >> Cache aside
                            >> Write through
                            >> Write behind
                            >> Refresh ahead
            h) Asynchronism
                    > Message queues
                    > Task queues
                    > Back pressure
            i) Communication
                    > TCP
                    > UDP
                    > REST
                    > RPC

JUSTIFY [5 min]

(1) Throughput of each layer
(2) Latency caused between each layer
(3) Overall latency justification

muthu-cs / system-design

System Design Basics

Building Blocks of Any Frequently Asked System Design Question

Tools and Technologies

System Design Problems (HLD + LLD)

Low Level Design Problems (Machine Coding Round) Reference

Engineering Blogs Ref

Other Useful Resources:

System Design Interview Approach Template

THINGS TO CONSIDER [5 min]

FEATURE EXPECTATIONS [5 min]

ESTIMATIONS [5 min]

DESIGN GOALS [5 min]

HIGH LEVEL DESIGN [5-10 min]

DEEP DIVE [15-20 min]

JUSTIFY [5 min]

More Resources:

Credit:

About