There are 19 repositories under fault-tolerance topic.
These are the best resources for System Design on the Internet
Dkron - Distributed, fault tolerant job scheduling system https://dkron.io
Highly-available Distributed Fault-tolerant Runtime
Service Discovery and Governance Platform for Microservice and Distributed Architecture
A list of papers about distributed consensus.
List of Elixir books
Asynchronous & Fault-tolerant PHP Framework for Distributed Applications.
Simmy is a chaos-engineering and fault-injection tool, integrating with the Polly resilience project for .NET
**No Longer Maintained** Official RAMCloud repo
Notes on Lindsey Kuper's lectures on Distributed Systems
Python Actor concurrency library
Serverless chaos monkey for AWS (runs on AWS Lambda) ☁️ 💥
A daemon, running in background on a Linux router or firewall, monitoring the state of multiple internet uplinks/providers and changing the routing accordingly. LAN/DMZ internet traffic is load balanced between the uplinks.
ZIO-native utilities for making resilient distributed systems
Simple, Erlang-inspired fault-tolerance framework for Rust Futures.
Polly.Contrib.WaitAndRetry is an extension library for Polly containing helper methods for a variety of wait-and-retry strategies.
Lightweight Java SDK used as Proxyless Service Governance
A microservices example developed with Spring Cloud and Vaadin