There are 51 repositories under the sre topic.
Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions
A curated list of Site Reliability and Production Engineering resources.
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
Compilation of public failure/horror stories related to Kubernetes
At LinkedIn, we use this curriculum to onboard our entry-level talent into the SRE role.
Terraform Pull Request Automation
Enable Self-Service Operations: Give specific users access to your existing tools, services, and scripts
Site Reliability Engineer Interview Preparation Guide
Chaos Engineering Toolkit & Orchestration for Developers
[Moved to cloudprober/cloudprober] An active monitoring software to detect failures before your customers do.
A Frida-based tool that traces usage of the JNI API in Android apps.
A collection of postmortem templates
Web UI for Jaeger
A framework for gradual system automation
A curated list of Site Reliability and Production Engineering Tools
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world use Amazon Web Services (AWS)
A curated list of awesome DevOps tools, platforms and resources
Kubernetes utility for exposing image versions in use, compared to latest available upstream, as metrics.
Squzy is a high-performance open-source monitoring, incident, and alert system written in Golang with Bazel and love. Welcome to free SRE.
What to Read to Learn More About DevOps
Linux Bash shell scripts and Python scripts for Ops and DevOps
🔖 Guides, Articles, Podcasts, Videos and Notes to Build Reliable Large-Scale Distributed Systems.
Modern TCP tool and service for network performance observability.
Cloud Operations Sandbox is an open source tool that helps practitioners learn Service Reliability Engineering practices from Google and apply them to their cloud services using the Cloud Operations suite of tools.
A reading and viewing list for larval stage sysadmins and SREs
Curated list of good SRE interview questions.
Collection of AWS SSM Documents to perform Chaos Engineering experiments
Guidance on how to make your environment easier to onboard for Web Ops Engineers, SREs, and DevOps Practitioners
Marmot workflow execution engine
Notes on Site Reliability Engineering. Leave a 🌟 if you found this useful!
Engine used by jnitrace to intercept JNI API calls.
An active monitoring software to detect failures before your customers do.