There are 75 repositories under sre topic.
Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions
A curated list of Site Reliability and Production Engineering resources.
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
Compilation of public failure/horror stories related to Kubernetes
At LinkedIn, we are using this curriculum for onboarding our entry-level talents into the SRE role.
Terraform Pull Request Automation
StackStorm (aka "IFTTT for Ops") is event-driven automation for auto-remediation, incident responses, troubleshooting, deployments, and more for DevOps and SREs. Includes rules engine, workflow, 160 integration packs with 6000+ actions (see https://exchange.stackstorm.org) and ChatOps. Installer at https://docs.stackstorm.com/install/index.html
Enable Self-Service Operations: Give specific users access to your existing tools, services, and scripts
Site Reliability Engineer Interview Preparation Guide
DevOps Roadmap for 2023. with learning resources
Chaos Engineering Toolkit & Orchestration for Developers
[Moved to cloudprober/cloudprober] An active monitoring software to detect failures before your customers do.
A checklist of anyone practicing Site Reliability Engineering
Cloud Native DataOps & AIOps Platform | 云原生数智运维平台
A Frida based tool that traces usage of the JNI API in Android apps.
A collection of postmortem templates
Web UI for Jaeger
A curated list of awesome DevOps platforms, tools, practices and resources
A curated list of Site Reliability and Production Engineering Tools
A framework for gradual system automation
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world use Amazon Web Services (AWS)
Kubernetes utility for exposing image versions in use, compared to latest available upstream, as metrics.
Squzy - is a high-performance open-source monitoring, incident and alert system written in Golang with Bazel and love. Welcome to free SRE
What to Read to Learn More About DevOps
DevOps/SRE community is for those folks who are trying to learn or explore DevOps with the help of experienced professionals. Opportunities are open to share.
🔖 Guides, Articles, Podcasts, Videos and Notes to Build Reliable Large-Scale Distributed Systems.
A reading and viewing list for larval stage sysadmins and SREs
Linux Bash Shell Script and Python Script For Ops and Devops
Cloud Operations Sandbox is an open source collection of tools that helps practitioners to learn O11y and R9y practices from Google and apply them using Cloud Operations suite of tools.
Modern TCP tool and service for network performance observability.
Curated list of good SRE interview questions.
An active monitoring software to detect failures before your customers do.
Repositório para compartilhamento de conteúdo Gratuito sobre DevOps