There are 112 repositories under the sre topic.
Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions
A curated list of amazingly awesome open-source sysadmin resources.
A curated list of Site Reliability and Production Engineering resources.
DevOps Roadmap for 2024, with learning resources
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
At LinkedIn, we use this curriculum to onboard entry-level talent into the SRE role.
Terraform Pull Request Automation
Site Reliability Engineer Interview Preparation Guide
Compilation of public failure/horror stories related to Kubernetes
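⭐ [Open-source book] An in-depth explanation of kernel networking, Kubernetes, ServiceMesh, containers, and other cloud-native technologies. A field-tested DevOps and SRE guide. If you find an error, please open an issue.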
StackStorm (aka "IFTTT for Ops") is event-driven automation for auto-remediation, incident responses, troubleshooting, deployments, and more for DevOps and SREs. Includes rules engine, workflow, 160 integration packs with 6000+ actions (see https://exchange.stackstorm.org) and ChatOps. Installer at https://docs.stackstorm.com/install/index.html
CDN Up and Running - Building a CDN from Scratch to Learn about CDN, Nginx, Lua, Prometheus, Grafana, Load balancing, and Containers.
A checklist for anyone practicing Site Reliability Engineering
Chaos Engineering Toolkit & Orchestration for Developers
A curated list of awesome DevOps platforms, tools, practices and resources
[Moved to cloudprober/cloudprober] Active monitoring software to detect failures before your customers do.
A collection of postmortem templates
Layerform helps engineers create reusable environment stacks using plain .tf files. Ideal for multiple "staging" environments.
A curated list of Site Reliability and Production Engineering Tools
Web UI for Jaeger
A collection of git utilities, useful extra git scripts, tutorials and other useful articles.
Azure Terraform SRE framework
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world use Amazon Web Services (AWS)
Kubernetes utility for exposing image versions in use, compared to the latest available upstream, as metrics.
NixOS Guide. Learn all about the immutable Nix Operating System and the declarative Nix Expression Language.
A DevOps/SRE community for people who want to learn or explore DevOps with the help of experienced professionals. Opportunities are open to share.
A reading and viewing list for larval stage SREs and sysadmins