Nikokolas3270 / o11y

Stonesoup observability

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

RHTAP Observability

This repository is maintained by the RHTAP O11Y team.

The repository contains Prometheus alert rules files for monitoring RHTAP data plane clusters along with their tests.

Control plane clusters alert rules are maintained by the same team, but are kept in a different repository

Updating Data Plane Alerts

Alert rules for data plane clusters are being deployed by app-interface to RHOBS, to where the data plane metrics are also being forwarded. For deploying the alert rules, app-interface references the location of the rules together with a git reference - branch name or commit hash.

It holds separate references to both staging and production RHOBS instances (monitoring RHTAP staging and production deployments). For both environments, we maintain the reference to the rules as a commit hash (rather than a branch). This means that any changes to the rules will not take effect until the references are updated.

Steps for updating the rules:

  1. Merge the necessary changes to this repository - alerts and tests.
  2. Update the staging environment reference in app-interface to the commit hash of the changes you made.
  3. Once merged and ready to be promoted to production, update the production environment reference in a similar manner.

About

Stonesoup observability

License:Apache License 2.0


Languages

Language:Makefile 100.0%