harpia-io / harpia

Main repository

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Open Source Incident Management system

In our reality, with an existing speed of development and product requirements, we need to make sure that critical problems will be detected in time or even predict the issue and prevent it. Since it can cost millions of dollars, and you can lose your customers.

This platform provides the single place to detect -> investigate -> resolve your production incident

Platform collects events from different systems, allows to manage them in a single place and set the automation flow to make sure that event will be noticed and handled ASAP

How to Install

There are few options to install platform:

More details for each option you can find in our documentation

How to collaborate

Roadmap

Official Roadmap 2023 you can find here - https://github.com/orgs/harpia-io/projects/1

Platform Architecture

harp-architecture.drawio.svg

Technical flow to process alerts

  • harp-collectors: receive alerts from monitoring system, unify the structure and push them to Kafka topic
  • harp-alert-decorator: read alert from Kafka topic (produced by harp-collectors) and add additional info about environments and scenarios that should be applied to the alert
  • harp-daemon: read alert from Kafka topic (produced by harp-alert-decorator), describe the logic and state of the alert and write result to MariaDB
  • harp-aggregator: read alerts from MariaDB, aggregate it and send to Aerospike
  • harp-bridge: read alerts from Aerospike and send to UI via websockets
  • harp-ui: the main user interface of the platform

Additional Services

Platform Monitoring

  • Prometheus metrics in VictoriaMetrics
  • Traces in Grafana Tempo
  • Logs in Grafana Loki
  • Dashboards and Alerts in Grafana

About

Main repository

License:GNU Affero General Public License v3.0


Languages

Language:Shell 100.0%