There are 4 repositories under cluster-management topic.
Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters
Cruise-control is the first of its kind to fully automate the dynamic workload rebalance and self-healing of a Kafka cluster. It provides great value to Kafka users by simplifying the operation of Kafka clusters.
[CNCF Sandbox Project] Managing your Kubernetes clusters (including public, private, edge, etc.) as easily as visiting the Internet
Example implementations of Hashicorp memberlist
Kubernetes Dynamic Log Tailing Utility
Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines
CORTX Hare configures Motr object store, starts/stops Motr services, and notifies Motr of service and device faults.
Kafka Manager - A tool for managing Apache Kafka.
HPC cluster deployment and management for the Hetzner Cloud
core application to be used in the background - only for core development. you may watch out for the dummy package
Command-line application for FRµIT Management Server
Rainbow is a tool that lets you easily run repetitive or tedious batch jobs across your cluster.
ASSHer is asynchronous SSH/SFTP massive runner.
Manage configs and policies across multi-cluster and hybrid Kubernetes environments.
Run any bash (shell) command on your cluster easily
Portainer Controller, a poor man's kubectl
Manage your Kafka Cluster(s) and swarms of those clusters with a simple YAML file along with centralizing your topic and ACL configurations.