Site Reliability GPT is a Large Language Model backed Site Reliability tool.
The goal of this project is to develop a tool for the Site Reliability Engineering community to use during the operations, administration, and maintenance of the systems and services they build and support.
Rough roadmap:
- Local integration development environment
- Helm chart for K8s cluster deployment
- Ingestion Pipelines for popular SRE docs
- SREGPT ChatEngine
- In-band model evaluation pipelines
- Out-of-band labeling service