王天庆's starred repositories
anything-llm
The all-in-one Desktop & Docker AI application with full RAG and AI Agent capabilities.
Megatron-LM
Ongoing research training transformer models at scale
node-problem-detector
A collection of problem detectors that run on Kubernetes nodes.
k8s-device-plugin
NVIDIA device plugin for Kubernetes
shell-operator
Shell-operator is a tool for running event-driven scripts in a Kubernetes cluster
OpenFunction
Cloud Native Function-as-a-Service Platform (CNCF Sandbox Project)
awesome-log-analysis
A list of awesome research on log analysis, anomaly detection, fault localization, and AIOps
sriov-network-device-plugin
SR-IOV network device plugin for Kubernetes
whereabouts
A CNI IPAM plugin that assigns IP addresses cluster-wide
lifecycle-toolkit
Toolkit for cloud-native application lifecycle management
explore-logs
Repo for the Loki log exploration app
awesome-AIOps
A curated list of awesome academic research and industrial materials about Artificial Intelligence for IT Operations (AIOps).
knavigator
knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.
numalogic-prometheus
AIOps for metrics in Prometheus
aiops-modules
AIOps modules is a collection of reusable Infrastructure as Code (IaC) modules for Machine Learning (ML), Foundation Model (FM), Large Language Model (LLM), and GenAI development and operations on AWS.