How I Instrumented vLLM on Kubernetes: The Dashboards, Queries, and SLOs
A practical observability setup for LLM inference on KServe — and the one-line misconfiguration it caught. LLM serving breaks the assumptions behind ordinary service dashboards. A single "request late
Jun 10, 20268 min read397
