typhoon/addons
Dalton Hubble 178afe4a9b Reduce apiserver metrics cardinality and extraneous labels
* Stop mapping node labels to targets discovered via Kubernetes
nodes (e.g. etcd, kubelet, cadvisor). It is rarely useful to
store node labels (e.g. kubernetes.io/os=linux) on these metrics
* kube-apiserver's apiserver_request_duration_seconds_bucket metric
has a high cardinality that includes labels for the API group, verb,
scope, resource, and component for each object type, including for
each CRD. This one metric has ~10k time series in a typical cluster
(btw 10-40% of total)
* Removing the apiserver request duration outright would make latency
alerts a NoOp and break a Grafana apiserver panel. Instead, drop series
that have a "group" label. Effectively, only request durations for
core Kubernetes APIs will be kept (e.g. cardinality won't grow with
each CRD added). This reduces the metric to ~2k unique series
2019-12-08 22:48:25 -08:00
..
cluo Rename CLUO label from "app" to "name" 2019-10-15 00:05:02 -07:00
grafana Update Grafana from v6.5.0 to v6.5.1 2019-11-28 14:11:25 -08:00
nginx-ingress Update nginx-ingress from v0.25.1 to v0.26.1 2019-09-30 22:01:07 -07:00
prometheus Reduce apiserver metrics cardinality and extraneous labels 2019-12-08 22:48:25 -08:00