steep-parrot-14561
02/15/2023, 1:12 AMcontainer_cpu_usage_seconds_total
and kube_pod_container_resource_limits_memory_bytes
. I am trying to monitor these metrics for all pods that are generated by Flyte executions. It is kind of working but sometimes an execution is simply not scraped by Prometheus: the metric is empty. I have a workflow that runs daily and today's execution's metrics are there, but yesterday's are not. As it's the same workflow and it has not changed since yesterday I really don't know what could be happening. Can anyone help me?
This is the Prometheus config file regarding kubelet:
kubelet:
enabled: true
namespace: kube-system
serviceMonitor:
interval: ""
proxyUrl: ""
https: true
cAdvisor: true
probes: true
resource: false
resourcePath: "/metrics/resource/v1alpha1"
cAdvisorMetricRelabelings: []
probesMetricRelabelings: []
cAdvisorRelabelings:
- sourceLabels: [__metrics_path__]
targetLabel: metrics_path
probesRelabelings:
- sourceLabels: [__metrics_path__]
targetLabel: metrics_path
resourceRelabelings:
- sourceLabels: [__metrics_path__]
targetLabel: metrics_path
metricRelabelings: []
relabelings:
- sourceLabels: [__metrics_path__]
targetLabel: metrics_path
tall-lock-23197
quaint-diamond-37493
02/15/2023, 8:05 AMsteep-parrot-14561
02/15/2023, 3:37 PMquaint-diamond-37493
02/15/2023, 6:32 PMhallowed-mouse-14616
02/16/2023, 2:39 PMsteep-parrot-14561
02/17/2023, 8:21 PM