Hi
@freezing-boots-56761, I was able to expose the metrics port and see metrics in datadog (the datadog agent is set to scrape metrics from pods that are annotated with prometheus annotations). I am trying to set up the core metrics dashboard shown
here. However, I am not able to find the metrics referenced in the grafana json on datadog. The metrics referenced in the json are of the format -
flyte:propeller:all:free_workers_count
flyte:propeller:all:round:abort_error[5m]
flyte:propeller:all:round:system_error_unlabeled[5m]
flyte:propeller:all:node:plugin:.*_failure_unlabeled
flyte:propeller:all:node:plugin:.*_success_unlabeled
flyte:propeller:all:round:raw_unlabeled_ms[5m]
flyte:propeller:all:round:raw_ms[5m]
flyte:propeller:all:round:panic_unlabeled[5m]
flyte:propeller:all:collector:flyteworkflow
flyte:propeller:all:metastore:cache_hit
flyte:propeller:all:metastore:cache_miss
flyte:propeller:all:metastore:head_failure_unlabeled
But on datadog, when I search metrics for "Propeller", I can only see these -
flyte_admin_admin_builder_flytepropeller_build_failures.count
flyte_admin_admin_builder_flytepropeller_build_successes.count
flyte_admin_admin_execution_manager_propeller_failures.count
1. Why aren't we seeing all the metrics for propeller? (for metastore, plugin etc) and
2. Why are the metrics names different, is there some aggregation happening somewhere?