Compute Cluster Metrics

This page documents Prometheus metrics emitted by the Compute Cluster component, which manages distributed query execution across compute nodes in a Yellowbrick deployment.

Purpose

These metrics are used to monitor the health, performance, and resource usage of compute clusters. They provide visibility into:

  • Query activity and throughput
  • Memory and CPU utilization
  • Query compilation and execution times
  • Cluster degradation or compute node loss

They are critical for performance tuning, detecting resource contention, and ensuring cluster reliability at scale.
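
As a rough illustration of how these metrics might be consumed, the sketch below parses a small piece of Prometheus text exposition output, lists clusters that report missing workers, and computes the ratio of used to granted query memory per pool. The sample scrape text, cluster names, pool names, and values are invented for illustration, and the helper names (`parse_samples`, `clusters_missing_workers`, `memory_utilization`) are hypothetical. The parser is deliberately minimal and skips lines it does not recognize, such as samples that carry a timestamp.

```python
import re

# Hypothetical scrape output; cluster names, pool names, and values are invented.
SAMPLE = """\
# TYPE yb_lime_cluster_missing_workers gauge
yb_lime_cluster_missing_workers{cluster="etl"} 0
yb_lime_cluster_missing_workers{cluster="reporting"} 2
yb_lime_query_memory_granted{cluster="etl",pool="default"} 8000000000
yb_lime_query_memory_used{cluster="etl",pool="default"} 6000000000
"""

# name, optional {labels}, then a single value (no timestamp support in this sketch)
LINE_RE = re.compile(
    r'^(?P<name>[a-zA-Z_:][a-zA-Z0-9_:]*)(?:\{(?P<labels>.*)\})?\s+(?P<value>\S+)$'
)
LABEL_RE = re.compile(r'(\w+)="([^"]*)"')


def parse_samples(text):
    """Return a list of (name, labels, value) tuples, skipping comments."""
    samples = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        match = LINE_RE.match(line)
        if match is None:
            continue  # unrecognized line shape; ignored in this sketch
        labels = dict(LABEL_RE.findall(match.group("labels") or ""))
        samples.append((match.group("name"), labels, float(match.group("value"))))
    return samples


def clusters_missing_workers(samples):
    """Map cluster name -> missing worker count, for clusters with any missing."""
    return {
        labels.get("cluster", ""): int(value)
        for name, labels, value in samples
        if name == "yb_lime_cluster_missing_workers" and value > 0
    }


def memory_utilization(samples):
    """Map (cluster, pool) -> used / granted query memory."""
    granted, used = {}, {}
    for name, labels, value in samples:
        key = (labels.get("cluster"), labels.get("pool"))
        if name == "yb_lime_query_memory_granted":
            granted[key] = value
        elif name == "yb_lime_query_memory_used":
            used[key] = value
    return {k: used[k] / g for k, g in granted.items() if g > 0 and k in used}


samples = parse_samples(SAMPLE)
print(clusters_missing_workers(samples))
print(memory_utilization(samples))
```

In practice these checks would more likely be expressed as alerting rules in Prometheus itself; the Python form is only meant to make the label structure and the arithmetic explicit.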

Metrics

| Name | Type | Freq | Labels | Description |
|------|------|------|--------|-------------|
| yb_lime_active_queries | gauge | 10s | cluster, pool | Number of active queries |
| yb_lime_cluster_degraded_status | gauge | 10s | cluster, status, reason | Cluster degraded status |
| yb_lime_cluster_missing_workers | gauge | 10s | cluster | Number of missing workers in each cluster |
| yb_lime_cluster_state | gauge | 10s | cluster, state, reason | Cluster state |
| yb_lime_queries_completed_backend_total | counter | 10s | cluster, pool, state | Total number of queries completed via backend |
| yb_lime_queries_completed_total | counter | 10s | cluster, state | Total number of queries completed |
| yb_lime_queries_submitted_total | counter | 10s | cluster | Total number of queries submitted |
| yb_lime_query_bytes_network | histogram | 10s | cluster | Bytes network |
| yb_lime_query_bytes_read | histogram | 10s | cluster | Bytes read |
| yb_lime_query_bytes_read_spill | histogram | 10s | cluster | Bytes read spill |
| yb_lime_query_bytes_written | histogram | 10s | cluster | Bytes written |
| yb_lime_query_bytes_written_spill | histogram | 10s | cluster | Bytes written spill |
| yb_lime_query_cache_efficiency | histogram | 10s | cluster | Query cache efficiency |
| yb_lime_query_compile_time | histogram | 10s | cluster | Query compile time in seconds |
| yb_lime_query_cpu_usage | gauge | 10s | cluster, pool | Duration weighted average of longest worker CPU query usage as percentage of allocated CPU |
| yb_lime_query_lock_time | histogram | 10s | cluster | Query lock time in seconds |
| yb_lime_query_memory_granted | gauge | 10s | cluster, pool | Total memory granted to queries in bytes |
| yb_lime_query_memory_used | gauge | 10s | cluster, pool | Total memory used by queries in bytes |
| yb_lime_query_run_time | histogram | 10s | cluster | Query run time in seconds |
| yb_lime_query_total_time | histogram | 10s | cluster | Query total time in seconds |
| yb_lime_query_wait_time | histogram | 10s | cluster | Query wait time in seconds |
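
Several of the metrics above are histograms, which Prometheus exposes as cumulative bucket counts keyed by an upper bound (the `le` label). The sketch below shows one way a percentile, for example the 90th percentile of `yb_lime_query_run_time`, could be estimated from such buckets by linear interpolation, in the same spirit as PromQL's `histogram_quantile` function. The bucket boundaries and counts are assumptions made up for this example (the actual bucket layout is not documented here), and `estimate_quantile` is a hypothetical helper name.

```python
import math


def estimate_quantile(q, buckets):
    """Estimate the q-quantile from cumulative histogram buckets.

    buckets: iterable of (upper_bound, cumulative_count) pairs, including
    a final (math.inf, total_count) bucket.
    """
    if not 0 <= q <= 1:
        raise ValueError("q must be between 0 and 1")
    ordered = sorted(buckets)
    total = ordered[-1][1]
    if total == 0:
        return math.nan  # no observations recorded
    rank = q * total
    prev_bound, prev_count = 0.0, 0.0  # assume the first bucket starts at 0
    for bound, count in ordered:
        if count >= rank:
            if math.isinf(bound):
                # The rank falls in the open-ended bucket; report the
                # highest finite bound rather than infinity.
                return prev_bound
            if count == prev_count:
                return prev_bound  # empty bucket; avoid dividing by zero
            # Interpolate linearly inside the bucket that contains the rank.
            fraction = (rank - prev_count) / (count - prev_count)
            return prev_bound + (bound - prev_bound) * fraction
        prev_bound, prev_count = bound, count
    return prev_bound


# Hypothetical cumulative buckets for query run time, in seconds.
RUN_TIME_BUCKETS = [
    (0.1, 50),
    (0.5, 80),
    (1.0, 95),
    (5.0, 100),
    (math.inf, 100),
]

p90 = estimate_quantile(0.9, RUN_TIME_BUCKETS)
print(f"estimated p90 run time: {p90:.3f}s")
```

The estimate is only as precise as the bucket layout allows: all observations within a bucket are treated as evenly spread between its bounds.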