Skip to content

Replication and Backup Metrics

This page documents Prometheus metrics related to data replication, backup, restore, load, and unload workflows in Yellowbrick. These operations are essential for durability, disaster recovery, and data movement across environments.

Purpose

These metrics allow you to track:

  • Replication behavior: Duration, cycle counts, errors, retries, throughput per replica
  • Operations: Backup, restore, load, and unload counts and durations
  • Operational health: Success/failure breakdowns and replica state transitions

They are critical for monitoring long-running data operations, detecting replication lag or errors, validating backup freshness, and ensuring recovery readiness.

Metrics

NameTypeFreqLabelsVersion IntroducedVersion DeprecatedDescription
yb_lime_active_backups_durationhistogram5m-7.4.0-Duration of active backups in seconds
yb_lime_active_loads_durationhistogram5m-7.4.0-Duration of active loads in seconds
yb_lime_active_restores_durationhistogram5m-7.4.0-Duration of active restores in seconds
yb_lime_active_unloads_durationhistogram5m-7.4.0-Duration of active unloads in seconds
yb_lime_backup_chain_agehistogram5m-7.3.0-Age in days of each backup
yb_lime_backups_durationhistogram5m-7.3.07.4.0Duration in seconds of active backups
yb_lime_backups_totalcounter5mstatus7.3.0-Number of backups completed in error/success states
yb_lime_completed_backups_durationhistogram5m-7.4.0-Duration of completed backups in seconds
yb_lime_completed_loads_durationhistogram5m-7.4.0-Duration of completed loads in seconds
yb_lime_completed_restores_durationhistogram5m-7.4.0-Duration of completed restores in seconds
yb_lime_completed_unloads_durationhistogram5m-7.4.0-Duration of completed unloads in seconds
yb_lime_loads_durationhistogram5m-7.3.07.4.0Duration in seconds of active loads
yb_lime_loads_totalcounter5mstatus7.3.0-Number of bulk loads completed in error/success states
yb_lime_replica_cycles_totalcounter5mreplica_id7.3.0-Number of replication cycles completed (both success and errors included) for the corresponding replica
yb_lime_replica_elapsed_seconds_totalcounter5mreplica_id7.3.0-Number of seconds spent actively replicating for the corresponding replica (cumulative over all replication cycles)
yb_lime_replica_errored_cycles_totalcounter5mreplica_id7.3.0-Number of replication cycles that ended with error for the corresponding replica
yb_lime_replica_retries_totalcounter5mreplica_id7.3.0-Number of retries for the corresponding replica (cumulative over all replication cycles)
yb_lime_replica_sent_bytes_totalcounter5mreplica_id7.3.0-Number of bytes sent for the corresponding replica (cumulative over all replication cycles)
yb_lime_replica_statesgauge5mstate7.3.0-Number of replicas in each state
yb_lime_replica_written_bytes_totalcounter5mreplica_id7.3.0-Number of bytes written for the corresponding replica (cumulative over all replication cycles)
yb_lime_restores_durationhistogram5m-7.3.07.4.0Duration in seconds of active restores
yb_lime_restores_totalcounter5mstatus7.3.0-Number of restores completed in error/success states
yb_lime_unloads_durationhistogram5m-7.3.07.4.0Duration in seconds of active unloads
yb_lime_unloads_totalcounter5mstatus7.3.0-Number of bulk unloads completed in error/success states