hostStausRecord (every 30s) # 10

/home/work/private/vmalert/ecsp-recording-rules/recording_rules.yml

Rule Samples Updated
record: node:node_cpu_usage_percentage:avg_rate2m
avg(1 - rate(node_cpu_seconds_total{mode="idle"}[2m])) by(user, instance) * 100
3 7.937s ago
record: node:node_memory_usage_percentage:sum
(1-(node_memory_MemFree_bytes+node_memory_Cached_bytes+node_memory_Buffers_bytes+node_memory_Slab_bytes)/node_memory_MemTotal_bytes)*100
3 7.936s ago
record: node_systemdisk_usage_percentage
(node_filesystem_size_bytes{mountpoint="/"} - node_filesystem_free_bytes{mountpoint="/"}) / node_filesystem_size_bytes{mountpoint="/"} * 100
3 7.933s ago
record: node_datadisk_usage_percentage
(node_filesystem_size_bytes{mountpoint="/data"} - node_filesystem_free_bytes{mountpoint="/data"}) / node_filesystem_size_bytes{mountpoint="/data"} * 100
0 7.931s ago
record: node_network_receive_bytes:rate2m
rate(node_network_receive_bytes_total[2m]) * 8
21 7.930s ago
record: node_network_transmit_bytes:rate2m
rate(node_network_transmit_bytes_total[2m]) * 8
21 7.929s ago
record: dcgm_fb_usage_percentage
dcgm_fb_used / (dcgm_fb_used + dcgm_fb_free) * 100
0 7.926s ago
record: dcgm_gpu_utilization_percentage
dcgm_gpu_utilization * 100
0 7.925s ago
record: ecsp_netstatus:count
count(ecsp_netstatus) by (user)
0 7.924s ago
record: ecsp_netstatus:sum
sum(ecsp_netstatus) by (user)
0 7.924s ago
ote-ecs-default (every 30s) # 3

/home/work/private/vmalert/prometheus-alerting-rules/default.yml

Rule Samples Updated
alert: NodeCPUUsage (for: 30 seconds)
ote:ecs:node:NodeCPUUsage:avg_rate > 90
Labels: cluster=ecs service=ote group_id=default inhibit=source severity=warning
0 25.482s ago
alert: NodeMemoryUsage (for: 30 seconds)
ote:ecs:node:NodeMemoryUsage:custom > 90
Labels: inhibit=source severity=warning cluster=ecs service=ote group_id=default
0 25.480s ago
alert: NodeFilesystemUsage (for: 30 seconds)
ote:ecs:node:NodeFilesystemUsage:custom > 90
Labels: cluster=ecs service=ote group_id=default inhibit=source severity=warning
0 25.479s ago
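
The listing shows only what the UI renders for this group, not the contents of default.yml. As a rough sketch, assuming the file uses the standard Prometheus-compatible rule file layout (groups, name, interval, rules) that vmalert reads, the first alert of the ote-ecs-default group might be declared as follows. The group name, interval, expression, `for` duration, and labels are taken from the listing; the YAML layout and the absence of annotations are assumptions.

```yaml
# Sketch only: reconstructed from the UI listing, not copied from default.yml.
groups:
  - name: ote-ecs-default
    interval: 30s              # "every 30s" in the listing
    rules:
      - alert: NodeCPUUsage
        # Compares a pre-recorded series against a fixed threshold.
        expr: 'ote:ecs:node:NodeCPUUsage:avg_rate > 90'
        for: 30s               # "for: 30 seconds" in the listing
        labels:
          cluster: ecs
          service: ote
          group_id: default
          inhibit: source
          severity: warning
```

NodeMemoryUsage and NodeFilesystemUsage would follow the same shape, with only the alert name and the series in `expr` changed.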
recording_rules (every 15s) # 3

/home/work/private/vmalert/prometheus-recording-rules/calculation.rules.yml

Rule Samples Updated
record: ote:ecs:node:NodeCPUUsage:avg_rate
(1-avg(rate(node_cpu_seconds_total{mode="idle"}[2m])) by(instance))*100
3 7.946s ago
record: ote:ecs:node:NodeMemoryUsage:custom
(1 - (node_memory_MemAvailable_bytes{} / (node_memory_MemTotal_bytes{})))*100
3 7.945s ago
record: ote:ecs:node:NodeFilesystemUsage:custom
(1-node_filesystem_free_bytes{}/node_filesystem_size_bytes{})*100
22 7.943s ago
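
For reference, a minimal sketch of how the recording_rules group above might look in calculation.rules.yml, assuming the standard Prometheus-compatible rule file layout. The record names, expressions, and the 15s interval come from the listing; the YAML structure and quoting are assumptions, since the file itself is not shown.

```yaml
# Sketch only: reconstructed from the UI listing, not copied from calculation.rules.yml.
groups:
  - name: recording_rules
    interval: 15s              # "every 15s" in the listing
    rules:
      # Per-instance CPU usage: 100 * (1 - average idle rate over 2m).
      - record: ote:ecs:node:NodeCPUUsage:avg_rate
        expr: '(1-avg(rate(node_cpu_seconds_total{mode="idle"}[2m])) by(instance))*100'
      # Memory usage as the share of MemTotal that is not MemAvailable.
      - record: ote:ecs:node:NodeMemoryUsage:custom
        expr: '(1 - (node_memory_MemAvailable_bytes{} / (node_memory_MemTotal_bytes{})))*100'
      # Filesystem usage for every series, with no mountpoint filter.
      - record: ote:ecs:node:NodeFilesystemUsage:custom
        expr: '(1-node_filesystem_free_bytes{}/node_filesystem_size_bytes{})*100'
```

These three recorded series are the ones that the alert expressions in the ote-ecs-default group compare against the threshold of 90.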