Rules

containers

2.042s ago

1.512ms

Rule State Error Last Evaluation Evaluation Time
alert: graphnode_down expr: absent((time() - container_last_seen{name="graph-node"}) < 10) for: 30s labels: severity: critical annotations: description: Graph Node container is down for more than 30 seconds. summary: Graph Node down ok 2.042s ago 394.5us
alert: graphnode_high_cpu expr: sum(rate(container_cpu_usage_seconds_total{name="graph-node"}[1m])) / count(node_cpu_seconds_total{mode="system"}) * 100 > 10 for: 30s labels: severity: warning annotations: description: Graph Node CPU usage is {{ humanize $value}}%. summary: Graph Node high CPU usage ok 2.042s ago 179.4us
alert: graphnode_high_memory expr: sum(container_memory_usage_bytes{name="graph-node"}) > 1.2e+09 for: 30s labels: severity: warning annotations: description: Graph Node memory consumption is at {{ humanize $value}}. summary: Graph Node high memory usage ok 2.042s ago 67.41us
alert: postgres_down expr: absent((time() - container_last_seen{name="postgres"}) < 10) for: 30s labels: severity: critical annotations: description: Postgres container is down for more than 30 seconds. summary: Postgres down ok 2.042s ago 198.6us
alert: postgres_high_cpu expr: sum(rate(container_cpu_usage_seconds_total{name="postgres"}[1m])) / count(node_cpu_seconds_total{mode="system"}) * 100 > 10 for: 30s labels: severity: warning annotations: description: Postgres CPU usage is {{ humanize $value}}%. summary: Postgres high CPU usage ok 2.042s ago 114.4us
alert: postgres_high_memory expr: sum(container_memory_usage_bytes{name="postgres"}) > 1.2e+09 for: 30s labels: severity: warning annotations: description: Postgres memory consumption is at {{ humanize $value}}. summary: Postgres high memory usage ok 2.042s ago 51.67us
alert: nginx_down expr: absent((time() - container_last_seen{name="nginx-proxy"}) < 10) for: 30s labels: severity: critical annotations: description: Nginx container is down for more than 30 seconds. summary: Nginx down ok 2.042s ago 80.04us
alert: nginx_high_cpu expr: sum(rate(container_cpu_usage_seconds_total{name="nginx-proxy"}[1m])) / count(node_cpu_seconds_total{mode="system"}) * 100 > 10 for: 30s labels: severity: warning annotations: description: PostNginxgres CPU usage is {{ humanize $value}}%. summary: Nginx high CPU usage ok 2.042s ago 116.8us
alert: nginx_high_memory expr: sum(container_memory_usage_bytes{name="nginx-proxy"}) > 1.2e+09 for: 30s labels: severity: warning annotations: description: Nginx memory consumption is at {{ humanize $value}}. summary: Nginx high memory usage ok 2.042s ago 63.03us
alert: caddy_down expr: absent((time() - container_last_seen{name="caddy"}) < 10) for: 30s labels: severity: critical annotations: description: Caddy container is down for more than 30 seconds. summary: Caddy down ok 2.042s ago 128.9us
alert: caddy_high_cpu expr: sum(rate(container_cpu_usage_seconds_total{name="caddy"}[1m])) / count(node_cpu_seconds_total{mode="system"}) * 100 > 10 for: 30s labels: severity: warning annotations: description: Caddy CPU usage is {{ humanize $value}}%. summary: Caddy high CPU usage ok 2.043s ago 76.16us
alert: caddy_high_memory expr: sum(container_memory_usage_bytes{name="caddy"}) > 1.2e+09 for: 30s labels: severity: warning annotations: description: Caddy memory consumption is at {{ humanize $value}}. summary: Caddy high memory usage ok 2.043s ago 25.45us

host

11.823s ago

492.9us

Rule State Error Last Evaluation Evaluation Time
alert: high_cpu_load expr: node_load1 > 1.5 for: 30s labels: severity: warning annotations: description: Docker host is under high load, the avg load 1m is at {{ $value}}. Reported by instance {{ $labels.instance }} of job {{ $labels.job }}. summary: Server under high load ok 11.823s ago 172.4us
alert: high_memory_load expr: (sum(node_memory_MemTotal_bytes) - sum(node_memory_MemFree_bytes + node_memory_Buffers_bytes + node_memory_Cached_bytes)) / sum(node_memory_MemTotal_bytes) * 100 > 85 for: 30s labels: severity: warning annotations: description: Docker host memory usage is {{ humanize $value}}%. Reported by instance {{ $labels.instance }} of job {{ $labels.job }}. summary: Server memory is almost full ok 11.823s ago 192.9us
alert: high_storage_load expr: (node_filesystem_size_bytes{fstype="aufs"} - node_filesystem_free_bytes{fstype="aufs"}) / node_filesystem_size_bytes{fstype="aufs"} * 100 > 85 for: 30s labels: severity: warning annotations: description: Docker host storage usage is {{ humanize $value}}%. Reported by instance {{ $labels.instance }} of job {{ $labels.job }}. summary: Server storage is almost full ok 11.823s ago 119.1us

targets

5.285s ago

795.5us

Rule State Error Last Evaluation Evaluation Time
alert: monitor_service_down expr: up == 0 for: 30s labels: severity: critical annotations: description: Service {{ $labels.instance }} is down. summary: Monitor service non-operational ok 5.285s ago 787.5us