Rules

containers

1.915s ago

2.054ms

Rule State Error Last Evaluation Evaluation Time
alert: graphnode_down expr: absent((time() - container_last_seen{name="graph-node"}) < 10) for: 30s labels: severity: critical annotations: description: Graph Node container is down for more than 30 seconds. summary: Graph Node down ok 1.915s ago 437.6us
alert: graphnode_high_cpu expr: sum(rate(container_cpu_usage_seconds_total{name="graph-node"}[1m])) / count(node_cpu_seconds_total{mode="system"}) * 100 > 10 for: 30s labels: severity: warning annotations: description: Graph Node CPU usage is {{ humanize $value}}%. summary: Graph Node high CPU usage ok 1.915s ago 209.5us
alert: graphnode_high_memory expr: sum(container_memory_usage_bytes{name="graph-node"}) > 1.2e+09 for: 30s labels: severity: warning annotations: description: Graph Node memory consumption is at {{ humanize $value}}. summary: Graph Node high memory usage ok 1.915s ago 99.19us
alert: postgres_down expr: absent((time() - container_last_seen{name="postgres"}) < 10) for: 30s labels: severity: critical annotations: description: Postgres container is down for more than 30 seconds. summary: Postgres down ok 1.914s ago 199.7us
alert: postgres_high_cpu expr: sum(rate(container_cpu_usage_seconds_total{name="postgres"}[1m])) / count(node_cpu_seconds_total{mode="system"}) * 100 > 10 for: 30s labels: severity: warning annotations: description: Postgres CPU usage is {{ humanize $value}}%. summary: Postgres high CPU usage ok 1.914s ago 131.8us
alert: postgres_high_memory expr: sum(container_memory_usage_bytes{name="postgres"}) > 1.2e+09 for: 30s labels: severity: warning annotations: description: Postgres memory consumption is at {{ humanize $value}}. summary: Postgres high memory usage ok 1.914s ago 91.5us
alert: nginx_down expr: absent((time() - container_last_seen{name="nginx-proxy"}) < 10) for: 30s labels: severity: critical annotations: description: Nginx container is down for more than 30 seconds. summary: Nginx down ok 1.914s ago 145.8us
alert: nginx_high_cpu expr: sum(rate(container_cpu_usage_seconds_total{name="nginx-proxy"}[1m])) / count(node_cpu_seconds_total{mode="system"}) * 100 > 10 for: 30s labels: severity: warning annotations: description: PostNginxgres CPU usage is {{ humanize $value}}%. summary: Nginx high CPU usage ok 1.914s ago 230.5us
alert: nginx_high_memory expr: sum(container_memory_usage_bytes{name="nginx-proxy"}) > 1.2e+09 for: 30s labels: severity: warning annotations: description: Nginx memory consumption is at {{ humanize $value}}. summary: Nginx high memory usage ok 1.914s ago 137.5us
alert: caddy_down expr: absent((time() - container_last_seen{name="caddy"}) < 10) for: 30s labels: severity: critical annotations: description: Caddy container is down for more than 30 seconds. summary: Caddy down ok 1.914s ago 178.4us
alert: caddy_high_cpu expr: sum(rate(container_cpu_usage_seconds_total{name="caddy"}[1m])) / count(node_cpu_seconds_total{mode="system"}) * 100 > 10 for: 30s labels: severity: warning annotations: description: Caddy CPU usage is {{ humanize $value}}%. summary: Caddy high CPU usage ok 1.914s ago 118.2us
alert: caddy_high_memory expr: sum(container_memory_usage_bytes{name="caddy"}) > 1.2e+09 for: 30s labels: severity: warning annotations: description: Caddy memory consumption is at {{ humanize $value}}. summary: Caddy high memory usage ok 1.914s ago 51.62us

host

11.694s ago

614.9us

Rule State Error Last Evaluation Evaluation Time
alert: high_cpu_load expr: node_load1 > 1.5 for: 30s labels: severity: warning annotations: description: Docker host is under high load, the avg load 1m is at {{ $value}}. Reported by instance {{ $labels.instance }} of job {{ $labels.job }}. summary: Server under high load ok 11.694s ago 230.5us
alert: high_memory_load expr: (sum(node_memory_MemTotal_bytes) - sum(node_memory_MemFree_bytes + node_memory_Buffers_bytes + node_memory_Cached_bytes)) / sum(node_memory_MemTotal_bytes) * 100 > 85 for: 30s labels: severity: warning annotations: description: Docker host memory usage is {{ humanize $value}}%. Reported by instance {{ $labels.instance }} of job {{ $labels.job }}. summary: Server memory is almost full ok 11.694s ago 256.1us
alert: high_storage_load expr: (node_filesystem_size_bytes{fstype="aufs"} - node_filesystem_free_bytes{fstype="aufs"}) / node_filesystem_size_bytes{fstype="aufs"} * 100 > 85 for: 30s labels: severity: warning annotations: description: Docker host storage usage is {{ humanize $value}}%. Reported by instance {{ $labels.instance }} of job {{ $labels.job }}. summary: Server storage is almost full ok 11.694s ago 111.7us

targets

5.157s ago

1.014ms

Rule State Error Last Evaluation Evaluation Time
alert: monitor_service_down expr: up == 0 for: 30s labels: severity: critical annotations: description: Service {{ $labels.instance }} is down. summary: Monitor service non-operational ok 5.157s ago 1.005ms