Diferències
Ací es mostren les diferències entre la revisió seleccionada i la versió actual de la pàgina.
| Ambdós costats versió prèvia Revisió prèvia Següent revisió | Revisió prèvia | ||
| info:cursos:pue:devops:sesion7 [09/03/2019 00:24] – mate | info:cursos:pue:devops:sesion7 [04/06/2019 08:36] (actual) – [prometheus] mate | ||
|---|---|---|---|
| Línia 289: | Línia 289: | ||
| </ | </ | ||
| + | <code yaml; swarm_node.rules.yaml> | ||
| + | groups: | ||
| + | - name: / | ||
| + | rules: | ||
| + | - alert: node_cpu_usage | ||
| + | expr: 100 - (avg(irate(node_cpu_seconds_total{mode=" | ||
| + | node_meta * 100) BY (node_name)) > 50 | ||
| + | for: 1m | ||
| + | labels: | ||
| + | severity: warning | ||
| + | annotations: | ||
| + | description: | ||
| + | $value}}%. | ||
| + | summary: CPU alert for Swarm node '{{ $labels.node_name }}' | ||
| + | - alert: node_memory_usage | ||
| + | expr: sum(((node_memory_MemTotal_bytes - node_memory_MemAvailable_bytes) / node_memory_MemTotal_bytes) | ||
| + | * ON(instance) GROUP_LEFT(node_name) node_meta * 100) BY (node_name) > 80 | ||
| + | for: 1m | ||
| + | labels: | ||
| + | severity: warning | ||
| + | annotations: | ||
| + | description: | ||
| + | $value}}%. | ||
| + | summary: Memory alert for Swarm node '{{ $labels.node_name }}' | ||
| + | - alert: node_disk_usage | ||
| + | expr: ((node_filesystem_size_bytes{mountpoint="/ | ||
| + | * 100 / node_filesystem_size_bytes{mountpoint="/ | ||
| + | node_meta > 85 | ||
| + | for: 1m | ||
| + | labels: | ||
| + | severity: warning | ||
| + | annotations: | ||
| + | description: | ||
| + | $value}}%. | ||
| + | summary: Disk alert for Swarm node '{{ $labels.node_name }}' | ||
| + | - alert: node_disk_fill_rate_6h | ||
| + | expr: predict_linear(node_filesystem_free_bytes{mountpoint="/ | ||
| + | GROUP_LEFT(node_name) node_meta < 0 | ||
| + | for: 1h | ||
| + | labels: | ||
| + | severity: critical | ||
| + | annotations: | ||
| + | description: | ||
| + | 6h. | ||
| + | summary: Disk fill alert for Swarm node '{{ $labels.node_name }}' | ||
| + | |||
| + | </ | ||
| + | <code yaml; swarm_tasks.rules.yaml> | ||
| + | groups: | ||
| + | - name: / | ||
| + | rules: | ||
| + | - alert: task_high_cpu_usage_50 | ||
| + | expr: sum(rate(container_cpu_usage_seconds_total{container_label_com_docker_swarm_task_name=~" | ||
| + | BY (container_label_com_docker_swarm_task_name, | ||
| + | * 100 > 50 | ||
| + | for: 1m | ||
| + | annotations: | ||
| + | description: | ||
| + | $labels.container_label_com_docker_swarm_node_id }}'' | ||
| + | $value}}%.' | ||
| + | summary: CPU alert for Swarm task '{{ $labels.container_label_com_docker_swarm_task_name | ||
| + | }}' on '{{ $labels.container_label_com_docker_swarm_node_id }}' | ||
| + | - alert: task_high_memory_usage_1g | ||
| + | expr: sum(container_memory_rss{container_label_com_docker_swarm_task_name=~" | ||
| + | BY (container_label_com_docker_swarm_task_name, | ||
| + | for: 1m | ||
| + | annotations: | ||
| + | description: | ||
| + | $labels.container_label_com_docker_swarm_node_id }}'' | ||
| + | $value}}.' | ||
| + | summary: Memory alert for Swarm task '{{ $labels.container_label_com_docker_swarm_task_name | ||
| + | }}' on '{{ $labels.container_label_com_docker_swarm_node_id }}' | ||
| + | </ | ||
| == otros | == otros | ||
| * [[http:// | * [[http:// | ||