Anthropic Claudeその他⭐ リポ 0品質スコア 50/100

grafana-dashboards

Name: grafana-dashboards
Author: wshobson

本番環境向けのGrafanaダッシュボードを作成・管理し、システムやアプリケーションのメトリクスをリアルタイムで可視化します。監視ダッシュボードの構築、メトリクスの可視化、運用オブザーバビリティ画面の作成時に使用してください。

description の原文を見る

Create and manage production Grafana dashboards for real-time visualization of system and application metrics. Use when building monitoring dashboards, visualizing metrics, or creating operational observability interfaces.

SKILL.md 本文

Grafana ダッシュボード

包括的なシステム可観測性のための本番環境対応 Grafana ダッシュボードを作成・管理します。

目的

アプリケーション、インフラストラクチャ、ビジネスメトリクスを監視するための効果的な Grafana ダッシュボードを設計します。

使用時期

Prometheus メトリクスを可視化する
カスタムダッシュボードを作成する
SLO ダッシュボードを実装する
インフラストラクチャを監視する
ビジネス KPI を追跡する

ダッシュボード設計の原則

1. 情報階層

┌─────────────────────────────────────┐
│  Critical Metrics (Big Numbers)     │
├─────────────────────────────────────┤
│  Key Trends (Time Series)           │
├─────────────────────────────────────┤
│  Detailed Metrics (Tables/Heatmaps) │
└─────────────────────────────────────┘

2. RED メソッド (サービス)

Rate - 1秒あたりのリクエスト数
Errors - エラーレート
Duration - レイテンシー/応答時間

3. USE メソッド (リソース)

Utilization - リソースが稼働している時間の割合
Saturation - キュー長/待機時間
Errors - エラーカウント

ダッシュボード構造

API 監視ダッシュボード

{
  "dashboard": {
    "title": "API Monitoring",
    "tags": ["api", "production"],
    "timezone": "browser",
    "refresh": "30s",
    "panels": [
      {
        "title": "Request Rate",
        "type": "graph",
        "targets": [
          {
            "expr": "sum(rate(http_requests_total[5m])) by (service)",
            "legendFormat": "{{service}}"
          }
        ],
        "gridPos": { "x": 0, "y": 0, "w": 12, "h": 8 }
      },
      {
        "title": "Error Rate %",
        "type": "graph",
        "targets": [
          {
            "expr": "(sum(rate(http_requests_total{status=~\"5..\"}[5m])) / sum(rate(http_requests_total[5m]))) * 100",
            "legendFormat": "Error Rate"
          }
        ],
        "alert": {
          "conditions": [
            {
              "evaluator": { "params": [5], "type": "gt" },
              "operator": { "type": "and" },
              "query": { "params": ["A", "5m", "now"] },
              "type": "query"
            }
          ]
        },
        "gridPos": { "x": 12, "y": 0, "w": 12, "h": 8 }
      },
      {
        "title": "P95 Latency",
        "type": "graph",
        "targets": [
          {
            "expr": "histogram_quantile(0.95, sum(rate(http_request_duration_seconds_bucket[5m])) by (le, service))",
            "legendFormat": "{{service}}"
          }
        ],
        "gridPos": { "x": 0, "y": 8, "w": 24, "h": 8 }
      }
    ]
  }
}

参考: assets/api-dashboard.json を参照

パネルの種類

1. Stat パネル (単一値)

{
  "type": "stat",
  "title": "Total Requests",
  "targets": [
    {
      "expr": "sum(http_requests_total)"
    }
  ],
  "options": {
    "reduceOptions": {
      "values": false,
      "calcs": ["lastNotNull"]
    },
    "orientation": "auto",
    "textMode": "auto",
    "colorMode": "value"
  },
  "fieldConfig": {
    "defaults": {
      "thresholds": {
        "mode": "absolute",
        "steps": [
          { "value": 0, "color": "green" },
          { "value": 80, "color": "yellow" },
          { "value": 90, "color": "red" }
        ]
      }
    }
  }
}

2. 時系列グラフ

{
  "type": "graph",
  "title": "CPU Usage",
  "targets": [
    {
      "expr": "100 - (avg by (instance) (rate(node_cpu_seconds_total{mode=\"idle\"}[5m])) * 100)"
    }
  ],
  "yaxes": [
    { "format": "percent", "max": 100, "min": 0 },
    { "format": "short" }
  ]
}

3. テーブルパネル

{
  "type": "table",
  "title": "Service Status",
  "targets": [
    {
      "expr": "up",
      "format": "table",
      "instant": true
    }
  ],
  "transformations": [
    {
      "id": "organize",
      "options": {
        "excludeByName": { "Time": true },
        "indexByName": {},
        "renameByName": {
          "instance": "Instance",
          "job": "Service",
          "Value": "Status"
        }
      }
    }
  ]
}

4. ヒートマップ

{
  "type": "heatmap",
  "title": "Latency Heatmap",
  "targets": [
    {
      "expr": "sum(rate(http_request_duration_seconds_bucket[5m])) by (le)",
      "format": "heatmap"
    }
  ],
  "dataFormat": "tsbuckets",
  "yAxis": {
    "format": "s"
  }
}

変数

クエリ変数

{
  "templating": {
    "list": [
      {
        "name": "namespace",
        "type": "query",
        "datasource": "Prometheus",
        "query": "label_values(kube_pod_info, namespace)",
        "refresh": 1,
        "multi": false
      },
      {
        "name": "service",
        "type": "query",
        "datasource": "Prometheus",
        "query": "label_values(kube_service_info{namespace=\"$namespace\"}, service)",
        "refresh": 1,
        "multi": true
      }
    ]
  }
}

クエリで変数を使用

sum(rate(http_requests_total{namespace="$namespace", service=~"$service"}[5m]))

ダッシュボードのアラート

{
  "alert": {
    "name": "High Error Rate",
    "conditions": [
      {
        "evaluator": {
          "params": [5],
          "type": "gt"
        },
        "operator": { "type": "and" },
        "query": {
          "params": ["A", "5m", "now"]
        },
        "reducer": { "type": "avg" },
        "type": "query"
      }
    ],
    "executionErrorState": "alerting",
    "for": "5m",
    "frequency": "1m",
    "message": "Error rate is above 5%",
    "noDataState": "no_data",
    "notifications": [{ "uid": "slack-channel" }]
  }
}

ダッシュボードプロビジョニング

dashboards.yml:

apiVersion: 1

providers:
  - name: "default"
    orgId: 1
    folder: "General"
    type: file
    disableDeletion: false
    updateIntervalSeconds: 10
    allowUiUpdates: true
    options:
      path: /etc/grafana/dashboards

一般的なダッシュボードパターン

インフラストラクチャダッシュボード

キーパネル:

ノードあたりの CPU 使用率
ノードあたりのメモリ使用量
ディスク I/O
ネットワークトラフィック
名前空間ごとのポッド数
ノードステータス

参考: assets/infrastructure-dashboard.json を参照

データベースダッシュボード

キーパネル:

1秒あたりのクエリ数
コネクションプール使用率
クエリレイテンシー (P50, P95, P99)
アクティブな接続数
データベースサイズ
レプリケーションラグ
スロークエリ

参考: assets/database-dashboard.json を参照

アプリケーションダッシュボード

キーパネル:

リクエストレート
エラーレート
応答時間 (パーセンタイル)
アクティブユーザー/セッション
キャッシュヒット率
キュー長

ベストプラクティス

テンプレートから始める (Grafana コミュニティダッシュボード)
一貫した命名規則を使用 (パネル・変数)
関連するメトリクスをグループ化 (行単位)
適切な時間範囲を設定 (デフォルト: 直近 6 時間)
変数を使用 (柔軟性のため)
パネルの説明を追加 (コンテキスト用)
ユニットを正しく設定
色のしきい値を意味のあるものに設定
ダッシュボード全体で一貫した色を使用
異なる時間範囲でテスト

コードとしてのダッシュボード

Terraform プロビジョニング

resource "grafana_dashboard" "api_monitoring" {
  config_json = file("${path.module}/dashboards/api-monitoring.json")
  folder      = grafana_folder.monitoring.id
}

resource "grafana_folder" "monitoring" {
  title = "Production Monitoring"
}

Ansible プロビジョニング

- name: Deploy Grafana dashboards
  copy:
    src: "{{ item }}"
    dest: /etc/grafana/dashboards/
  with_fileglob:
    - "dashboards/*.json"
  notify: restart grafana

詳細情報

作者: wshobson
リポジトリ: wshobson/agents
ライセンス: MIT
最終更新: 不明

GitHubで原本を見る →フィードバックを送る

Source: https://github.com/wshobson/agents / ライセンス: MIT

grafana-dashboards

SKILL.md 本文

Grafana ダッシュボード

目的

使用時期

ダッシュボード設計の原則

1. 情報階層

2. RED メソッド (サービス)

3. USE メソッド (リソース)

ダッシュボード構造

API 監視ダッシュボード

パネルの種類

1. Stat パネル (単一値)

2. 時系列グラフ

3. テーブルパネル

4. ヒートマップ

変数

クエリ変数

クエリで変数を使用

ダッシュボードのアラート

ダッシュボードプロビジョニング

一般的なダッシュボードパターン

インフラストラクチャダッシュボード

データベースダッシュボード

アプリケーションダッシュボード

ベストプラクティス

コードとしてのダッシュボード

Terraform プロビジョニング

Ansible プロビジョニング

関連スキル

詳細情報

関連スキル

superfluid

civ-finish-quotes

nookplot

web3-polymarket

ethskills

xxyy-trade