Observability Dashboard

The Headwind Observability Dashboard provides real-time metrics visualization and monitoring capabilities. It supports multiple metrics backends and automatically discovers available infrastructure.

Overview

The Observability Dashboard (/observability) displays:

Real-time update statistics (pending, approved, applied, failed)
Resource monitoring counts (Deployments, StatefulSets, DaemonSets, HelmReleases)
Time-series data from your metrics backend
Backend health and connectivity status

Accessing the Dashboard

Navigate to the observability page:

# Port forward the Web UI
kubectl port-forward -n headwind-system svc/headwind-ui 8082:8082

# Open in browser
open http://localhost:8082/observability

Or click "Observability" in the Web UI navigation menu.

Metrics Backends

Headwind supports four metrics backend types:

1. Prometheus (Recommended)

Features:

Full time-series support
PromQL query capabilities
24-hour historical data
Industry-standard metrics storage

Auto-Discovery: Headwind automatically detects Prometheus at:

http://prometheus-server.monitoring.svc.cluster.local:80
http://prometheus.monitoring.svc.cluster.local:9090

Manual Configuration:

observability:
  metricsBackend: "prometheus"
  prometheus:
    enabled: true
    url: "http://your-prometheus.namespace.svc.cluster.local:9090"

2. VictoriaMetrics

Features:

Prometheus-compatible API
High performance
Long-term storage
Resource efficient

Auto-Discovery: Detects VictoriaMetrics at:

http://victoria-metrics.monitoring.svc.cluster.local:8428

Manual Configuration:

observability:
  metricsBackend: "victoriametrics"
  victoriametrics:
    enabled: true
    url: "http://victoria-metrics.namespace.svc.cluster.local:8428"

3. InfluxDB v2

Features:

Purpose-built time-series database
Advanced querying with Flux
Built-in visualization

Configuration (no auto-discovery):

observability:
  metricsBackend: "influxdb"
  influxdb:
    enabled: true
    url: "http://influxdb.monitoring.svc.cluster.local:8086"
    database: "headwind"

InfluxDB v2 Configuration:

observability:
  metricsBackend: "influxdb"
  influxdb:
    enabled: true
    url: "http://influxdb.monitoring.svc.cluster.local:8086"
    org: "headwind"           # InfluxDB organization
    bucket: "metrics"         # InfluxDB bucket name
    token: "your-api-token"   # InfluxDB API token

4. Live Metrics (Fallback)

Features:

No external backend required
Parses /metrics endpoint directly
Instant values only (no time-series)
Zero configuration

When Used:

Automatically activated when no backend is available
Useful for testing and development
No historical data

Auto-Discovery

By default, Headwind uses auto-discovery to find available metrics backends:

Discovery Priority:

Prometheus (if available)
VictoriaMetrics (if available)
InfluxDB (if configured and enabled)
Live metrics (always available)

Configuration:

observability:
  metricsBackend: "auto"  # Default

The dashboard displays which backend is currently active in an alert banner.

Configuration

Via ConfigMap

Create or update the Headwind ConfigMap:

apiVersion: v1
kind: ConfigMap
metadata:
  name: headwind-config
  namespace: headwind-system
data:
  config.yaml: |
    observability:
      metricsBackend: "auto"
      prometheus:
        enabled: true
        url: "http://prometheus-server.monitoring.svc.cluster.local:80"
      victoriametrics:
        enabled: false
        url: "http://victoria-metrics.monitoring.svc.cluster.local:8428"
      influxdb:
        enabled: false
        url: "http://influxdb.monitoring.svc.cluster.local:8086"
        database: "headwind"

Via Web UI Settings

Navigate to Settings in the Web UI and configure under Observability / Metrics Storage:

Select metrics backend (auto, prometheus, victoriametrics, influxdb, live)
Enable/disable specific backends
Configure URLs
Click Save Settings
Configuration hot-reloads automatically (no restart required)

Dashboard Features

Metrics Cards

Updates Section:

Pending: Updates awaiting approval
Approved: Total approved updates
Applied: Successfully applied updates
Failed: Updates that failed to apply

Resources Section:

Deployments Watched: Active Deployment monitoring
StatefulSets Watched: Active StatefulSet monitoring
DaemonSets Watched: Active DaemonSet monitoring
Helm Releases Watched: Active HelmRelease monitoring

Time-Series Charts

When using Prometheus, VictoriaMetrics, or InfluxDB backends, the dashboard displays interactive time-series charts powered by Chart.js.

Charts Included:

Updates Over Time - 24-hour historical view:
- Approved updates (teal line)
- Applied updates (green line)
- Failed updates (red line)
Resources Watched - 24-hour resource monitoring:
- Deployments watched (blue line)
- StatefulSets watched (indigo line)
- DaemonSets watched (purple line)
- Helm Releases watched (pink line)

Features:

Interactive hover tooltips showing exact values
Smooth line charts with 5-minute data points
Responsive design adapts to screen size
Legend at bottom for better visibility
Auto-refresh updates charts every 30 seconds

Note: Charts are only available with Prometheus, VictoriaMetrics, or InfluxDB backends. The "Live" fallback mode doesn't support historical data, so charts won't display.

Auto-Refresh

The observability dashboard auto-refreshes every 30 seconds to display the latest metrics and update charts.

Visual Indicators:

Loading spinner while fetching data
Error messages if backend unavailable
Backend name displayed in alert banner
Charts automatically reload with new data

Backend Status

A colored alert banner shows the current backend status:

🟢 Green (Prometheus/VictoriaMetrics): External backend active
🟡 Yellow (InfluxDB): InfluxDB backend (experimental)
🔵 Blue (Live): Fallback mode, no external backend

API Endpoints

Get Current Metrics

Fetch current metric values:

curl http://headwind-ui:8082/api/v1/metrics

Response:

{
  "backend": "Prometheus",
  "metrics": {
    "updates_pending": 5,
    "updates_approved": 120,
    "updates_rejected": 3,
    "updates_applied": 115,
    "updates_failed": 2,
    "deployments_watched": 10,
    "statefulsets_watched": 3,
    "daemonsets_watched": 5,
    "helm_releases_watched": 8
  }
}

Get Time-Series Data

Fetch 24-hour historical data for a specific metric:

curl http://headwind-ui:8082/api/v1/metrics/timeseries/headwind_updates_pending

Response:

[
  {"timestamp": "2025-11-08T00:00:00Z", "value": 3},
  {"timestamp": "2025-11-08T00:05:00Z", "value": 5},
  {"timestamp": "2025-11-08T00:10:00Z", "value": 4},
  ...
]

Parameters:

Metric name: Any valid Headwind metric (see Metrics Reference)
Time range: Fixed 24 hours
Step: 5 minutes

Prometheus Integration

Installing Prometheus

Deploy Prometheus to your cluster:

apiVersion: v1
kind: Namespace
metadata:
  name: monitoring

---
apiVersion: v1
kind: ConfigMap
metadata:
  name: prometheus-config
  namespace: monitoring
data:
  prometheus.yml: |
    global:
      scrape_interval: 15s

    scrape_configs:
      - job_name: 'headwind'
        static_configs:
          - targets: ['headwind-metrics.headwind-system.svc.cluster.local:9090']
        scrape_interval: 15s

---
apiVersion: apps/v1
kind: Deployment
metadata:
  name: prometheus
  namespace: monitoring
spec:
  selector:
    matchLabels:
      app: prometheus
  template:
    metadata:
      labels:
        app: prometheus
    spec:
      containers:
        - name: prometheus
          image: prom/prometheus:v2.48.0
          args:
            - '--config.file=/etc/prometheus/prometheus.yml'
            - '--storage.tsdb.path=/prometheus'
          ports:
            - containerPort: 9090
          volumeMounts:
            - name: config
              mountPath: /etc/prometheus
            - name: storage
              mountPath: /prometheus
      volumes:
        - name: config
          configMap:
            name: prometheus-config
        - name: storage
          emptyDir: {}

---
apiVersion: v1
kind: Service
metadata:
  name: prometheus-server
  namespace: monitoring
spec:
  selector:
    app: prometheus
  ports:
    - port: 80
      targetPort: 9090

Apply the manifest:

kubectl apply -f prometheus.yaml

Verification

Check that Prometheus is scraping Headwind:

# Port forward Prometheus
kubectl port-forward -n monitoring svc/prometheus-server 9090:80

# Open Prometheus UI
open http://localhost:9090

# Query Headwind metrics
# Navigate to Graph tab and query: headwind_updates_pending

The Observability Dashboard will automatically detect Prometheus and switch to it.

VictoriaMetrics Integration

VictoriaMetrics is a Prometheus-compatible alternative with better performance:

apiVersion: apps/v1
kind: Deployment
metadata:
  name: victoria-metrics
  namespace: monitoring
spec:
  selector:
    matchLabels:
      app: victoria-metrics
  template:
    metadata:
      labels:
        app: victoria-metrics
    spec:
      containers:
        - name: victoria-metrics
          image: victoriametrics/victoria-metrics:latest
          args:
            - '-promscrape.config=/etc/prometheus/prometheus.yml'
            - '-storageDataPath=/victoria-metrics-data'
          ports:
            - containerPort: 8428
          volumeMounts:
            - name: config
              mountPath: /etc/prometheus
            - name: storage
              mountPath: /victoria-metrics-data
      volumes:
        - name: config
          configMap:
            name: prometheus-config  # Reuse same config
        - name: storage
          emptyDir: {}

Troubleshooting

"Backend: Live" shown instead of Prometheus

Problem: Auto-discovery not finding Prometheus

Solutions:

Check Prometheus is running:

kubectl get pods -n monitoring -l app=prometheus

Check service name and port:

kubectl get svc -n monitoring prometheus-server

Check connectivity from Headwind pod:

kubectl exec -n headwind-system deployment/headwind -- curl -v http://prometheus-server.monitoring.svc.cluster.local:80/api/v1/query?query=up

Manually configure URL in ConfigMap if auto-discovery fails

"Failed to fetch metrics"

Problem: API requests failing

Possible Causes:

Backend URL incorrect
Backend not responding
Network policy blocking access
Authentication required (not yet supported)

Solutions:

Check backend logs
Verify URL in ConfigMap
Test connectivity from Headwind pod
Check NetworkPolicies

Time-series data showing "No data"

Problem: 24-hour query returns empty results

Possible Causes:

Prometheus retention too short
Metric name incorrect
No data collected yet (new installation)

Solutions:

Wait for Prometheus to collect data (15s intervals)
Verify metric name exists: curl http://headwind-metrics:9090/metrics | grep headwind
Check Prometheus retention settings

Metrics showing 0 or incorrect values

Problem: Values don't match expected state

Possible Causes:

Prometheus scrape failing
Stale data cached
Clock skew

Solutions:

# Check Prometheus is scraping successfully
kubectl logs -n monitoring deployment/prometheus | grep headwind

# Check /metrics endpoint directly
kubectl port-forward -n headwind-system svc/headwind-metrics 9090:9090
curl localhost:9090/metrics | grep headwind_updates

Available Metrics

See the full list of metrics in the Metrics API Reference.

Key Metrics for Observability Dashboard:

headwind_updates_pending - Current pending updates
headwind_updates_approved_total - Total approved updates
headwind_updates_applied_total - Total applied updates
headwind_updates_failed_total - Total failed updates
headwind_deployments_watched - Deployments being monitored
headwind_statefulsets_watched - StatefulSets being monitored
headwind_daemonsets_watched - DaemonSets being monitored
headwind_helm_releases_watched - HelmReleases being monitored

Next Steps

Metrics API Reference - Complete metrics documentation
Web UI Overview - Main dashboard features
Configuration - Customize settings

Overview​

Accessing the Dashboard​

Metrics Backends​

1. Prometheus (Recommended)​

2. VictoriaMetrics​

3. InfluxDB v2​

4. Live Metrics (Fallback)​

Auto-Discovery​

Configuration​

Via ConfigMap​

Via Web UI Settings​

Dashboard Features​

Metrics Cards​

Time-Series Charts​

Auto-Refresh​

Backend Status​

API Endpoints​

Get Current Metrics​

Get Time-Series Data​

Prometheus Integration​

Installing Prometheus​

Verification​

VictoriaMetrics Integration​

Troubleshooting​

"Backend: Live" shown instead of Prometheus​

"Failed to fetch metrics"​

Time-series data showing "No data"​

Metrics showing 0 or incorrect values​

Available Metrics​

Next Steps​

Overview

Accessing the Dashboard

Metrics Backends

1. Prometheus (Recommended)

2. VictoriaMetrics

3. InfluxDB v2

4. Live Metrics (Fallback)

Auto-Discovery

Configuration

Via ConfigMap

Via Web UI Settings

Dashboard Features

Metrics Cards

Time-Series Charts

Auto-Refresh

Backend Status

API Endpoints

Get Current Metrics

Get Time-Series Data

Prometheus Integration

Installing Prometheus

Verification

VictoriaMetrics Integration

Troubleshooting

"Backend: Live" shown instead of Prometheus

"Failed to fetch metrics"

Time-series data showing "No data"

Metrics showing 0 or incorrect values

Available Metrics

Next Steps