Monitoring & Alerts
Monitoring & Alerts
Section titled “Monitoring & Alerts”Short Summary: SSL-CLM exposes Prometheus-compatible metrics and supports webhook alerting.
Prometheus Metrics
Section titled “Prometheus Metrics”The Backend exposes metrics at /actuator/prometheus.
| Metric Name | Type | Description |
|---|---|---|
sslclm_certs_total | Gauge | Total certificates in inventory. |
sslclm_certs_expiring_30d | Gauge | Certificates expiring in < 30 days. |
sslclm_jobs_failed_total | Counter | Number of failed jobs. |
sslclm_agent_up | Gauge | 1 = Up, 0 = Down. |
Grafana Dashboard
Section titled “Grafana Dashboard”We provide a standard Dashboard JSON. Key panels to watch:
- Expiry Burn Down: Certificates expiring soon.
- Job Failure Rate: Spikes indicate systemic issues.
- Agent Health: Alert if any agent drops to 0.
Webhook Alerts
Section titled “Webhook Alerts”Configure webhooks in Settings > Alerts to send notifications to Slack/Teams/PagerDuty.
- Events:
CERT_EXPIRING_CRITICAL(7 days left)JOB_FAILEDAGENT_OFFLINE