Skip to content

Monitoring & Alerts

Short Summary: SSL-CLM exposes Prometheus-compatible metrics and supports webhook alerting.

The Backend exposes metrics at /actuator/prometheus.

Metric NameTypeDescription
sslclm_certs_totalGaugeTotal certificates in inventory.
sslclm_certs_expiring_30dGaugeCertificates expiring in < 30 days.
sslclm_jobs_failed_totalCounterNumber of failed jobs.
sslclm_agent_upGauge1 = Up, 0 = Down.

We provide a standard Dashboard JSON. Key panels to watch:

  1. Expiry Burn Down: Certificates expiring soon.
  2. Job Failure Rate: Spikes indicate systemic issues.
  3. Agent Health: Alert if any agent drops to 0.

Configure webhooks in Settings > Alerts to send notifications to Slack/Teams/PagerDuty.

  • Events:
    • CERT_EXPIRING_CRITICAL (7 days left)
    • JOB_FAILED
    • AGENT_OFFLINE