Common PagerDuty Issues and Solutions

1. Delayed or Missing Notifications

Alerts do not reach responders in a timely manner, leading to delayed incident resolution.

Root Causes:

  • Incorrect notification settings for on-call users.
  • PagerDuty notification delivery system experiencing delays.
  • Spam filtering or silent mode on mobile devices preventing alert reception.

Solution:

Verify that on-call users have correct contact information:

Settings → Users → Contact Methods

Check notification rules:

Settings → Users → Notification Rules

Ensure the PagerDuty app has the necessary permissions on mobile:

Settings → Apps & Notifications → PagerDuty

For email alerts, check spam/junk folders and whitelist @pagerduty.com emails.

2. Escalations Not Triggering

Incidents do not escalate to the next responder when the primary contact does not acknowledge them.

Root Causes:

  • Misconfigured escalation policy.
  • Escalation timeout settings too long or missing.
  • Incident acknowledgement occurring before escalation triggers.

Solution:

Verify escalation policies:

Settings → Escalation Policies

Adjust escalation timeout settings:

Settings → Services → Escalation Timeout

Ensure that responders do not manually acknowledge incidents before escalation:

Incident Details → Acknowledged By

3. API Integration Failures

Automated alerts or custom applications fail to send incidents via the PagerDuty API.

Root Causes:

  • Invalid API key or incorrect authentication method.
  • Incorrect API payload format.
  • Rate limiting blocking API requests.

Solution:

Verify API key validity:

Settings → API Access → API Keys

Ensure correct API payload format:

{
  "incident": {
    "type": "incident",
    "title": "Server Down",
    "service": { "id": "SERVICE_ID", "type": "service_reference" },
    "urgency": "high"
  }
}

Monitor API rate limits:

curl -X GET "https://api.pagerduty.com/rate-limits" -H "Authorization: Token token=YOUR_API_KEY"

4. Webhooks Not Delivering Alerts

Webhook-based notifications do not reach external systems such as Slack or monitoring dashboards.

Root Causes:

  • Incorrect webhook URL or payload format.
  • Webhook delivery failures due to firewall restrictions.
  • Target system unavailable or rejecting requests.

Solution:

Check webhook configuration:

Settings → Extensions → Webhooks

Test webhook manually:

curl -X POST -H "Content-Type: application/json" -d '{ "message": "Test alert" }' WEBHOOK_URL

Review webhook logs in PagerDuty:

Settings → Extensions → Webhook Logs

5. Duplicate Alerts for the Same Incident

PagerDuty creates multiple incidents for the same issue, overwhelming responders.

Root Causes:

  • Multiple monitoring tools triggering redundant alerts.
  • Incident de-duplication settings misconfigured.
  • Alert correlation rules missing or incorrect.

Solution:

Enable de-duplication by using the same dedup_key in API payloads:

{
  "event_action": "trigger",
  "dedup_key": "server-outage",
  "payload": {
    "summary": "Server down",
    "severity": "critical"
  }
}

Configure alert suppression in monitoring tools to prevent redundant triggers.

Use PagerDuty event rules to filter out duplicate alerts:

Settings → Event Rules

Best Practices for PagerDuty Optimization

  • Regularly review on-call schedules and escalation policies.
  • Ensure API keys and webhook integrations are correctly configured.
  • Use deduplication settings to prevent excessive alert noise.
  • Monitor notification logs to identify delivery failures.
  • Test alerts and escalation workflows periodically.

Conclusion

By troubleshooting delayed notifications, failed escalations, API issues, webhook failures, and duplicate alerts, organizations can optimize their PagerDuty incident management workflows. Implementing best practices ensures a fast, reliable, and efficient alerting system.

FAQs

1. Why are my PagerDuty alerts delayed?

Check user notification rules, verify mobile app permissions, and whitelist PagerDuty email addresses.

2. How do I fix escalations not triggering?

Review escalation policies, adjust timeout settings, and ensure incidents are not manually acknowledged before escalation.

3. Why is my API integration failing?

Ensure API keys are valid, verify request payloads, and check rate limits.

4. How do I troubleshoot webhook failures?

Check webhook URLs, test connectivity using curl, and review webhook logs in PagerDuty.

5. How do I prevent duplicate alerts?

Use deduplication keys in API requests, configure alert suppression, and enable PagerDuty event rules.