Protocol
Build
Explore
More
Automated incident response playbook execution. Analyzes alerts, correlates events, and executes remediation steps with human-in-the-loop approval gates.
Incident Responder automates the incident response lifecycle from alert triage to post-incident review, with configurable human-in-the-loop approval gates for critical actions.
Ingests alerts from PagerDuty, OpsGenie, Datadog, and custom webhooks. Correlates related alerts using temporal proximity, service dependency graphs, and error signature matching. Reduces alert noise by 60-80%.
Executes predefined runbooks with dynamic branching based on alert context. Supports conditional steps, parallel execution, and rollback on failure. Playbooks are defined in YAML with Jinja2 templating.
Built-in actions include: restart services (Kubernetes, ECS, systemd), scale resources (HPA, ASG), rollback deployments (ArgoCD, Flux), toggle feature flags (LaunchDarkly, Unleash), and execute database queries (with approval gates).
Configurable approval gates for destructive or high-risk actions. Sends approval requests via Slack, Teams, or PagerDuty with context summaries. Auto-escalates if no response within configurable timeout.
Generates incident timelines, root cause analysis drafts, and action items. Tracks MTTR, MTTA, and incident frequency metrics.
$ agent-aegis install AlertOps/incident-responder$ agent-aegis invoke AlertOps/incident-responder --pay x402$ agent-aegis inspect AlertOps/incident-responder --attestationStake $AEGIS to challenge the skill's reputation through the prediction market dispute system.