Execute qualquer Skill no Manus
com um clique

Execute qualquer Skill no Manus com um clique

$pwd:

incident-postmortem-report

Name: Incident Postmortem Report
Author: cnoe-io

// Produce a thorough incident post-mortem report after an outage or customer-impacting event. Covers executive summary, impact, detailed timeline, root cause, contributing factors, corrective and preventive actions, and lessons learned. Use when the user asks to write, draft, or complete a post-mortem, blameless review, or incident review document.

Executar no Manus

$ git log --oneline --stat

stars:366

forks:64

updated:5 de maio de 2026 às 17:00

Explorador de arquivos

2 arquivos

SKILL.md

readonly

name	incident-postmortem-report
description	Produce a thorough incident post-mortem report after an outage or customer-impacting event. Covers executive summary, impact, detailed timeline, root cause, contributing factors, corrective and preventive actions, and lessons learned. Use when the user asks to write, draft, or complete a post-mortem, blameless review, or incident review document.

Incident Post-Mortem Report

Guide the user through producing a blameless, audit-ready post-mortem suitable for engineering leadership, compliance, and future incident prevention.

Instructions

Phase 1: Scope and audience

Confirm what incident is in scope (ticket ID, time window, service, or free-text summary).
Identify audience (internal engineering only vs. includes executives or customers) and adjust depth of business impact language.
List facts already known vs. gaps that need research (logs, metrics, deploys, comms).

Phase 2: Structure the report

Use the sections below in order unless the organization mandates a different template. Fill each section with concrete data; avoid vague statements.

Executive summary — 2–4 sentences: what broke, who was affected, how long, current status.
Impact — Quantify: duration, error rates, revenue/users affected if known, SLA breach yes/no.
Timeline — UTC timestamps, short event labels. Include detection, escalation, mitigation, full recovery.
Root cause — Single primary cause, explained clearly. Use 5 Whys or equivalent if helpful.
Contributing factors — Environment, process, tooling, or communication issues that amplified impact (not blame).
What went well — Detection, runbooks, teamwork, rollback, comms.
What went wrong — Gaps in monitoring, deploy process, testing, on-call routing, documentation.
Corrective actions — Short-term fixes with owners and dates.
Preventive actions — Longer-term hardening (tests, SLOs, chaos, capacity).
Lessons learned — 2–5 bullet takeaways for the org.
References — Links to incidents, PRs, dashboards, chat threads (no secrets).

Phase 3: Tone and quality bar

Blameless: describe systems and processes, not individuals.
Specific: numbers, tool names, versions, ticket keys.
Actionable: every action item has an owner and a target date when possible.

Output Format

Deliver the post-mortem as Markdown using this skeleton:

# Post-Mortem: [short title]

| Field | Value |
|-------|-------|
| **Status** | Draft / Final |
| **Date** | YYYY-MM-DD |
| **Author(s)** | |
| **Incident ID(s)** | |

## Executive summary
[2–4 sentences]

## Impact
- **Duration:** …
- **Scope:** …
- **Customer / business impact:** …

## Timeline (UTC)
| Time | Event |
|------|-------|
| … | … |

## Root cause
[Clear explanation]

## Contributing factors
- …

## What went well
- …

## What went wrong
- …

## Action items
| Action | Type (corrective / preventive) | Owner | Target date |
|--------|--------------------------------|-------|-------------|
| … | … | … | … |

## Lessons learned
- …

## References
- …

Examples

"Write a post-mortem for yesterday's API outage from 14:00–16:30 UTC affecting checkout."
"Draft a blameless incident review for INC-4521; we rolled back service X at 09:45."
"Turn these notes from the war room into a formal post-mortem document."

Guidelines

If information is missing, state assumptions explicitly or list open questions instead of inventing facts.
Align severity and customer-impact language with the org’s incident taxonomy if the user provides it.
Do not include credentials, full PII, or unreleased security details in the body; redact or summarize.
Prefer UTC for all times unless the user requests a specific timezone.

related-skills.json

mesmo repositório

release-docs.md

from "cnoe-io/ai-platform-engineering"

Generate a combined release blog post for ai-platform-engineering. Produces a single docs/releases/YYYY-MM-DD-release-X-Y-Z.md file containing release notes and the upgrade guide (migration guide) inline. Use when cutting a release, when a user asks "what changed in 0.4.x", or when upgrading their values.yaml to a new chart version.

2026-05-12366

update-docs.md

from "cnoe-io/ai-platform-engineering"

Audit and update all documentation moving parts for ai-platform-engineering. Checks release blog posts, features page, agent docs, homepage version strings, Docusaurus version config, and sidebar completeness. Fixes what is stale and reports what needs manual attention. Use after cutting a release, adding a new agent, or updating platform features.

2026-05-07366

aws-cost-analysis.md

from "cnoe-io/ai-platform-engineering"

Analyze AWS costs by service, account, and time period. Identifies top spenders, cost anomalies, and optimization opportunities. Use when reviewing cloud spend, preparing cost reports, or investigating unexpected charges.

2026-04-15366

check-deployment-status.md

from "cnoe-io/ai-platform-engineering"

Check the health and sync status of all ArgoCD applications across clusters. Identifies out-of-sync, degraded, or unhealthy deployments and provides actionable remediation steps. Use when monitoring deployments, troubleshooting sync failures, or verifying environment health after a release.

2026-04-15366

cluster-resource-health.md

from "cnoe-io/ai-platform-engineering"

Check Kubernetes cluster health including pod status, node conditions, resource utilization, and pending alerts across EKS clusters. Use when monitoring infrastructure health, investigating capacity issues, or performing cluster audits.

2026-04-15366

incident-investigation.md

from "cnoe-io/ai-platform-engineering"

Correlate PagerDuty incidents with Jira tickets and recent ArgoCD deployments to accelerate root cause analysis. Orchestrates multiple agents to build a timeline of events. Use when investigating active incidents, performing post-mortems, or correlating alerts with changes.

2026-04-15366

package.json

"author": "cnoe-io"

"repository": "cnoe-io/ai-platform-engineering"

Abrir repositório GitHub Ver repositórios do creator

$ install --global

$ download --local

Executar no Manus

$ useful --forSOC

Gerentes de sistemas computacionais e de informaçãoOcupações de Gestão11-3021L4

name	incident-postmortem-report
description	Produce a thorough incident post-mortem report after an outage or customer-impacting event. Covers executive summary, impact, detailed timeline, root cause, contributing factors, corrective and preventive actions, and lessons learned. Use when the user asks to write, draft, or complete a post-mortem, blameless review, or incident review document.

Incident Post-Mortem Report

Guide the user through producing a blameless, audit-ready post-mortem suitable for engineering leadership, compliance, and future incident prevention.

Instructions

Phase 1: Scope and audience

Confirm what incident is in scope (ticket ID, time window, service, or free-text summary).
Identify audience (internal engineering only vs. includes executives or customers) and adjust depth of business impact language.
List facts already known vs. gaps that need research (logs, metrics, deploys, comms).

Phase 2: Structure the report

Use the sections below in order unless the organization mandates a different template. Fill each section with concrete data; avoid vague statements.

Executive summary — 2–4 sentences: what broke, who was affected, how long, current status.
Impact — Quantify: duration, error rates, revenue/users affected if known, SLA breach yes/no.
Timeline — UTC timestamps, short event labels. Include detection, escalation, mitigation, full recovery.
Root cause — Single primary cause, explained clearly. Use 5 Whys or equivalent if helpful.
Contributing factors — Environment, process, tooling, or communication issues that amplified impact (not blame).
What went well — Detection, runbooks, teamwork, rollback, comms.
What went wrong — Gaps in monitoring, deploy process, testing, on-call routing, documentation.
Corrective actions — Short-term fixes with owners and dates.
Preventive actions — Longer-term hardening (tests, SLOs, chaos, capacity).
Lessons learned — 2–5 bullet takeaways for the org.
References — Links to incidents, PRs, dashboards, chat threads (no secrets).

Phase 3: Tone and quality bar

Blameless: describe systems and processes, not individuals.
Specific: numbers, tool names, versions, ticket keys.
Actionable: every action item has an owner and a target date when possible.

Output Format

Deliver the post-mortem as Markdown using this skeleton:

# Post-Mortem: [short title]

| Field | Value |
|-------|-------|
| **Status** | Draft / Final |
| **Date** | YYYY-MM-DD |
| **Author(s)** | |
| **Incident ID(s)** | |

## Executive summary
[2–4 sentences]

## Impact
- **Duration:** …
- **Scope:** …
- **Customer / business impact:** …

## Timeline (UTC)
| Time | Event |
|------|-------|
| … | … |

## Root cause
[Clear explanation]

## Contributing factors
- …

## What went well
- …

## What went wrong
- …

## Action items
| Action | Type (corrective / preventive) | Owner | Target date |
|--------|--------------------------------|-------|-------------|
| … | … | … | … |

## Lessons learned
- …

## References
- …

Examples

"Write a post-mortem for yesterday's API outage from 14:00–16:30 UTC affecting checkout."
"Draft a blameless incident review for INC-4521; we rolled back service X at 09:45."
"Turn these notes from the war room into a formal post-mortem document."

Guidelines

If information is missing, state assumptions explicitly or list open questions instead of inventing facts.
Align severity and customer-impact language with the org’s incident taxonomy if the user provides it.
Do not include credentials, full PII, or unreleased security details in the body; redact or summarize.
Prefer UTC for all times unless the user requests a specific timezone.

incident-postmortem-report

Incident Post-Mortem Report

Instructions

Phase 1: Scope and audience

Phase 2: Structure the report

Phase 3: Tone and quality bar

Output Format

Examples

Guidelines

Mais deste repositório

Mais deste repositório

Incident Post-Mortem Report

Instructions

Phase 1: Scope and audience

Phase 2: Structure the report

Phase 3: Tone and quality bar

Output Format

Examples

Guidelines