Run any Skill in Manus with one click

$pwd:

ci-failure-analysis

Name: Ci Failure Analysis
Author: scylladb

// Implement and configure AI-powered CI failure analysis workflows for GitHub Actions. Use when setting up CI failure summaries, configuring claude-code-action for failure diagnosis, writing workflow_run triggered analysis, creating collapsed PR comments with failure diagnostics, implementing flaky test detection, or working on recurring issue tracking. Covers the full SP15 plan.

Run Skill in Manus

$ git log --oneline --stat

stars:0

forks:0

updated:March 13, 2026 at 15:03

SKILL.md

readonly

related-skills.json

same repository

conventional-commit.md

from "scylladb/cqlsh-rs"

Generate standardized commit messages following the Conventional Commits specification. Use when asked to commit changes, write a commit message, create a conventional commit, or when committing code. Analyzes staged changes and produces properly formatted commit messages.

2026-03-250

create-implementation-plan.md

from "scylladb/cqlsh-rs"

Create a new implementation plan or sub-plan for cqlsh-rs features, refactoring, or infrastructure work. Use when asked to plan a feature, create a design document, write an implementation plan, break down a task into phases, or design architecture for a component. Produces structured, AI-executable plans with deterministic language.

2026-03-250

development-process.md

from "scylladb/cqlsh-rs"

Guide the end-to-end development process for cqlsh-rs features: review plans, design tests, implement code, write tests, and update plan documents. Use when starting a new feature, picking up the next development task, or following the project's development workflow from plan to implementation.

2026-03-250

github-actions.md

from "scylladb/cqlsh-rs"

Author and maintain GitHub Actions workflows for CI/CD pipelines. Use when creating new workflows, modifying ci.yml, adding workflow jobs, configuring matrix builds, setting up caching, managing secrets, writing workflow_run triggers, creating release pipelines, or debugging GitHub Actions issues. Covers CI, benchmarking, release, and documentation workflows for cqlsh-rs.

2026-03-130

rust-clippy.md

from "scylladb/cqlsh-rs"

Run Clippy with strict lint settings and fix warnings for Rust code. Use when asked to lint code, fix clippy warnings, enforce Rust idioms, check code quality, or run static analysis. Applies project-specific lint configuration and explains fixes.

2026-03-130

rust-testing.md

from "scylladb/cqlsh-rs"

Generate comprehensive Rust tests for cqlsh-rs modules. Use when asked to write tests, add test coverage, generate unit tests, create integration tests, or improve test coverage for Rust code. Orchestrates test creation following project conventions, cargo test patterns, and the cqlsh-rs testing strategy.

2026-03-130

package.json

"author": "scylladb"

"repository": "scylladb/cqlsh-rs"

View GitHub Repository View Creator Repositories

$ install --global

$ download --local

Run Skill in Manus

$ useful --forSOC

Software DevelopersComputer and Mathematical Occupations15-1252L4

name	ci-failure-analysis
description	Implement and configure AI-powered CI failure analysis workflows for GitHub Actions. Use when setting up CI failure summaries, configuring claude-code-action for failure diagnosis, writing workflow_run triggered analysis, creating collapsed PR comments with failure diagnostics, implementing flaky test detection, or working on recurring issue tracking. Covers the full SP15 plan.

CI Failure Analysis

Implement AI-powered CI failure analysis that automatically diagnoses failing CI jobs and posts structured, collapsed PR comments with root cause analysis, fix suggestions, and recurring issue detection.

Before Starting

Read docs/plans/15-ai-ci-failure-summaries.md — the full implementation plan
Read the current CI workflow: .github/workflows/ci.yml
Check docs/plans/10-testing-strategy.md for testing context

Architecture

The system has three components:

Test output collection — cargo-nextest produces JUnit XML for structured test data
Failure analysis workflow — ci-failure-analysis.yml triggers on workflow_run completion, invokes anthropics/claude-code-action to analyze logs
PR comment posting — actions/github-script formats and posts/updates a collapsed comment

CI Workflow Fails
    ├─> Collect JUnit XML (cargo-nextest)
    ├─> Collect raw logs (gh run view --log-failed)
    ├─> Invoke claude-code-action with JSON schema
    │     ├─> Classify each failure
    │     ├─> Identify root cause
    │     ├─> Suggest fixes (file:line references)
    │     └─> Detect flaky tests
    ├─> Post/update collapsed PR comment
    └─> Auto-retry if flaky (confidence > 0.8)

Key Implementation Details

Workflow Trigger

Use workflow_run to trigger analysis after the CI workflow completes:

on:
  workflow_run:
    workflows: ["CI"]
    types: [completed]

jobs:
  analyze-failure:
    if: >
      github.event.workflow_run.conclusion == 'failure' &&
      github.event.workflow_run.event == 'pull_request'

This avoids re-running CI and only fires when CI actually fails on a PR.

Required Permissions

permissions:
  contents: read
  pull-requests: write
  actions: read
  checks: read
  id-token: write

Required Secrets

Secret	Purpose
`ANTHROPIC_API_KEY`	Claude API access

claude-code-action Configuration

Use structured JSON output with --json-schema to get validated, parseable results:

- uses: anthropics/claude-code-action@v1
  with:
    anthropic_api_key: ${{ secrets.ANTHROPIC_API_KEY }}
    model: claude-haiku-4-5-20251001
    prompt: |
      Analyze the CI failure...
    claude_args: |
      --allowedTools "Bash(gh run view:*),Bash(gh api:*),Read"
      --model claude-haiku-4-5-20251001
      --json-schema '<schema>'

Failure Classification Taxonomy

Always classify failures into one of these categories:

Category	Description	Auto-action
`compilation_error`	Code does not compile	None
`test_failure`	Test assertion failed	None
`lint_violation`	Clippy or rustfmt failure	Suggest exact fix
`infrastructure_flaky`	Timeout, container startup, network	Auto-retry if confidence > 0.8
`dependency_issue`	Cargo resolution, version conflict	Suggest `cargo update`
`configuration_error`	CI YAML issue, missing secret	Link to docs
`unknown`	Cannot classify	Flag for manual review

PR Comment Format

Use GitHub <details> tags for collapsible sections. Structure:

Summary header — pass/fail counts, classification labels, flakiness assessment
Failed job sections (collapsed) — one <details> per failed job containing error, root cause, suggested fix, file reference, category
Recurring issues section (collapsed) — table of issues seen across multiple runs
Footer — re-run link, attribution

Comment Deduplication

Always check for an existing bot comment starting with ## CI Failure Summary and update it rather than creating a duplicate:

const existing = comments.data.find(c =>
  c.user.type === 'Bot' && c.body.startsWith('## CI Failure Summary')
);
if (existing) {
  await github.rest.issues.updateComment({ ..., comment_id: existing.id, body });
} else {
  await github.rest.issues.createComment({ ..., body });
}

Recurring Issue Detection

Store failure summaries as workflow artifacts with 30-day retention. On each failure:

Download recent ci-failure-* artifacts for the same branch
Match failures by test name and error message similarity
Report patterns when the same failure appears in 2+ of the last 5 runs

Flaky Test Auto-Retry

- name: Auto-retry if flaky
  if: >
    fromJSON(steps.analyze.outputs.structured_output).is_flaky == true &&
    fromJSON(steps.analyze.outputs.structured_output).flaky_confidence > 0.8
  run: gh run rerun ${{ github.event.workflow_run.id }} --failed

cargo-nextest Setup

Replace cargo test with cargo-nextest in the CI workflow for JUnit XML output:

- uses: taiki-e/install-action@nextest
- run: cargo nextest run --all-targets --all-features --message-format junit --output-file test-results.xml
- uses: actions/upload-artifact@v4
  if: always()
  with:
    name: test-results
    path: test-results.xml

Model Selection

Use Claude Haiku 4.5 for cost efficiency (~$0.005 per analysis). Only escalate to Sonnet for complex multi-file failures that need deeper codebase analysis.

Validation Checklist

After implementing, verify:

CI failures trigger the analysis workflow
PR comment appears with collapsed sections
Each failure has: error message, root cause, suggested fix, classification
Comment is updated (not duplicated) on subsequent pushes
Recurring issues are detected across runs
Flaky tests are auto-retried when detected
Cost per analysis stays under $0.01
Workflow does not trigger on non-PR failures (push to main)

ci-failure-analysis

More from this repository

More from this repository

CI Failure Analysis

Before Starting

Architecture

Key Implementation Details

Workflow Trigger

Required Permissions

Required Secrets

claude-code-action Configuration

Failure Classification Taxonomy

PR Comment Format

Comment Deduplication

Recurring Issue Detection

Flaky Test Auto-Retry

cargo-nextest Setup

Model Selection

Validation Checklist

CI Failure Analysis

Before Starting

Architecture

Key Implementation Details

Workflow Trigger

Required Permissions

Required Secrets

claude-code-action Configuration

Failure Classification Taxonomy

PR Comment Format

Comment Deduplication

Recurring Issue Detection

Flaky Test Auto-Retry

cargo-nextest Setup

Model Selection

Validation Checklist