Ejecuta cualquier Skill en Manus
con un clic

Ejecuta cualquier Skill en Manus con un clic

$pwd:

mz-debug-ci

Name: Mz Debug Ci
Author: MaterializeInc

// Investigate CI failures on PR via gh + bk CLI. Trigger: failing checks, Buildkite failures, CI issues — "why is CI red", "build broken", "checks failing", "what went wrong in CI", "nightly broke", "tests failing on this PR", or pasted Buildkite URL. Also PR number + why failing.

Ejecutar en Manus

$ git log --oneline --stat

stars:6306

forks:504

updated:7 de mayo de 2026, 16:53

SKILL.md

readonly

name	mz-debug-ci
description	Investigate CI failures on PR via gh + bk CLI. Trigger: failing checks, Buildkite failures, CI issues — "why is CI red", "build broken", "checks failing", "what went wrong in CI", "nightly broke", "tests failing on this PR", or pasted Buildkite URL. Also PR number + why failing.
argument-hint	<PR number or GitHub PR URL>

Investigate CI failures for a Materialize PR.

Prerequisites

This skill requires both gh (GitHub CLI) and bk (Buildkite CLI) to be installed and authenticated. Before doing anything else, verify both:

which gh && gh auth status
which bk && bk auth status

If either tool is missing or unauthenticated, stop immediately and tell the user what to fix (bk configure or bk auth login for Buildkite). Do not attempt to use the REST API directly or any other workaround — this workflow only works with these CLI tools.

Both gh and bk make network requests that are blocked by the default sandbox. All Bash commands in this workflow must use dangerouslyDisableSandbox: true.

Step 1: Extract PR number

Parse $ARGUMENTS to get the PR number. Handle both formats:

Plain number: 35192
Full URL: https://github.com/MaterializeInc/materialize/pull/35192

Step 2: Find the build

Use gh to get the PR's branch name and then find the Buildkite build:

# Get the branch name for the PR
gh pr view <PR_NUMBER> --json headRefName --jq .headRefName

Alternatively, list failing checks directly:

gh pr checks <PR_NUMBER> 2>&1

Lines containing fail have tab-separated fields:

name	fail	0	https://buildkite.com/materialize/<PIPELINE>/builds/<BUILD>#<JOB_ID>	description

Extract from the URL:

Pipeline: path segment after materialize/ (usually test)
Build number: the number after builds/
Job ID: the UUID after #

Step 3: Check annotations first

Before diving into logs, fetch the build annotations. They contain pre-extracted error messages, stack traces, and links to known flaky test issues — this saves significant time compared to grepping through raw logs.

bk api /pipelines/<PIPELINE>/builds/<BUILD_NUMBER>/annotations --no-pager 2>&1

The response is JSON. Each annotation has:

style: "error" for failures
body_html: HTML containing the error summary, including:
- The specific test/job that failed
- The actual error message or stack trace in <pre><code> blocks
- Links to known flaky test issues (look for GitHub issue links like database-issues/#NNNN)
- Main branch history showing if this test passes on main (flaky test indicator)

Parse the error annotations to get a quick overview of all failures before fetching any logs.

Step 4: Fetch logs when needed

Only fetch full logs when annotations don't provide enough detail. Triage in this order:

clippy — compilation/lint errors that often explain everything
lint-and-rustfmt — formatting and lint-check failures
cargo-test — unit/integration test failures
fast-sql-logic-tests — SLT failures
testdrive — integration test failures (often cascading)
Everything else (checks-parallel, cluster-tests, dbt, etc.)

To fetch a job's log:

bk job log <JOB_ID> -p <PIPELINE> -b <BUILD_NUMBER> --no-timestamps --no-pager 2>&1 | tail -100

For large logs, first grep for errors to find the relevant section:

bk job log <JOB_ID> -p <PIPELINE> -b <BUILD_NUMBER> --no-timestamps --no-pager 2>&1 | grep -B2 -A5 'error\|FAIL\|panicked'

Fetch multiple job logs in parallel when they are independent (e.g., clippy + lint at the same time).

Step 5: Categorize failures

Use these Materialize-specific patterns to diagnose:

Clippy errors

Code lint issues in changed files. Common ones: as_conversions, needless_borrow, clone_on_ref_ptr. Fix the code, not the lint config.

`check-test-flags` lint failure

A new configuration flag was introduced but not registered in the required places:

misc/python/materialize/parallel_workload/action.py (FlipFlagsAction)
misc/python/materialize/mzcompose/__init__.py (get_variable_system_parameters / get_minimal_system_parameters / UNINTERESTING_SYSTEM_PARAMETERS)

Cargo test failures

Read the panic message or assertion diff. Common patterns:

unwrap_err() on Ok → test expected an error but the code now succeeds
assertion left == right failed → behavioral change in output

Testdrive cascades

After one test crashes environmentd, all subsequent tests in that shard fail with Name or service not known or connection closed. Only the first failure in a shard matters — everything after it is a cascade. Look for the first error: or FAIL in the log.

Testdrive shards with the same number (e.g., testdrive-10 and testdrive-with-alloydb-10) run the same tests — if both fail, it's likely to be the same root cause.

SLT failures

Check whether it's wrong output (behavioral change) vs. connection error (crash/timeout). Wrong output means the query semantics changed.

Step 6: Summarize

Group failures by root cause, not by job name. Typically many failing jobs share just 1-2 root causes. Present the summary as:

Root cause A — description, which jobs it affects, what to fix
Root cause B — description, which jobs it affects, what to fix

Distinguish between issues that are clearly caused by the PR's changes vs. pre-existing flaky tests. The annotations often link to known flaky test issues (GitHub database-issues links) — use these to identify pre-existing flakes vs. regressions introduced by the PR.

related-skills.json

mismo repositorio

mz-dbt-release.md

from "MaterializeInc/materialize"

Cut a dbt-materialize PyPI release: bump the version in `__version__.py` and `setup.py`, date the `Unreleased` CHANGELOG entry, and open the release PR with a `Ship: <url>` body. Trigger: "cut a dbt release", "release dbt-materialize", "release the dbt adapter", "ship dbt-materialize vX.Y.Z", "publish dbt-materialize to PyPI", "bump dbt-materialize version", "new dbt adapter version". Use this skill even if the user just says "ship the dbt adapter" or pastes a feature PR and asks for "the next dbt release" without naming version mechanics.

2026-05-206.3k

mz-adapter-guide.md

from "MaterializeInc/materialize"

Correctness invariants + architecture: adapter, coordinator, pgwire, peek paths, timestamp oracle. Trigger: questions about these subsystems — "how does coordinator work", "what are read holds", "explain peek path", "how does timestamp selection work", "why does this query block". Also edits in src/adapter/, src/pgwire/, src/timestamp-oracle/.

2026-05-076.3k

mz-benchmark.md

from "MaterializeInc/materialize"

Add/modify/debug Materialize perf benchmark scenarios. Three frameworks: Feature Benchmark (single-op micro), Scalability Test (SQL throughput under concurrency), Parallel Benchmark (sustained latency via scenarios.py). Trigger: "benchmark", "feature benchmark", "scalability test", "parallel benchmark", "performance regression", "micro-benchmark", "TPS", "latency test", or edits in feature_benchmark/scenarios/, scalability/workload/workloads/, parallel_benchmark/scenarios.py. Note: measurement, not panic-stress (see mz-parallel-workload).

2026-05-076.3k

mz-commit.md

from "MaterializeInc/materialize"

Trigger: "commit", "prepare commit", "create PR", "push", "open pull request", or mentions committing, pre-commit checks, pull requests in Materialize. Also "ship it", "ready to merge". For code review use mz-pr-review.

2026-05-076.3k

mz-limits-test.md

from "MaterializeInc/materialize"

Add/modify/debug limits test. Trigger: "limits test", "Generator subclass", "many objects", "scaling test", or stress-test Materialize with many objects (tables, views, sources, indexes). Also edits in test/limits/mzcompose.py.

2026-05-076.3k

mz-parallel-workload.md

from "MaterializeInc/materialize"

Extend parallel-workload stress framework: random SQL concurrently to catch panics + unexpected errors (not perf — see mz-benchmark). Trigger: "parallel workload", "parallel-workload", "action.py" re parallel workload, or testing panics/unexpected errors under concurrency. Also "add this to parallel workload" or bug that panics under concurrent DDL/DML.

2026-05-076.3k

package.json

"author": "MaterializeInc"

"repository": "MaterializeInc/materialize"

Abrir repositorio de GitHub Ver repositorios del creador

$ install --global

$ download --local

Ejecutar en Manus

$ useful --forSOC

Administradores de redes y sistemas informáticosOcupaciones informáticas y matemáticas15-1244L4

name	mz-debug-ci
description	Investigate CI failures on PR via gh + bk CLI. Trigger: failing checks, Buildkite failures, CI issues — "why is CI red", "build broken", "checks failing", "what went wrong in CI", "nightly broke", "tests failing on this PR", or pasted Buildkite URL. Also PR number + why failing.
argument-hint	<PR number or GitHub PR URL>

Investigate CI failures for a Materialize PR.

Prerequisites

This skill requires both gh (GitHub CLI) and bk (Buildkite CLI) to be installed and authenticated. Before doing anything else, verify both:

which gh && gh auth status
which bk && bk auth status

Both gh and bk make network requests that are blocked by the default sandbox. All Bash commands in this workflow must use dangerouslyDisableSandbox: true.

Step 1: Extract PR number

Parse $ARGUMENTS to get the PR number. Handle both formats:

Plain number: 35192
Full URL: https://github.com/MaterializeInc/materialize/pull/35192

Step 2: Find the build

Use gh to get the PR's branch name and then find the Buildkite build:

# Get the branch name for the PR
gh pr view <PR_NUMBER> --json headRefName --jq .headRefName

Alternatively, list failing checks directly:

gh pr checks <PR_NUMBER> 2>&1

Lines containing fail have tab-separated fields:

name	fail	0	https://buildkite.com/materialize/<PIPELINE>/builds/<BUILD>#<JOB_ID>	description

Extract from the URL:

Pipeline: path segment after materialize/ (usually test)
Build number: the number after builds/
Job ID: the UUID after #

Step 3: Check annotations first

bk api /pipelines/<PIPELINE>/builds/<BUILD_NUMBER>/annotations --no-pager 2>&1

The response is JSON. Each annotation has:

style: "error" for failures
body_html: HTML containing the error summary, including:
- The specific test/job that failed
- The actual error message or stack trace in <pre><code> blocks
- Links to known flaky test issues (look for GitHub issue links like database-issues/#NNNN)
- Main branch history showing if this test passes on main (flaky test indicator)

Parse the error annotations to get a quick overview of all failures before fetching any logs.

Step 4: Fetch logs when needed

Only fetch full logs when annotations don't provide enough detail. Triage in this order:

clippy — compilation/lint errors that often explain everything
lint-and-rustfmt — formatting and lint-check failures
cargo-test — unit/integration test failures
fast-sql-logic-tests — SLT failures
testdrive — integration test failures (often cascading)
Everything else (checks-parallel, cluster-tests, dbt, etc.)

To fetch a job's log:

bk job log <JOB_ID> -p <PIPELINE> -b <BUILD_NUMBER> --no-timestamps --no-pager 2>&1 | tail -100

For large logs, first grep for errors to find the relevant section:

bk job log <JOB_ID> -p <PIPELINE> -b <BUILD_NUMBER> --no-timestamps --no-pager 2>&1 | grep -B2 -A5 'error\|FAIL\|panicked'

Fetch multiple job logs in parallel when they are independent (e.g., clippy + lint at the same time).

Step 5: Categorize failures

Use these Materialize-specific patterns to diagnose:

Clippy errors

Code lint issues in changed files. Common ones: as_conversions, needless_borrow, clone_on_ref_ptr. Fix the code, not the lint config.

`check-test-flags` lint failure

A new configuration flag was introduced but not registered in the required places:

misc/python/materialize/parallel_workload/action.py (FlipFlagsAction)
misc/python/materialize/mzcompose/__init__.py (get_variable_system_parameters / get_minimal_system_parameters / UNINTERESTING_SYSTEM_PARAMETERS)

Cargo test failures

Read the panic message or assertion diff. Common patterns:

unwrap_err() on Ok → test expected an error but the code now succeeds
assertion left == right failed → behavioral change in output

Testdrive cascades

Testdrive shards with the same number (e.g., testdrive-10 and testdrive-with-alloydb-10) run the same tests — if both fail, it's likely to be the same root cause.

SLT failures

Check whether it's wrong output (behavioral change) vs. connection error (crash/timeout). Wrong output means the query semantics changed.

Step 6: Summarize

Group failures by root cause, not by job name. Typically many failing jobs share just 1-2 root causes. Present the summary as:

Root cause A — description, which jobs it affects, what to fix
Root cause B — description, which jobs it affects, what to fix

mz-debug-ci

Prerequisites

Step 1: Extract PR number

Step 2: Find the build

Step 3: Check annotations first

Step 4: Fetch logs when needed

Step 5: Categorize failures

Clippy errors

check-test-flags lint failure

Cargo test failures

Testdrive cascades

SLT failures

Step 6: Summarize

Más de este repositorio

Más de este repositorio

Prerequisites

Step 1: Extract PR number

Step 2: Find the build

Step 3: Check annotations first

Step 4: Fetch logs when needed

Step 5: Categorize failures

Clippy errors

check-test-flags lint failure

Cargo test failures

Testdrive cascades

SLT failures

Step 6: Summarize

`check-test-flags` lint failure

`check-test-flags` lint failure