원클릭으로 Manus에서 모든 스킬 실행

$pwd:

tiered-acceptance

Name: Tiered Acceptance
Author: AgentWorkforce

// Use when a single gate would require deep proof across a large set (44 models × 10 providers; 60+ resources; every region; every plan tier). Splits the acceptance into tier-1 (deep proof on high-volume / high-fidelity slice) and tier-2 (smoke proof on low-volume tail), documents the explicit accepted trade-off, and preserves safety through dormant-default + per-item enable + rollback.

Manus에서 실행

$ git log --oneline --stat

stars:1

forks:0

updated:2026년 5월 20일 10:17

SKILL.md

readonly

name	tiered-acceptance
description	Use when a single gate would require deep proof across a large set (44 models × 10 providers; 60+ resources; every region; every plan tier). Splits the acceptance into tier-1 (deep proof on high-volume / high-fidelity slice) and tier-2 (smoke proof on low-volume tail), documents the explicit accepted trade-off, and preserves safety through dormant-default + per-item enable + rollback.

Tiered acceptance

When a single gate's acceptance criterion would require deep proof across a large set — every model × every provider, every resource × every adapter, every region — the gate becomes the long pole of the run. Either deliver less, ship without proof on the tail, or split. Splitting honestly is the option this skill encodes.

When to invoke

A gate whose ideal acceptance is "all N variants pass deeply" with N large (say ≥20).
A long-pole gate that's blocking the rest of the scoreboard from going GREEN, with most of the work in the tail.
A migration where the high-volume slice covers >90% of real traffic but the long-tail variants exist and have non-zero traffic.

If N is small (<10) or the variants are uniform (same adapter, parameterized), don't tier — prove all of them.

The split

Divide the set into two tiers by realized criticality — not by alphabetical convenience, not by ease.

Tier 1 — full proof

The slice where:

Traffic / usage is concentrated (cover the top X% by volume — pick X to clear a defensible threshold, often 90%).
Failure has highest blast radius (the integration the operator's customers actually use).
Fidelity matters (provider parity is the explicit promise; long-tail divergence is less promised).

For tier-1, the acceptance is the full proof from the original gate intent: deep behavioral parity test, sustained-load test, the failure-class regression test running RED without the fix, cross-PR composition audit, the works.

Tier 2 — smoke proof

The long tail. Acceptance is reduced to: "the path executes without error on a representative sample." Not the deep parity proof. Not the sustained load.

Smoke proof concretely means:

A single instance per tier-2 variant runs the happy path end-to-end and succeeds.
The variant's wiring exists (registry entry, adapter, schema) and matches the structural-proof invariants from swarm-blockers-and-gate-scoreboard.
Failure of a tier-2 variant in production is recoverable by the rollback procedure or by per-item disable.

The tier-2 acceptance is honestly weaker. The trade-off is named.

Safety preservation for tier-2

Tier-2 ships under reduced proof. The safety net comes from the dormant-flip + per-item-enable pattern (dormant-flip-and-rollback):

Dormant by default. Tier-2 variants are not enabled on merge.
Per-item enable. The flip mechanism enables variants one at a time (per provider, per model, per region). The operator chooses the rollout order: usually tier-1 first (deep-proof gives high confidence; observe in prod), tier-2 in batches (lower confidence; observe more carefully).
Per-item rollback. A misbehaving tier-2 variant is disabled by removing one entry from the enable list. The rollback is variant-scoped, not run-scoped. The other variants stay enabled.

This is the trade-off mitigated: deep proof on the high-fidelity slice, smoke proof on the tail, but in production each variant is enabled and observable independently, and any single variant's failure is recoverable without disrupting the others.

Documentation

The split is declared in the contract before tier-2 work begins:

Contract §4 (pre-flip gates) lists the tiered gate as two entries: "Gate G7a — tier-1 full proof for {enumerated variants}" and "Gate G7b — tier-2 smoke proof for {enumerated variants}".
Contract §6 (rollback triggers) names the per-item rollback command and the threshold per variant (often the same threshold as the run-level, just scoped to the variant's metrics).
Contract §8 (escalate) lists the conditions that would force a tier-2 variant into tier-1 (e.g. "if the variant's traffic exceeds X% of total, promote to tier-1 and add deep proof before re-enabling").

Surface the split to the operator at contract-authoring time. Get explicit acknowledgement of the tier-2 reduced proof. Without that, the operator may have assumed full proof and the trade-off was never theirs to accept.

Enumeration discipline

The variant list — both tiers — must be enumerated from a generated source of truth, never hand-maintained. The drift class is the same as the cloud-repo REPO_DECLARED_NANGO_PROVIDER_MODELS lesson: a hand-list that mirrors a generated list silently drifts, and a missing entry silently disables the variant.

Generate the variant list from the registry / adapter directory / config.
Assert at compile time (or in the structural proof) that the generated list and the tier-1 + tier-2 enumerations are exhaustive and disjoint.
Drift in either direction (a variant in the registry that's in neither tier; a variant in a tier that's not in the registry) is a CI failure.

Promotion criteria

A tier-2 variant may need to be promoted to tier-1 mid-run or post-flip:

Its observed traffic share grows past a threshold the contract names.
A real incident on the variant exposes a failure class smoke proof would not catch.
The operator's product priorities change.

Promotion is not a silent step. It re-opens the gate scoreboard: the promoted variant gets deep proof, the gate that was GREEN under tiered acceptance returns to AMBER until the new proof lands.

Anti-patterns

Implicit tiering — "we'll deep-test the important ones and smoke-test the rest" without writing down which is which. Operator-trade-off is never explicit; coverage becomes wishful.
Hand-maintained tier lists. Drift = silent regression.
Smoke-proof masquerading as full proof — a tier-2 variant's smoke test labeled "passed" in the report without the tier distinction noted. Hides the trade-off from anyone reading the report later.
Skipping the per-item-enable pattern. Tiered acceptance without per-item enable means a bad tier-2 variant takes down all variants together. The trade-off is no longer mitigated.
Letting tier-2 enable on merge. Bypasses the dormant-default safety. If tier-2 is enabled on merge, the smoke proof was effectively full-proof's responsibility; the trade-off was an illusion.

What this skill does NOT cover

The dormant-default + per-item-enable mechanism itself (covered by dormant-flip-and-rollback).
The contract sections that document the tiering and the operator acknowledgement (covered by autonomous-run-contract).
The gate scoreboard rows for the tiered gates (covered by swarm-blockers-and-gate-scoreboard).
The cross-PR composition risk when tier-2 lands in multiple PRs (covered by auto-merge-and-composition-safety).

related-skills.json

같은 저장소

auto-merge-and-composition-safety.md

from "AgentWorkforce/workforce"

Use before auto-merging any PR in an autonomous run, and between consecutive merges that touch overlapping code. Covers the per-PR auto-merge bar (live CI verification, substantive review by area, bot-finding stale-vs-actionable triage) and the cross-PR composition discipline (serialize through green main, rebase + re-CI between merges, dormant-safety audit, force-reset over half-merged commits).

2026-05-201

autonomous-run-contract.md

from "AgentWorkforce/workforce"

Use at the start of every autonomous, multi-PR, cutover-class delegated run. Authors the binding contract between operator and agent — the gates, the flip mechanism, the rollback triggers, the standing constraints, and the explicit escalate-to-human conditions — and surfaces it to the operator for explicit grant of (a) auto-merge authority, (b) flip-the-switch authority, (c) swarm-blockers authority. No autonomous work begins until the contract is acknowledged.

2026-05-201

dormant-flip-and-rollback.md

from "AgentWorkforce/workforce"

Use when designing or executing a cutover-class change (migration, ingestion path swap, provider relink, flag enablement). Encodes the dark-launch + single-switch flip pattern, the refusal to flip on amber, and the pre-authorized rollback procedure — the canonical way to make irreversible-feeling infra changes reversible-feeling.

2026-05-201

instrument-dont-guess.md

from "AgentWorkforce/workforce"

Use when a fix has failed two consecutive times for the same symptom. Encodes the discipline that the third action must be a temporary diagnostic (a /_diag endpoint, an enriched structured log, a runtime-captured snapshot) rather than another fix attempt. Also covers the discipline for removing those diagnostics once the real root cause is in.

2026-05-201

swarm-blockers-and-gate-scoreboard.md

from "AgentWorkforce/workforce"

Use during an autonomous run to (a) dispatch supporting codex-impl + claude-review agent pairs against hard blockers when the orchestrator cannot make progress alone, and (b) maintain the live RED / GREEN gate scoreboard the orchestrator reads to authorize the flip. Encodes the file-based reporting convention that keeps the channel readable.

2026-05-201

persona-mcp-servers.md

from "AgentWorkforce/workforce"

Use when authoring an AgentWorkforce persona's `mcpServers` field — covers the two spec variants (http/sse vs stdio), `$VAR` secret substitution, the claude/codex/opencode harness support matrix that constrains harness selection, and the `permissions.allow` pairing for `mcp__<server>` tools

2026-05-131

package.json

"author": "AgentWorkforce"

"repository": "AgentWorkforce/workforce"

GitHub 저장소 열기 Creator 저장소 보기

$ install --global

$ download --local

Manus에서 실행

$ useful --forSOC

소프트웨어 품질 보증 분석가·테스터컴퓨터 및 수학직15-1253L4

name	tiered-acceptance
description	Use when a single gate would require deep proof across a large set (44 models × 10 providers; 60+ resources; every region; every plan tier). Splits the acceptance into tier-1 (deep proof on high-volume / high-fidelity slice) and tier-2 (smoke proof on low-volume tail), documents the explicit accepted trade-off, and preserves safety through dormant-default + per-item enable + rollback.

Tiered acceptance

When to invoke

A gate whose ideal acceptance is "all N variants pass deeply" with N large (say ≥20).
A long-pole gate that's blocking the rest of the scoreboard from going GREEN, with most of the work in the tail.
A migration where the high-volume slice covers >90% of real traffic but the long-tail variants exist and have non-zero traffic.

If N is small (<10) or the variants are uniform (same adapter, parameterized), don't tier — prove all of them.

The split

Divide the set into two tiers by realized criticality — not by alphabetical convenience, not by ease.

Tier 1 — full proof

The slice where:

Traffic / usage is concentrated (cover the top X% by volume — pick X to clear a defensible threshold, often 90%).
Failure has highest blast radius (the integration the operator's customers actually use).
Fidelity matters (provider parity is the explicit promise; long-tail divergence is less promised).

Tier 2 — smoke proof

The long tail. Acceptance is reduced to: "the path executes without error on a representative sample." Not the deep parity proof. Not the sustained load.

Smoke proof concretely means:

A single instance per tier-2 variant runs the happy path end-to-end and succeeds.
The variant's wiring exists (registry entry, adapter, schema) and matches the structural-proof invariants from swarm-blockers-and-gate-scoreboard.
Failure of a tier-2 variant in production is recoverable by the rollback procedure or by per-item disable.

The tier-2 acceptance is honestly weaker. The trade-off is named.

Safety preservation for tier-2

Tier-2 ships under reduced proof. The safety net comes from the dormant-flip + per-item-enable pattern (dormant-flip-and-rollback):

Dormant by default. Tier-2 variants are not enabled on merge.
Per-item enable. The flip mechanism enables variants one at a time (per provider, per model, per region). The operator chooses the rollout order: usually tier-1 first (deep-proof gives high confidence; observe in prod), tier-2 in batches (lower confidence; observe more carefully).
Per-item rollback. A misbehaving tier-2 variant is disabled by removing one entry from the enable list. The rollback is variant-scoped, not run-scoped. The other variants stay enabled.

Documentation

The split is declared in the contract before tier-2 work begins:

Contract §4 (pre-flip gates) lists the tiered gate as two entries: "Gate G7a — tier-1 full proof for {enumerated variants}" and "Gate G7b — tier-2 smoke proof for {enumerated variants}".
Contract §6 (rollback triggers) names the per-item rollback command and the threshold per variant (often the same threshold as the run-level, just scoped to the variant's metrics).
Contract §8 (escalate) lists the conditions that would force a tier-2 variant into tier-1 (e.g. "if the variant's traffic exceeds X% of total, promote to tier-1 and add deep proof before re-enabling").

Enumeration discipline

Generate the variant list from the registry / adapter directory / config.
Assert at compile time (or in the structural proof) that the generated list and the tier-1 + tier-2 enumerations are exhaustive and disjoint.
Drift in either direction (a variant in the registry that's in neither tier; a variant in a tier that's not in the registry) is a CI failure.

Promotion criteria

A tier-2 variant may need to be promoted to tier-1 mid-run or post-flip:

Its observed traffic share grows past a threshold the contract names.
A real incident on the variant exposes a failure class smoke proof would not catch.
The operator's product priorities change.

Promotion is not a silent step. It re-opens the gate scoreboard: the promoted variant gets deep proof, the gate that was GREEN under tiered acceptance returns to AMBER until the new proof lands.

Anti-patterns

Implicit tiering — "we'll deep-test the important ones and smoke-test the rest" without writing down which is which. Operator-trade-off is never explicit; coverage becomes wishful.
Hand-maintained tier lists. Drift = silent regression.
Smoke-proof masquerading as full proof — a tier-2 variant's smoke test labeled "passed" in the report without the tier distinction noted. Hides the trade-off from anyone reading the report later.
Skipping the per-item-enable pattern. Tiered acceptance without per-item enable means a bad tier-2 variant takes down all variants together. The trade-off is no longer mitigated.
Letting tier-2 enable on merge. Bypasses the dormant-default safety. If tier-2 is enabled on merge, the smoke proof was effectively full-proof's responsibility; the trade-off was an illusion.

What this skill does NOT cover

The dormant-default + per-item-enable mechanism itself (covered by dormant-flip-and-rollback).
The contract sections that document the tiering and the operator acknowledgement (covered by autonomous-run-contract).
The gate scoreboard rows for the tiered gates (covered by swarm-blockers-and-gate-scoreboard).
The cross-PR composition risk when tier-2 lands in multiple PRs (covered by auto-merge-and-composition-safety).

tiered-acceptance

Tiered acceptance

When to invoke

The split

Tier 1 — full proof

Tier 2 — smoke proof

Safety preservation for tier-2

Documentation

Enumeration discipline

Promotion criteria

Anti-patterns

What this skill does NOT cover

이 저장소의 다른 Skills

Tiered acceptance

When to invoke

The split

Tier 1 — full proof

Tier 2 — smoke proof

Safety preservation for tier-2

Documentation

Enumeration discipline

Promotion criteria

Anti-patterns

What this skill does NOT cover

이 저장소의 다른 Skills