ワンクリックでManusで任意のスキルを実行

managing-experiment-lifecycle

Guides experiment state transitions: launching, pausing, resuming, ending, shipping variants, archiving, resetting, and duplicating. Covers preconditions, implications for variant assignment and analysis, and the decision framework for when to use each action. TRIGGER when: user asks to launch, pause, resume, end, ship, archive, reset, or duplicate an experiment. DO NOT TRIGGER when: user is creating an experiment (use creating-experiments), configuring rollout (use configuring-experiment-rollout), or setting up metrics (use configuring-experiment-analytics).

Manusで実行

スター34,943

フォーク2,841

更新日2026年4月23日 14:26

ソース

PostHog

PostHog/posthog

GitHub リポジトリを開く Creator のリポジトリを見る

インストールコマンド

ダウンロード

Manusで実行

役立つ用途SOC

市場調査アナリスト・マーケティングスペシャリストビジネス・金融業務職13-1161L4

SKILL.md

readonly

name

managing-experiment-lifecycle

description

Managing experiment lifecycle

This skill covers experiment state transitions — what each action does, when to use it, and how it affects variant assignment and analysis.

State diagram

draft ──launch──▶ running ──end──▶ stopped ──archive──▶ archived
                    │   ▲              │
                  pause resume    ship_variant
                    │   │         (also ends if running)
                    ▼   │
                  paused (flag inactive, still "running" status)

Any non-draft state ──reset──▶ draft

Actions and their implications

For each action, the two key questions:

Who sees what variant? (user perspective)
Who is in my analysis? (statistical perspective)

Launch (`experiment-launch`)

Transitions draft → running. Activates the feature flag and sets start_date.

Preconditions: must be in draft, flag needs ≥2 variants with "control" first
Pre-launch checklist: has at least one metric? Variants correct? Flag implemented in code?
Variants: users start being bucketed into variants based on the configured split
Analysis: data collection begins from start_date

No request body needed.

Pause (`experiment-pause`)

Deactivates the feature flag. Users fall back to the default experience (typically control).

Preconditions: must be running and not already paused
Variants: flag is not returned by /decide — no new exposure events recorded
Analysis: no new data while paused, but existing data is preserved. Experiment stays "running".

No request body. Use experiment-resume to reactivate.

Resume (`experiment-resume`)

Reactivates the feature flag after a pause. Users are re-bucketed deterministically into the same variants.

Preconditions: must be paused
Variants: same assignment as before pause — deterministic bucketing
Analysis: exposure tracking resumes

No request body.

End (`experiment-end`)

Sets end_date and transitions to stopped. The feature flag is NOT modified.

Preconditions: must be running (launched, not already stopped)
Variants: users continue seeing assigned variants (flag stays active)
Analysis: results frozen to data up to end_date

Optional body: conclusion ("won", "lost", "inconclusive", "stopped_early", "invalid") and conclusion_comment.

Use this when you want to freeze results without changing what users see.

Ship variant (`experiment-ship-variant`)

Rewrites the feature flag so the selected variant is served to 100% of users.

Preconditions: must be launched (running or stopped). Cannot ship from draft.
Variants: ALL users see the shipped variant. The flag is rewritten with a catch-all group.
Analysis: if still running, the experiment is also ended (end_date set)

Always confirm with the user before shipping — this permanently rewrites the feature flag.

Required: variant_key (e.g. "test"). Optional: conclusion, conclusion_comment.

Returns 409 if an approval policy requires review before the flag change.

Archive (`experiment-archive`)

Hides a stopped experiment from the default list view.

Preconditions: must be stopped (end_date set)
Variants: no change — flag is unaffected
Analysis: no change — results remain accessible

No request body. Can be restored by setting archived=false via experiment-update.

Reset (`experiment-reset`)

Returns an experiment to draft state. Clears start_date, end_date, conclusion, and archived.

Preconditions: must not already be in draft
Variants: flag is left unchanged — users continue seeing assigned variants
Analysis: previously collected data still exists but won't be included in results unless start_date is adjusted after re-launch

No request body.

Duplicate (`experiment-duplicate`)

Creates a copy as a new draft with fresh dates and no results.

Important: always provide a unique feature_flag_key different from the original. If the same key is used, both experiments share a flag — changes to one affect both.

Optional: custom name (defaults to "Original Name (Copy)").

Decision framework

Situation	Action	Tool
Draft ready, flag implemented, metrics set	Launch	`experiment-launch`
Clear winner, significant results	Ship the winning variant	`experiment-ship-variant`
No significant difference after sufficient time	End as inconclusive	`experiment-end`
Something wrong, need to stop exposure temporarily	Pause	`experiment-pause`
Resume after pause	Resume	`experiment-resume`
Experiment ended, ready to clean up	Archive	`experiment-archive`
Need to start over with same config	Reset to draft	`experiment-reset`
Want a similar experiment with a fresh start	Duplicate	`experiment-duplicate`

Resolving experiments

All lifecycle actions require an experiment ID. If you don't have one, load the finding-experiments skill to resolve the user's reference (name, description, "latest", etc.) to a concrete ID before proceeding.

Error handling

Error message	Meaning
"Experiment has already been launched."	Can't launch a non-draft experiment
"Experiment has not been launched yet."	Can't end/pause/ship a draft
"Experiment has already ended."	Can't end/pause a stopped experiment
"Experiment is already paused."	Use resume instead
"Experiment is not paused."	It's already active
"Experiment is already in draft state."	Nothing to reset
"Experiment is already archived."	Already done

When you get a 400, explain the situation to the user rather than retrying.

このリポジトリの他の Skills

同じリポジトリ

authoring-signals-scouts

PostHog/posthog

How to author, edit, and adapt PostHog Signals scouts — the scheduled agents that scan a project and emit findings into the Signals inbox. Use when a user wants to customize a canonical scout for their own setup (narrow its scope, retune its thresholds, add disqualifiers), tweak a scout's schedule or dry-run posture, or write a brand-new scout from scratch for a specific use case (a custom event, a product surface no canonical scout covers). Covers the scout SKILL.md anatomy, the emit contract, the dedupe + scratchpad-memory conventions, the per-team skills-store path vs the canonical in-repo path, and the dry-run-first test loop. Trigger on "write/edit/customize a signals scout", "new scout for X", "tune my scout schedule", "make a scout that watches <event>".

2026-06-1034.9k

exploring-signals-scouts

PostHog/posthog

How to explore and make sense of PostHog Signals scouts — the scheduled agents that scan a project and emit findings into the Signals inbox. Use when a user wants to understand what scouts they have, how each one is behaving, and whether the fleet is actually working. Covers surveying the fleet and its schedules, reading recent scout runs and drilling into a single run's reasoning, inspecting the durable scratchpad memory the fleet has built up, tracing a run to the findings it emitted, and assessing a scout's health and performance over time (cadence, success rate, emit rate, signal-to-noise). Read-only and exploratory — to write or tune a scout, use `authoring-signals-scouts` instead. Trigger on "what are my scouts doing", "how is my <x> scout performing", "show me recent scout runs", "why did this scout find/emit nothing", "what has the fleet learned", "explore scout run <id>", "is my scout working".

2026-06-1034.9k

signals-scout-ai-observability

PostHog/posthog

Focused Signals scout for PostHog projects using AI observability. Rotates through a set of lenses — cost, latency, errors, volume, eval performance, eval/enrichment config, clusters, and tool usage — watching each for trends and spikes sliced by the dimensions it discovers over time. Leans on the sandbox's bundled `exploring-llm-*` deep-dive skills for the actual queries. Emits findings only when they clear the confidence bar; otherwise writes durable memory and closes out empty. Self-contained peer in the signals-scout-* fleet — no dependencies on other scouts.

2026-06-1034.9k

signals-scout-anomaly-detection

PostHog/posthog

Signals scout that watches a PostHog project's most-viewed dashboards and insights for recent anomalies — sudden bursts, drops, flat-lines, and trend breaks at the daily or hourly level. It discovers what the team actually looks at (view counts, dashboard access), curates a durable watchlist in the scratchpad, and balances re-checking known high-value insights (exploit) against discovering new ones (explore) across runs, since no single run can cover a busy project. Anomalies are scored by robust deviation from each insight's own seasonality-matched baseline; it emits a finding only when a move clears the confidence bar, otherwise it updates the baseline memory and closes out empty. Self-contained peer in the signals-scout-* fleet.

2026-06-1034.9k

signals-scout-csp-violations

PostHog/posthog

Focused Signals scout for PostHog projects collecting Content Security Policy (CSP) violation reports. Watches `$csp_violation` events for fresh blocked-URL clusters, per-directive bursts, page-scoped regressions after deploys, and suspicious third-party domains that may indicate a compromised script. Emits aggregated findings only when a cluster clears the confidence bar; otherwise writes durable memory and closes out empty. Self-contained peer in the signals-scout-* fleet — no dependencies on other skills.

2026-06-1034.9k

signals-scout-data-pipelines

PostHog/posthog

Focused Signals scout for PostHog projects moving data through pipelines. Watches the three delivery surfaces — CDP destinations and transformations (hog functions), batch exports, and hog flows (workflows/messaging) — for contradictions between configured state and actual delivery: functions the watcher quietly degraded or disabled, failure rates stepping above a pipeline's own baseline, batch export runs failing or stalling (a growing data gap), and active flows failing for the people they trigger on. Emits findings only when they clear the confidence bar; otherwise writes durable memory and closes out empty. Self-contained peer in the signals-scout-* fleet — no dependencies on other skills.

2026-06-1034.9k

name

managing-experiment-lifecycle

description

Managing experiment lifecycle

This skill covers experiment state transitions — what each action does, when to use it, and how it affects variant assignment and analysis.

State diagram

draft ──launch──▶ running ──end──▶ stopped ──archive──▶ archived
                    │   ▲              │
                  pause resume    ship_variant
                    │   │         (also ends if running)
                    ▼   │
                  paused (flag inactive, still "running" status)

Any non-draft state ──reset──▶ draft

Actions and their implications

For each action, the two key questions:

Who sees what variant? (user perspective)
Who is in my analysis? (statistical perspective)

Launch (`experiment-launch`)

Transitions draft → running. Activates the feature flag and sets start_date.

Preconditions: must be in draft, flag needs ≥2 variants with "control" first
Pre-launch checklist: has at least one metric? Variants correct? Flag implemented in code?
Variants: users start being bucketed into variants based on the configured split
Analysis: data collection begins from start_date

No request body needed.

Pause (`experiment-pause`)

Deactivates the feature flag. Users fall back to the default experience (typically control).

Preconditions: must be running and not already paused
Variants: flag is not returned by /decide — no new exposure events recorded
Analysis: no new data while paused, but existing data is preserved. Experiment stays "running".

No request body. Use experiment-resume to reactivate.

Resume (`experiment-resume`)

Reactivates the feature flag after a pause. Users are re-bucketed deterministically into the same variants.

Preconditions: must be paused
Variants: same assignment as before pause — deterministic bucketing
Analysis: exposure tracking resumes

No request body.

End (`experiment-end`)

Sets end_date and transitions to stopped. The feature flag is NOT modified.

Preconditions: must be running (launched, not already stopped)
Variants: users continue seeing assigned variants (flag stays active)
Analysis: results frozen to data up to end_date

Optional body: conclusion ("won", "lost", "inconclusive", "stopped_early", "invalid") and conclusion_comment.

Use this when you want to freeze results without changing what users see.

Ship variant (`experiment-ship-variant`)

Rewrites the feature flag so the selected variant is served to 100% of users.

Preconditions: must be launched (running or stopped). Cannot ship from draft.
Variants: ALL users see the shipped variant. The flag is rewritten with a catch-all group.
Analysis: if still running, the experiment is also ended (end_date set)

Always confirm with the user before shipping — this permanently rewrites the feature flag.

Required: variant_key (e.g. "test"). Optional: conclusion, conclusion_comment.

Returns 409 if an approval policy requires review before the flag change.

Archive (`experiment-archive`)

Hides a stopped experiment from the default list view.

Preconditions: must be stopped (end_date set)
Variants: no change — flag is unaffected
Analysis: no change — results remain accessible

No request body. Can be restored by setting archived=false via experiment-update.

Reset (`experiment-reset`)

Returns an experiment to draft state. Clears start_date, end_date, conclusion, and archived.

Preconditions: must not already be in draft
Variants: flag is left unchanged — users continue seeing assigned variants
Analysis: previously collected data still exists but won't be included in results unless start_date is adjusted after re-launch

No request body.

Duplicate (`experiment-duplicate`)

Creates a copy as a new draft with fresh dates and no results.

Important: always provide a unique feature_flag_key different from the original. If the same key is used, both experiments share a flag — changes to one affect both.

Optional: custom name (defaults to "Original Name (Copy)").

Decision framework

Situation	Action	Tool
Draft ready, flag implemented, metrics set	Launch	`experiment-launch`
Clear winner, significant results	Ship the winning variant	`experiment-ship-variant`
No significant difference after sufficient time	End as inconclusive	`experiment-end`
Something wrong, need to stop exposure temporarily	Pause	`experiment-pause`
Resume after pause	Resume	`experiment-resume`
Experiment ended, ready to clean up	Archive	`experiment-archive`
Need to start over with same config	Reset to draft	`experiment-reset`
Want a similar experiment with a fresh start	Duplicate	`experiment-duplicate`

Resolving experiments

Error handling

Error message	Meaning
"Experiment has already been launched."	Can't launch a non-draft experiment
"Experiment has not been launched yet."	Can't end/pause/ship a draft
"Experiment has already ended."	Can't end/pause a stopped experiment
"Experiment is already paused."	Use resume instead
"Experiment is not paused."	It's already active
"Experiment is already in draft state."	Nothing to reset
"Experiment is already archived."	Already done

When you get a 400, explain the situation to the user rather than retrying.

managing-experiment-lifecycle

Managing experiment lifecycle

State diagram

Actions and their implications

Launch (experiment-launch)

Pause (experiment-pause)

Resume (experiment-resume)

End (experiment-end)

Ship variant (experiment-ship-variant)

Archive (experiment-archive)

Reset (experiment-reset)

Duplicate (experiment-duplicate)

Decision framework

Resolving experiments

Error handling

このリポジトリの他の Skills

このリポジトリの他の Skills

Managing experiment lifecycle

State diagram

Actions and their implications

Launch (experiment-launch)

Pause (experiment-pause)

Resume (experiment-resume)

End (experiment-end)

Ship variant (experiment-ship-variant)

Archive (experiment-archive)

Reset (experiment-reset)

Duplicate (experiment-duplicate)

Decision framework

Resolving experiments

Error handling

Launch (`experiment-launch`)

Pause (`experiment-pause`)

Resume (`experiment-resume`)

End (`experiment-end`)

Ship variant (`experiment-ship-variant`)

Archive (`experiment-archive`)

Reset (`experiment-reset`)

Duplicate (`experiment-duplicate`)

Launch (`experiment-launch`)

Pause (`experiment-pause`)

Resume (`experiment-resume`)

End (`experiment-end`)

Ship variant (`experiment-ship-variant`)

Archive (`experiment-archive`)

Reset (`experiment-reset`)

Duplicate (`experiment-duplicate`)