원클릭으로 Manus에서 모든 스킬 실행

$pwd:

kayba-pipeline

Name: Kayba Pipeline
Author: kayba-ai

// End-to-end agent evaluation and improvement pipeline. Takes a traces folder and optional HITL flag, then orchestrates sub-agents through 7 stages — each stage is its own skill invoked by a dedicated sub-agent. Trigger when the user says "run the pipeline", "kayba pipeline", "evaluate and fix", "full eval", "analyze traces and fix", or provides a traces folder with intent to improve their agent.

Manus에서 실행

$ git log --oneline --stat

stars:2,243

forks:277

updated:2026년 3월 17일 10:21

파일 탐색기

8 개 파일

SKILL.md

readonly

name	kayba-pipeline
description	End-to-end agent evaluation and improvement pipeline. Takes a traces folder and optional HITL flag, then orchestrates sub-agents through 7 stages — each stage is its own skill invoked by a dedicated sub-agent. Trigger when the user says "run the pipeline", "kayba pipeline", "evaluate and fix", "full eval", "analyze traces and fix", or provides a traces folder with intent to improve their agent.

kayba-pipeline

End-to-end pipeline: analyze traces → define metrics → build rubric → plan fixes → implement fixes.

Each stage is a separate skill file that can be run independently or as part of this pipeline.

Inputs

The user provides two things:

TRACES_FOLDER — path to a directory containing trace JSON files
HITL — true or false — whether to pause for human review before implementing fixes

If the user doesn't specify HITL, default to true (safe default).

Pipeline overview

┌─────────────────────────────────────────────────────────────────────┐
│  Stage 1: Kayba API Analysis        → skill: kayba-pipeline:stage-1-api-analysis   │
│  Stage 2: Domain Context Gathering  → skill: kayba-pipeline:stage-2-domain-context │
│  ─── stages 1 & 2 run in parallel ───                                              │
│  Stage 3: Metrics & Analysis        → skill: kayba-pipeline:stage-3-metrics        │
│  Stage 4: Rubric Definition         → skill: kayba-pipeline:stage-4-rubric         │
│  Stage 5: Action Plan               → skill: kayba-pipeline:stage-5-action-plan    │
│  Stage 6: HITL Gate                 → skill: kayba-pipeline:stage-6-hitl           │
│  Stage 7: Fix Implementation        → skill: kayba-pipeline:stage-7-fixer          │
└─────────────────────────────────────────────────────────────────────┘

Orchestration instructions

You are the orchestrator. Your job is to:

Create the eval/ directory and eval/pipeline_log.md
Spawn sub-agents that invoke stage skills via the Skill tool
Coordinate stage ordering and handle the HITL gate

Setup

Create eval/ directory and initialize eval/pipeline_log.md:

# Pipeline Log

| Stage | Name | Status | Started | Completed | Notes |
|-------|------|--------|---------|-----------|-------|
| 1 | Kayba API Analysis | pending | | | |
| 2 | Domain Context | pending | | | |
| 3 | Metrics & Analysis | pending | | | |
| 4 | Rubric Definition | pending | | | |
| 5 | Action Plan | pending | | | |
| 6 | HITL Gate | pending | | | |
| 7 | Fix Implementation | pending | | | |

Stages 1 & 2 — run in parallel

Spawn two sub-agents in parallel using the Agent tool:

Agent 1:

Name: api-analyst
Type: general-purpose
Prompt: Invoke the skill "kayba-pipeline:stage-1-api-analysis" using the Skill tool. The traces folder is: {TRACES_FOLDER}. Follow the skill instructions completely.

Agent 2:

Name: domain-scout
Type: general-purpose
Prompt: Invoke the skill "kayba-pipeline:stage-2-domain-context" using the Skill tool. The traces folder is: {TRACES_FOLDER}. Follow the skill instructions completely.

Wait for both to complete before proceeding.

Stage 3 — sequential

Spawn one sub-agent after stages 1 & 2 complete:

Name: metric-engineer
Type: general-purpose
Prompt: Invoke the skill "kayba-pipeline:stage-3-metrics" using the Skill tool. The traces folder is: {TRACES_FOLDER}. Follow the skill instructions completely — this includes iterating on the metrics until you're satisfied.

Stage 4 — sequential

Spawn one sub-agent after stage 3 completes:

Name: rubric-builder
Type: general-purpose
Prompt: Invoke the skill "kayba-pipeline:stage-4-rubric" using the Skill tool. Follow the skill instructions completely.

Stage 5 — sequential

Spawn one sub-agent after stage 4 completes:

Name: action-planner
Type: general-purpose
Prompt: Invoke the skill "kayba-pipeline:stage-5-action-plan" using the Skill tool. Follow the skill instructions completely.

Stage 6 — HITL Gate

If HITL is true:

Spawn one sub-agent after stage 5 completes:

Name: hitl-reviewer
Type: general-purpose
Prompt: Invoke the skill "kayba-pipeline:stage-6-hitl" using the Skill tool. Follow the skill instructions completely. Present the full review to the user and collect their decision before proceeding.

Wait for the sub-agent to complete. Check eval/stage6_decision.md for the outcome:

If decision is "Approve all" or "Approve with modifications" — proceed to Stage 7
If decision is "Reject" — re-run Stage 5 with the user feedback recorded in eval/stage6_decision.md, then re-run Stage 6
Only proceed to Stage 7 after a clear approval is recorded

If HITL is false:

Skip to Stage 7
Log "HITL skipped" in eval/pipeline_log.md

Stage 7 — sequential

Spawn one sub-agent after stage 6 completes (or is skipped):

Name: fixer
Type: general-purpose
Prompt: Invoke the skill "kayba-pipeline:stage-7-fixer" using the Skill tool. Follow the skill instructions completely.

Error handling

If any stage fails, log the failure in eval/pipeline_log.md with the stage number and error
Do not proceed to dependent stages if a prerequisite failed
If Stage 1 fails (kayba CLI issues), ask the user whether to proceed without API insights — if yes, skip Stage 1 and have Stage 3 work from domain context + raw traces only

After completion

Update eval/pipeline_log.md with final status for all stages. Report to the user:

How many stages completed successfully
Summary of metrics (from rubric)
Summary of fixes applied (from changes log)

related-skills.json

같은 저장소

kayba-stage-1-api-analysis.md

from "kayba-ai/agentic-context-engine"

Fetch pre-computed insights from the Kayba API and build a structured summary. Does NOT upload traces or trigger generation — analysis is assumed to already exist. Trigger when the user says "run stage 1", "get insights", "fetch skills", "kayba analyze", or when invoked by the kayba-pipeline orchestrator. Requires the kayba CLI to be installed and KAYBA_API_KEY to be set.

2026-03-172.2k

kayba-stage-2-domain-context.md

from "kayba-ai/agentic-context-engine"

Gather domain context about the repository and agent — system prompt, tool definitions, domain docs, and behavior patterns from traces. Trigger when the user says "run stage 2", "gather context", "domain context", or when invoked by the kayba-pipeline orchestrator.

2026-03-172.2k

kayba-stage-3-metrics.md

from "kayba-ai/agentic-context-engine"

Define metrics from Kayba insights, implement them as Python measurement code, run against traces, and iterate until the metrics are clean and meaningful. Trigger when the user says "run stage 3", "define metrics", "build metrics", "compute baselines", or when invoked by the kayba-pipeline orchestrator. Requires eval/stage1_insights_summary.md and eval/stage2_domain_context.md to exist.

2026-03-172.2k

kayba-stage-4-rubric.md

from "kayba-ai/agentic-context-engine"

Organize computed metrics into a tiered evaluation rubric with leading, lagging, and quality indicators. Trigger when the user says "run stage 4", "build rubric", "tier metrics", or when invoked by the kayba-pipeline orchestrator. Requires eval/baseline_metrics.json and eval/compute_baselines.py to exist.

2026-03-172.2k

kayba-stage-5-action-plan.md

from "kayba-ai/agentic-context-engine"

Triage each insight into discard/code-fix/prompt-fix and produce a prioritized action plan with specific recommendations. Trigger when the user says "run stage 5", "make action plan", "triage skills", or when invoked by the kayba-pipeline orchestrator. Requires eval outputs from stages 1-4.

2026-03-172.2k

kayba-stage-6-hitl.md

from "kayba-ai/agentic-context-engine"

Human-In-The-Loop gate that presents the action plan with full context, collects an informed approval/modification/rejection decision, and records the outcome. Trigger when the user says "run stage 6", "HITL review", "approve action plan", or when invoked by the kayba-pipeline orchestrator. Requires eval/action_plan.md and eval/baseline_metrics.md to exist.

2026-03-172.2k

package.json

"author": "kayba-ai"

"repository": "kayba-ai/agentic-context-engine"

GitHub 저장소 열기 Creator 저장소 보기

$ install --global

$ download --local

Manus에서 실행

$ useful --forSOC

소프트웨어 개발자컴퓨터 및 수학직15-1252L4

name	kayba-pipeline
description	End-to-end agent evaluation and improvement pipeline. Takes a traces folder and optional HITL flag, then orchestrates sub-agents through 7 stages — each stage is its own skill invoked by a dedicated sub-agent. Trigger when the user says "run the pipeline", "kayba pipeline", "evaluate and fix", "full eval", "analyze traces and fix", or provides a traces folder with intent to improve their agent.

kayba-pipeline

End-to-end pipeline: analyze traces → define metrics → build rubric → plan fixes → implement fixes.

Each stage is a separate skill file that can be run independently or as part of this pipeline.

Inputs

The user provides two things:

TRACES_FOLDER — path to a directory containing trace JSON files
HITL — true or false — whether to pause for human review before implementing fixes

If the user doesn't specify HITL, default to true (safe default).

Pipeline overview

┌─────────────────────────────────────────────────────────────────────┐
│  Stage 1: Kayba API Analysis        → skill: kayba-pipeline:stage-1-api-analysis   │
│  Stage 2: Domain Context Gathering  → skill: kayba-pipeline:stage-2-domain-context │
│  ─── stages 1 & 2 run in parallel ───                                              │
│  Stage 3: Metrics & Analysis        → skill: kayba-pipeline:stage-3-metrics        │
│  Stage 4: Rubric Definition         → skill: kayba-pipeline:stage-4-rubric         │
│  Stage 5: Action Plan               → skill: kayba-pipeline:stage-5-action-plan    │
│  Stage 6: HITL Gate                 → skill: kayba-pipeline:stage-6-hitl           │
│  Stage 7: Fix Implementation        → skill: kayba-pipeline:stage-7-fixer          │
└─────────────────────────────────────────────────────────────────────┘

Orchestration instructions

You are the orchestrator. Your job is to:

Create the eval/ directory and eval/pipeline_log.md
Spawn sub-agents that invoke stage skills via the Skill tool
Coordinate stage ordering and handle the HITL gate

Setup

Create eval/ directory and initialize eval/pipeline_log.md:

# Pipeline Log

| Stage | Name | Status | Started | Completed | Notes |
|-------|------|--------|---------|-----------|-------|
| 1 | Kayba API Analysis | pending | | | |
| 2 | Domain Context | pending | | | |
| 3 | Metrics & Analysis | pending | | | |
| 4 | Rubric Definition | pending | | | |
| 5 | Action Plan | pending | | | |
| 6 | HITL Gate | pending | | | |
| 7 | Fix Implementation | pending | | | |

Stages 1 & 2 — run in parallel

Spawn two sub-agents in parallel using the Agent tool:

Agent 1:

Name: api-analyst
Type: general-purpose
Prompt: Invoke the skill "kayba-pipeline:stage-1-api-analysis" using the Skill tool. The traces folder is: {TRACES_FOLDER}. Follow the skill instructions completely.

Agent 2:

Name: domain-scout
Type: general-purpose
Prompt: Invoke the skill "kayba-pipeline:stage-2-domain-context" using the Skill tool. The traces folder is: {TRACES_FOLDER}. Follow the skill instructions completely.

Wait for both to complete before proceeding.

Stage 3 — sequential

Spawn one sub-agent after stages 1 & 2 complete:

Name: metric-engineer
Type: general-purpose
Prompt: Invoke the skill "kayba-pipeline:stage-3-metrics" using the Skill tool. The traces folder is: {TRACES_FOLDER}. Follow the skill instructions completely — this includes iterating on the metrics until you're satisfied.

Stage 4 — sequential

Spawn one sub-agent after stage 3 completes:

Name: rubric-builder
Type: general-purpose
Prompt: Invoke the skill "kayba-pipeline:stage-4-rubric" using the Skill tool. Follow the skill instructions completely.

Stage 5 — sequential

Spawn one sub-agent after stage 4 completes:

Name: action-planner
Type: general-purpose
Prompt: Invoke the skill "kayba-pipeline:stage-5-action-plan" using the Skill tool. Follow the skill instructions completely.

Stage 6 — HITL Gate

If HITL is true:

Spawn one sub-agent after stage 5 completes:

Name: hitl-reviewer
Type: general-purpose
Prompt: Invoke the skill "kayba-pipeline:stage-6-hitl" using the Skill tool. Follow the skill instructions completely. Present the full review to the user and collect their decision before proceeding.

Wait for the sub-agent to complete. Check eval/stage6_decision.md for the outcome:

If decision is "Approve all" or "Approve with modifications" — proceed to Stage 7
If decision is "Reject" — re-run Stage 5 with the user feedback recorded in eval/stage6_decision.md, then re-run Stage 6
Only proceed to Stage 7 after a clear approval is recorded

If HITL is false:

Skip to Stage 7
Log "HITL skipped" in eval/pipeline_log.md

Stage 7 — sequential

Spawn one sub-agent after stage 6 completes (or is skipped):

Name: fixer
Type: general-purpose
Prompt: Invoke the skill "kayba-pipeline:stage-7-fixer" using the Skill tool. Follow the skill instructions completely.

Error handling

If any stage fails, log the failure in eval/pipeline_log.md with the stage number and error
Do not proceed to dependent stages if a prerequisite failed
If Stage 1 fails (kayba CLI issues), ask the user whether to proceed without API insights — if yes, skip Stage 1 and have Stage 3 work from domain context + raw traces only

After completion

Update eval/pipeline_log.md with final status for all stages. Report to the user:

How many stages completed successfully
Summary of metrics (from rubric)
Summary of fixes applied (from changes log)

kayba-pipeline

kayba-pipeline

Inputs

Pipeline overview

Orchestration instructions

Setup

Stages 1 & 2 — run in parallel

Stage 3 — sequential

Stage 4 — sequential

Stage 5 — sequential

Stage 6 — HITL Gate

Stage 7 — sequential

Error handling

After completion

이 저장소의 다른 Skills

이 저장소의 다른 Skills

kayba-pipeline

Inputs

Pipeline overview

Orchestration instructions

Setup

Stages 1 & 2 — run in parallel

Stage 3 — sequential

Stage 4 — sequential

Stage 5 — sequential

Stage 6 — HITL Gate

Stage 7 — sequential

Error handling

After completion