Name: Harness
Author: zereight

Pi-native evaluate→improve→persist harness. Provides a systematic loop for generating candidates, judging them against rubrics, and accumulating knowledge as playbooks. Use for iterative output improvement, not for one-off edits (use continuity) or single PR review (use zereight-review).

stars:0

forks:0

updated: May 26, 2026, 15:34

SKILL.md

name	harness
description	Pi-native evaluate→improve→persist harness. Provides a systematic loop for generating candidates, judging them against rubrics, and accumulating knowledge as playbooks. Use for iterative output improvement, not for one-off edits (use continuity) or single PR review (use zereight-review).
argument-hint	[solve|knowledge|status]

harness (Pi-native evaluation loop)

Builds on autoctx (judge/improve core) + Pi tools to provide a coherent harness.

Stack

| Layer | Source | Role |
| --- | --- | --- |
| Extension | `~/.pi/agent/extensions/harness/index.ts` | Tools + TUI + lifecycle |
| Judge engine | `autoctx` (via `pi-autocontext` dep) | LLM-based evaluation |
| Skill | `.agents/skills/harness/SKILL.md` | Usage guide + routing |

Tools

| Tool | Description |
| --- | --- |
| `harness_solve` | Full loop: generate → judge → improve → persist playbook |
| `harness_knowledge` | Read/write/list playbooks (`knowledge/<scenario>/playbook.md`) |
| `harness_status` | Recent runs, scores, and scenarios overview |
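
The generate → judge → improve → persist loop listed for `harness_solve` can be pictured roughly as follows. This is a minimal sketch, not the extension's code: every name in it (`solveSketch`, `LoopDeps`, `Judgement`, the stub functions) is hypothetical, the scoring rule is invented so the example is deterministic, and the calls are synchronous for brevity even though real provider calls would presumably be asynchronous.

```typescript
// Hypothetical types: none of these names come from the harness extension.
type Judgement = { score: number; feedback: string };

interface LoopDeps {
  generate: (goal: string) => string;
  judge: (candidate: string, rubric: string) => Judgement;
  improve: (candidate: string, feedback: string) => string;
}

interface SolveResult {
  best: string;
  bestScore: number;
  playbook: string[]; // lessons accumulated across rounds
}

function solveSketch(goal: string, rubric: string, gens: number, deps: LoopDeps): SolveResult {
  let candidate = deps.generate(goal);
  let best = candidate;
  let bestScore = -Infinity;
  const playbook: string[] = [];

  for (let round = 1; round <= gens; round++) {
    const { score, feedback } = deps.judge(candidate, rubric);
    // "Persist" is modelled as appending a lesson to an in-memory playbook.
    playbook.push(`round ${round}: score ${score.toFixed(2)}; ${feedback}`);
    if (score > bestScore) {
      best = candidate;
      bestScore = score;
    }
    // Skip the improve call after the final round: its output would never be judged.
    if (round < gens) {
      candidate = deps.improve(candidate, feedback);
    }
  }
  return { best, bestScore, playbook };
}

// Deterministic stubs so the sketch runs without any LLM provider.
const stubDeps: LoopDeps = {
  generate: (goal) => `draft for: ${goal}`,
  judge: (candidate) => ({
    score: Math.min(1, candidate.length / 100), // invented rule: longer drafts score higher
    feedback: "add more actionable detail",
  }),
  improve: (candidate, feedback) => `${candidate} [revised: ${feedback}]`,
};

const result = solveSketch("Improve review checklist", "Actionable findings", 2, stubDeps);
console.log(result.bestScore, result.playbook.length);
```

Passing the generate, judge, and improve steps in as functions keeps the loop testable with stubs; in the real harness the judge step is backed by `autoctx`, per the Stack table.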

Usage

# Solve a task with rubric-driven improvement
goal: "Improve zereight-review checklist for RN PRs"
rubric: "Actionable findings, Effect anti-patterns, security when relevant"
→ Best output + playbook saved to knowledge/<scenario>/

# Check previous runs
→ Recent runs with scores and scenarios

# Browse accumulated knowledge
action: list → all scenarios with playbooks
action: read, scenario: zereight-review → full playbook
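
For readers who prefer typed shapes, the parameters in the usage lines above could be modelled like this. The field names `goal`, `rubric`, `gens`, `action`, and `scenario` appear in this document; the `content` field on the write action and the type and function names are assumptions, and the actual tool schemas may differ.

```typescript
// Assumed shapes, inferred from the usage lines; the real tool schemas may differ.
type SolveArgs = { goal: string; rubric: string; gens?: number };

type KnowledgeArgs =
  | { action: "list" }
  | { action: "read"; scenario: string }
  | { action: "write"; scenario: string; content: string }; // `content` is hypothetical

// Maps a knowledge request to the playbook path pattern given in the Tools table.
function playbookPath(args: KnowledgeArgs): string | null {
  return args.action === "list" ? null : `knowledge/${args.scenario}/playbook.md`;
}

const solveExample: SolveArgs = {
  goal: "Improve zereight-review checklist for RN PRs",
  rubric: "Actionable findings, Effect anti-patterns, security when relevant",
  gens: 1,
};

console.log(solveExample.gens, playbookPath({ action: "read", scenario: "zereight-review" }));
```

The discriminated union on `action` lets the compiler reject a `read` request that omits `scenario`.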

When to use

| Need | Use |
| --- | --- |
| Systematic output improvement | `harness_solve` |
| Review accumulated lessons | `harness_knowledge read` |
| Check past run history | `harness_status` |
| One-off judge/improve | `autocontext_judge` / `autocontext_improve` (pi-autocontext) |

Cost

`harness_solve` consumes Pi provider quota on every generation round.
Start with `gens: 1` to calibrate the rubric, then increase.
`harness_knowledge` and `harness_status` are zero-cost.
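
Because quota scales with the number of rounds, a rough budget check before raising `gens` may help. The sketch below assumes a simple linear cost model; `callsPerRound` is a hypothetical figure that this document does not specify and that would need to be measured from a `gens: 1` calibration run.

```typescript
// Hypothetical cost model: callsPerRound is NOT documented by the harness;
// measure it with a gens: 1 run before trusting this estimate.
function maxAffordableGens(quotaCalls: number, callsPerRound: number): number {
  if (callsPerRound <= 0) {
    throw new RangeError("callsPerRound must be positive");
  }
  return Math.max(0, Math.floor(quotaCalls / callsPerRound));
}

console.log(maxAffordableGens(20, 3)); // 6
```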

related-skills.json

same repository

autocontext.md

from "zereight/skill"

Run iterative agent evaluation and improvement loops (judge, improve, scenarios, playbooks) via pi-autocontext or autoctx CLI. Use when improving repeatable agent workflows (review rubrics, skill quality, scenario-based feedback), checking run status, or accumulating knowledge—not for one-off code edits or repo pattern drift (use continuity) or single PR review (use zereight-review).

2026-05-26

skill-cleaner.md

from "zereight/skill"

Audit agent skills — token cost, duplicates, outdated plugin versions, unused skills, and overly long descriptions. Use when trimming skill prompt budget, finding duplicate or unused skills, auditing plugin versions, or deciding which skills to remove.

2026-05-26

skill-not-showing.md

from "zereight/skill"

Diagnose why a skill is missing from Pi /skill or Cursor lists. Use when a skill exists on disk but does not appear, Pi shows only project or extension skills, or after adding a repo skill under .agents/skills/.

2026-05-26

zereight-review.md

from "zereight/skill"

Comprehensive code review skill for practical PR feedback. Use for feature, bugfix, and refactor reviews. Prioritizes correctness, edge cases, logic invariants, fallback-chain safety, async state transitions, architecture analysis, OWASP security, and clear actionable feedback.

2026-05-26

ljg-skill-map.md

from "zereight/skill"

Skill map and inventory workflow. Use for scanning installed skills, skill overview, 스킬 목록, skill map.

2026-05-26

session-lessons.md

from "zereight/skill"

Inspects Cursor agent transcript history, analyzes repeated assistant mistakes, and turns durable learnings into corrective skills or context updates. Use when the user asks to review previous Cursor chats, learn from a failed interaction, run /session-lessons, or stop the model repeating a mistake.

2026-05-26

package.json

"author": "zereight"

"repository": "zereight/skill"


