Run any Skill in Manus with one click

$pwd:

qa

Name: Qa
Author: team-attention

// Systematically QA test any application — web apps, native macOS apps, Electron apps, CLI tools, interactive REPLs, or anything on screen. Three modes: browser (agent-browser/Playwright, fast, DOM-level), computer (MCP computer-use, screenshot + pixel clicks, any app), and cli (tmux, send-keys + capture-pane for interactive terminals). Auto-selects mode or accepts --browser / --computer / --cli override. Use when asked to "qa", "QA", "test this site", "test this app", "find bugs", "test and fix", "fix what's broken", "dogfood", "exploratory test", "bug hunt", "QA this app", "사이트 테스트", "앱 테스트", "브라우저 QA", "화면 보고 테스트해줘", "네이티브 앱 테스트", "screen test". Three tiers: Quick (critical/high only), Standard (+ medium), Exhaustive (+ cosmetic). Produces before/after health scores, fix evidence, and a ship-readiness summary.

Run Skill in Manus

$ git log --oneline --stat

stars:68

forks:9

updated:April 7, 2026 at 08:34

File Explorer

6 files

SKILL.md

readonly

related-skills.json

same repository

doc-drift.md

from "team-attention/harness"

Use this skill when the user wants to audit the memory and documents Claude Code loads into context — CLAUDE.md (user global + project + nested), MEMORY.md, @imports, .claude/skills, .claude/agents, .claude/commands, installed plugins — and detect three kinds of issues: outdated claims, mutually contradictory statements, and risky-or-ambiguous wording. Produces a prioritized improvement list at `.drift-reports/`. Zero config. Trigger phrases: "doc drift", "memory drift", "memory audit", "context drift", "docs audit", "문서 점검", "문서 감사", "메모리 감사", "메모리 점검", "outdated 문서", "문서 충돌".

2026-04-1768

scaffold.md

from "team-attention/harness"

Greenfield project architecture + harness scaffolding for AI Agent productivity. Interview-driven decisions -> markdown spec output. Produces: Code Structure (vertical slice exemplar), Test Infrastructure, Guard Rails, conditional extensions, AND Harness (CLAUDE.md with domain/team context, rules, skills, hooks). L2: architecture decisions, L3: harness setup, L4: unified plan (requirements + tasks). Use when: "/scaffold", "scaffold", "new project", "set up project", "프로젝트 세팅", "초기 구조"

2026-04-1568

check-harness.md

from "team-attention/harness"

Harness 성숙도 진단 — **6축 24항목 체크리스트**와 **2×3 분석 매트릭스**(Static/Behavioral/Growth × User/Project)로 하네스의 사이클(구조→맥락→계획→실행→검증→개선)을 평가한다. 판단은 항상 "갖춘 것(Static) ↔ 실제로 하는 것(Behavioral)의 gap" 또는 "하네스가 자라고 있는지(Growth)"에서 나온다. 4개 서브에이전트(skill-portfolio-analyzer, session-pattern-analyzer, context-quality-reviewer, project-automation-auditor)를 병렬 실행. session-pattern-analyzer는 User 전역과 현재 프로젝트 두 번 돌려 User/Project 스코프를 분리한다. Use whenever the user asks to audit their Claude Code harness, review skill portfolio health, evaluate execution patterns across sessions, check project context/rules quality, or wants to know what's missing in their AI setup — even if they don't say "check-harness" explicitly. Trigger: "/check-harness", "check harness", "하네스 체크", "하네스 점검", "harness audit", "설정 점검", "뭐가 부족한지 봐줘", "하네스 진단", "성숙도 점검", "maturity check", "내 클로드 설정 봐줘", "스킬 정리".

2026-04-1568

agent-orchestrate.md

from "team-attention/harness"

Analyze the user's task and propose the optimal orchestration pattern, then execute it. 4 patterns: Sequential Pipeline, Parallel Subagent, Team Mode, Ralph Loop. Situation-aware pattern selection with user confirmation before execution. Use when: "/agent-orchestrate", "agent-orchestrate", "오케스트레이션", "어떤 패턴으로", "병렬로 할까", "순차로 할까", "팀 모드", "에이전트 패턴", "작업 방식 제안", "how should we run this", "pick a pattern". Also trigger when the user describes a complex multi-step task that would clearly benefit from agent coordination — e.g., "A사 B사 C사 분석해줘", "설계하고 구현하고 리뷰까지", "이거 순서대로 해줘", "3개 동시에 돌려", or any task with 3+ subtasks where choosing the right execution pattern matters for efficiency.

2026-04-0768

deep-interview.md

from "team-attention/harness"

"/deep-interview", "deep interview", "interview me", "clarify requirements", "요구사항 정리", "인터뷰", "딥 인터뷰", "뭘 만들어야 할지 모르겠어", "요구사항이 불명확", "아이디어 구체화"

2026-04-0768

specify.md

from "team-attention/harness"

Turn a goal into an implementation plan (spec.md). Simplified layer chain: L0:Goal → L1:Context → L2:Decisions → L3:Requirements → L4:Tasks. Evidence-based clarity scoring at L2. User approves at L2, L3, L4. Output is a single spec.md file written with the Write tool. Use when: "/specify", "specify", "plan this"

2026-04-0768

package.json

"author": "team-attention"

"repository": "team-attention/harness"

View GitHub Repository View Creator Repositories

$ install --global

$ download --local

Run Skill in Manus

$ useful --forSOC

Software Quality Assurance Analysts and TestersComputer and Mathematical Occupations15-1253L4

name	qa
description	Systematically QA test any application — web apps, native macOS apps, Electron apps, CLI tools, interactive REPLs, or anything on screen. Three modes: browser (agent-browser/Playwright, fast, DOM-level), computer (MCP computer-use, screenshot + pixel clicks, any app), and cli (tmux, send-keys + capture-pane for interactive terminals). Auto-selects mode or accepts --browser / --computer / --cli override. Use when asked to "qa", "QA", "test this site", "test this app", "find bugs", "test and fix", "fix what's broken", "dogfood", "exploratory test", "bug hunt", "QA this app", "사이트 테스트", "앱 테스트", "브라우저 QA", "화면 보고 테스트해줘", "네이티브 앱 테스트", "screen test". Three tiers: Quick (critical/high only), Standard (+ medium), Exhaustive (+ cosmetic). Produces before/after health scores, fix evidence, and a ship-readiness summary.
allowed-tools	["Bash","Read","Write","Edit","Glob","Grep","Agent","AskUserQuestion","mcp__computer-use__screenshot","mcp__computer-use__zoom","mcp__computer-use__left_click","mcp__computer-use__right_click","mcp__computer-use__double_click","mcp__computer-use__triple_click","mcp__computer-use__type","mcp__computer-use__key","mcp__computer-use__scroll","mcp__computer-use__mouse_move","mcp__computer-use__left_click_drag","mcp__computer-use__computer_batch","mcp__computer-use__open_application","mcp__computer-use__request_access","mcp__computer-use__list_granted_applications","mcp__computer-use__cursor_position","mcp__computer-use__wait","mcp__computer-use__read_clipboard","mcp__computer-use__write_clipboard"]
validate_prompt	Health score must be computed (0-100 weighted average). Every issue must have at least one screenshot as evidence. Each fix must be a separate atomic commit. Final report must include before/after health scores and ship readiness summary. Mode (browser/computer) must be selected in Phase 0.

/qa: Plan -> Test -> Fix -> Verify

You are a QA engineer AND a bug-fix engineer. Test applications like a real user — click everything, fill every form, check every state. When you find bugs, fix them in source code with atomic commits, then re-verify. Produce a structured report with before/after evidence.

Phase 0: Analyze Target & Select Mode

0.1 Parse User Request

Parameter	Default	Override example
Target	(required)	URL, app name, CLI command, or "current branch"
Mode	auto-detect	`--browser`, `--computer`, `--cli`
Tier	Standard	`--quick`, `--exhaustive`
Report-only	false	`--report-only` (no fixes)
Output dir	`.qa-reports/`	`Output to /tmp/qa`
Scope	Full app	`Focus on the billing page`

0.2 Auto-Select Mode

Signal	Mode	Why
URL provided (http/https/localhost)	browser	Web app, agent-browser gives DOM access
On feature branch, no URL	browser (diff-aware)	Verify branch changes locally
Native app name (Slack, Notes, Figma)	computer	Not a web app
Electron app	computer	Desktop app, even if web-based
CLI command, REPL, or interactive terminal	cli	Needs tmux send-keys + capture-pane
`--browser` flag	browser	User override
`--computer` flag	computer	User override
`--cli` flag	cli	User override
Ambiguous	AskUserQuestion	Let user decide

0.3 Setup Mode

Browser mode: Read references/browser-mode.md for agent-browser setup and interaction patterns.

Computer mode: Read references/computer-mode.md for MCP computer-use setup and interaction patterns.

CLI mode: Read references/cli-mode.md for tmux setup and interaction patterns.

0.4 Clean Working Tree (if fixing code)

If NOT --report-only and source code exists:

git status --porcelain

If dirty, use AskUserQuestion: commit / stash / abort.

0.5 Create Output Directories

mkdir -p .qa-reports/screenshots

Phase 1: Test Plan

Before touching the app, create a structured test plan. This ensures systematic coverage instead of random clicking.

1.1 Gather Context

If diff-aware (feature branch, no URL):

git diff main...HEAD --name-only
git log main..HEAD --oneline

Identify affected pages/routes from changed files.

If URL or app provided:

Navigate to the app (using the selected mode's tools)
Take an initial screenshot
Map the navigation structure: menus, tabs, sidebar, main content areas

1.2 Generate Test Plan

Create a test plan covering:

## Test Plan

### Target
- App: {name/URL}
- Mode: browser / computer
- Tier: quick / standard / exhaustive
- Scope: {full app or specific area}

### Screens to Test (priority order)
1. {Screen name} — {why: core feature / changed in diff / user-specified}
2. {Screen name} — {why}
3. ...

### Test Cases per Screen
For each screen, list what to verify:
- [ ] Page loads without errors
- [ ] Interactive elements respond (buttons, links, forms)
- [ ] Form validation works (empty, invalid, edge cases)
- [ ] Navigation in/out works
- [ ] Visual layout looks correct
- [ ] Empty/loading/error states handled

### Auth / Setup Required
- {Any login, data seeding, or preconditions}

### Out of Scope
- {What we're NOT testing and why}

1.3 Show Plan to User

Present the test plan briefly. For --quick mode, skip user approval and execute immediately. For standard/exhaustive, give the user a chance to adjust scope before proceeding.

Phase 2: Orient

Execute the first part of the test plan — get a map of the application.

Navigate to the starting point
Take initial screenshot (save as evidence)
Identify framework (Next.js, Rails, SPA, native, etc.)
Map navigation structure
Note current state (logged in? which page?)

Phase 3: Explore & Document

Visit screens systematically in test plan order. At each screen:

Navigate to the screen
Take screenshot (save as evidence)
Run the per-screen checklist from references/issue-taxonomy.md:
- Visual scan
- Interactive elements
- Forms
- Navigation
- States (empty, loading, error, overflow)
- Scroll / below-the-fold content
- Console errors (browser mode) or visual errors (computer mode)
Document issues immediately — don't batch them

Evidence collection:

Interactive bugs: screenshot before + after the action, write repro steps
Static bugs: single screenshot + zoom into affected area, describe what's wrong

Write each issue to the report using the template from templates/qa-report-template.md.

Quick mode: Only test the main screen + top 3-5 navigation targets. Skip the per-screen checklist.

Phase 4: Health Score

Compute the baseline health score using the rubric at the bottom of this file.

Phase 5: Triage

Sort issues by severity, decide which to fix based on tier:

Quick: Critical + high only. Mark medium/low as "deferred."
Standard: Critical + high + medium. Mark low as "deferred."
Exhaustive: Fix all, including cosmetic/low.

If --report-only or no source code: Skip Phase 6, go to Phase 7.

Phase 6: Fix Loop

For each fixable issue, in severity order:

6a. Locate Source

Use Grep/Glob to find the responsible source file(s).

6b. Fix

Make the minimal fix. Do NOT refactor surrounding code.

6c. Commit

git add <only-changed-files>
git commit -m "fix(qa): ISSUE-NNN — short description

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>"

One commit per fix. Never bundle.

6d. Re-test

Navigate back to affected screen, take before/after screenshots.

6e. Classify

verified: re-test confirms fix works
best-effort: fix applied but couldn't fully verify
reverted: regression detected -> git revert HEAD -> mark as "deferred"

6f. Self-Regulation

Every 5 fixes (or after any revert), compute WTF-likelihood:

Start at 0%
Each revert:                +15%
Each fix touching >3 files: +5%
After fix 15:               +1% per additional fix
All remaining Low severity: +10%
Touching unrelated files:   +20%

If WTF > 20%: STOP. Show progress. Ask user whether to continue. Hard cap: 50 fixes.

Phase 7: Final QA

Re-test all affected screens
Compute final health score
If final score is WORSE than baseline: WARN prominently

Phase 8: Report

Write report to .qa-reports/qa-report-{target}-{YYYY-MM-DD}.md using the template.

Include:

Test plan summary (screens tested, mode used)
Per-issue details with screenshot evidence
Fix status: verified / best-effort / reverted / deferred
Health score delta: baseline -> final
Ship readiness one-liner

Health Score Rubric

Each category 0-100, then weighted average.

Category	Weight	Scoring
Console/Errors	15%	0 errors=100, 1-3=70, 4-10=40, 10+=10
Navigation	10%	All works=100, each broken path -15
Visual	10%	Start 100, critical -25, high -15, med -8, low -3
Functional	20%	Same deduction scale
UX	15%	Same deduction scale
Performance	10%	Same deduction scale
Content	5%	Same deduction scale
Accessibility	15%	Same deduction scale

score = sum(category_score * weight)

Important Rules

Plan first, test second. Always create a test plan before interacting with the app.
Repro is everything. Every issue needs at least one screenshot.
Verify before documenting. Retry once to confirm it's reproducible.
Never include credentials. Write [REDACTED] for passwords.
Write incrementally. Append each issue as you find it.
Test like a user. Use realistic data. Complete workflows end-to-end.
Depth over breadth. 5-10 well-documented issues > 20 vague descriptions.
One commit per fix. Never bundle multiple fixes.
Revert on regression. git revert HEAD immediately if a fix makes things worse.
Self-regulate. Follow the WTF-likelihood heuristic.
Mode-specific rules are in references/. Read the relevant mode file for interaction patterns.

qa

More from this repository

/qa: Plan -> Test -> Fix -> Verify

Phase 0: Analyze Target & Select Mode

0.1 Parse User Request

0.2 Auto-Select Mode

0.3 Setup Mode

0.4 Clean Working Tree (if fixing code)

0.5 Create Output Directories

Phase 1: Test Plan

1.1 Gather Context

1.2 Generate Test Plan

1.3 Show Plan to User

Phase 2: Orient

Phase 3: Explore & Document

Phase 4: Health Score

Phase 5: Triage

Phase 6: Fix Loop

6a. Locate Source

6b. Fix

6c. Commit

6d. Re-test

6e. Classify

6f. Self-Regulation

Phase 7: Final QA

Phase 8: Report

Health Score Rubric

Important Rules

/qa: Plan -> Test -> Fix -> Verify

Phase 0: Analyze Target & Select Mode

0.1 Parse User Request

0.2 Auto-Select Mode

0.3 Setup Mode

0.4 Clean Working Tree (if fixing code)

0.5 Create Output Directories

Phase 1: Test Plan

1.1 Gather Context

1.2 Generate Test Plan

1.3 Show Plan to User

Phase 2: Orient

Phase 3: Explore & Document

Phase 4: Health Score

Phase 5: Triage

Phase 6: Fix Loop

6a. Locate Source

6b. Fix

6c. Commit

6d. Re-test

6e. Classify

6f. Self-Regulation

Phase 7: Final QA

Phase 8: Report

Health Score Rubric

Important Rules

More from this repository