mit einem Klick
desktop-control
// Unified desktop + browser automation. Routes DOM targets to CDP (cli-jaw browser), desktop apps to Computer Use, hybrid combos to both. Codex desktop/CLI + macOS required for Computer Use.
// Unified desktop + browser automation. Routes DOM targets to CDP (cli-jaw browser), desktop apps to Computer Use, hybrid combos to both. Codex desktop/CLI + macOS required for Computer Use.
| name | desktop-control |
| description | Unified desktop + browser automation. Routes DOM targets to CDP (cli-jaw browser), desktop apps to Computer Use, hybrid combos to both. Codex desktop/CLI + macOS required for Computer Use. |
| metadata | {"openclaw":{"emoji":"🖥️","requires":{"bins":["cli-jaw"],"system":["macOS","Google Chrome"]},"install":[{"id":"brew-cliclick","kind":"brew","formula":"cliclick","bins":["cliclick"],"label":"Install cliclick (optional — pointer-action fallback)"}]}} |
Unified skill for all UI automation. Chooses between CDP and Computer Use based on the target, and reports meaningful actions with a path= + action_class= transcript.
This skill is already injected into your system prompt. Do not run
sed,cat,head, orReadto load it from disk. Guessing absolute paths like/Users/*/.codex/skills/...or/Users/*/.cli-jaw-*/skills/...wastes a turn and often targets a file that doesn't exist. If you need a specific reference file (e.g.,reference/computer-use.md), usecli-jaw skill read desktop-control <ref-name>.
Trigger on any request that touches a visible UI:
$computer-use or /computer-use → skip routing analysis, jump straight to reference/computer-use.md. Explicit user opt-in. If Computer Use tools are not available, stop with precondition failed: computer-use unavailable.reference/cdp.mdreference/computer-use.mdreference/vision-click.mdreference/intent-routing.md FIRSTreference/control-workflow.mdpath=cdp, path=computer-use, or path=cdp+cu.get_app_state(app) before interacting with that app. Re-call it on stale warnings, after actions that change UI state, and whenever confidence drops.action_class. Classes: state-read, element-action, value-injection, keyboard-action, pointer-action, pointer-action+vision, scroll-action, drag-action, secondary-action.get_app_state(app) (Computer Use) or cli-jaw browser snapshot (CDP). Never chain actions through uncertainty. Guessing indices or URLs leads to infinite correction loops. If two consecutive actions produced ambiguous state, the next call must be a state-read, not another action.list_apps, get_app_state, click, drag, press_key, scroll, select_text, set_value, type_text, and perform_secondary_action.list_apps if the app is unknown./Applications/Jaw.app and /Applications/Codex Computer Use.app may be required for TCC attribution. Missing app bundles are a setup issue, not a reason to silently switch paths.CDP action:
path=cdp
url=https://example.com
action=click e3
result=ok
Computer Use action:
path=computer-use
app=Google Chrome
action_class=element-action
action=click(element_index=730)
stale_warning=no
result=ok
Hybrid (lookup via CDP, action via Computer Use):
path=cdp+cu
lookup=cli-jaw browser snapshot → bbox of "Play"
action_class=pointer-action
action=click(x=812, y=514)
result=ok
browser — CDP command reference (this skill supersedes its coverage).screen-capture — generic macOS screenshot / webcam / video recording (unchanged).vision-click — no longer auto-active. Absorbed as a tactic in reference/vision-click.md. If you need the low-level recipe (NDJSON parsing, DPR correction), run cli-jaw skill install vision-click.| Symptom | Correct report |
|---|---|
| "I don't see a cursor" | cursor overlay is best-effort in the current build — action=click(...) succeeded; visible cursor not guaranteed |
| CDP server not running | precondition failed: cli-jaw serve not running. Start with 'jaw serve' and retry. |
| Computer Use tools missing | precondition failed: computer-use unavailable |
| cli-jaw CU app missing in packaged install | precondition failed: /Applications/Codex Computer Use.app missing. Recover: jaw doctor --tcc --fix |
| Stale warning on action | re-call get_app_state(app) then retry; log stale_warning=yes in the transcript |
| Non-GUI task routed here | needs boss follow-up: not GUI automation |
Backend engineering guide for orchestrated sub-agents. Framework-agnostic API design, clean architecture, database optimization, security hardening, systematic debugging. Modular: SKILL.md orchestrator + references/ for deep guidance. Injected when role=backend.
Code review guide for all orchestrated sub-agents. Review process, quality thresholds, antipattern detection, giving/receiving feedback. Available to any agent regardless of role — read this SKILL.md when performing or receiving code reviews.
Data engineering and analysis guide for orchestrated sub-agents. Data pipelines, ETL/ELT design, data quality validation, SQL optimization, and analysis patterns. Injected when role=data.
Systematic debugging methodology for all orchestrated sub-agents. 4-phase root cause analysis: investigate → analyze → hypothesize → implement. Injected when encountering errors or during debugging phase.
Production-grade frontend with distinctive aesthetics. Detects stack and applies specialized rules. Modular: SKILL.md orchestrator + references/ for deep guidance. Injected when role=frontend.
PABCD orchestration workflow. Structured 5-phase development with user checkpoints. Injected during orchestration mode.