원클릭으로
genskill
// Generate a reusable skill from the current conversation history using a GEPA-style multi-candidate loop with LLM-as-judge Pareto selection.
// Generate a reusable skill from the current conversation history using a GEPA-style multi-candidate loop with LLM-as-judge Pareto selection.
Consolidate durable project memory from the current conversation and identify traces that may deserve genskill.
Configure Google Calendar Browser with /connect and use Calendar ConnectorAct actions through the global Puffer browser profile.
Configure Gmail Browser with /connect and use Gmail ConnectorAct actions through the global Puffer browser profile.
Recipe for rendering a code review as a Canvas — compose findings with evidence and actions into a scannable page instead of a long text answer.
Guide for implementing or updating a Puffer connector. Use when Codex needs to add a new connector template, subscriber stream, internal connector tool, ConnectorAct action, /connect auth flow, workflow subscription support, or a puffer-connector-* Rust crate in the Puffer repo.
Guide for developing Puffer's computer-use integration and vendored CUA driver. Use when Codex works on vendor/cua-driver, scripts/build-cua-driver.sh, scripts/cua-computer-sandbox.sh, docker/cua-driver-native, CUA MCP manifests, screenshots/image tool-result handling, or specs/puffer-core/118.md.
| name | genskill |
| description | Generate a reusable skill from the current conversation history using a GEPA-style multi-candidate loop with LLM-as-judge Pareto selection. |
| disable-model-invocation | false |
Generate a reusable SKILL.md from the conversation transcript so far.
Treat the transcript as evidence of a non-trivial task. Extract:
Output ONLY a SKILL.md document with YAML frontmatter (name, description) followed by sections: Overview, When to Use, Topic Sections, Common Pitfalls, Verification Checklist. Stay under 15000 bytes.
Prefer skills that are conditional and verifier-first. A good skill tells the future agent which task shapes it applies to, what to inspect before acting, which exact verification signal matters, and how to avoid repeated exploration. Do not generate broad productivity advice or a recipe that would make a future agent ignore the current task prompt, required filenames, schemas, or tests. Prefer a narrow domain skill over a catch-all benchmark skill. If the trace contains several unrelated task families, split mentally and generate the single most reusable workflow with the clearest trigger; do not merge logs, regex, git, certificates, sqlite, service setup, scheduling, and code repair into one broad skill. If the available workflow is only a lightweight artifact contract that the next task prompt will already state exactly, do not turn it into a skill. That knowledge should remain project memory unless it captures a non-obvious multi-step method, repeated recovery pattern, or domain-specific verification procedure that is easy to forget.
When the trace is from an end-to-end benchmark or task runner, preserve verifier-negative lessons too. A generated skill should prevent "looks done" failures: writing a setup script when the verifier expects a running service, passing source tests while writing the wrong report schema/CWE labels, giving up after a denied shell command when an allowed Write/Edit path can create the artifact, or satisfying only one side of a polyglot/bidirectional task.