with one click
Generate or edit images via Gemini 3 Pro Image (Nano Banana Pro).
npx skills add https://github.com/linuxhsj/openclaw-zero-token --skill nano-banana-proCopy and paste this command into Claude Code to install the skill
Generate or edit images via Gemini 3 Pro Image (Nano Banana Pro).
npx skills add https://github.com/linuxhsj/openclaw-zero-token --skill nano-banana-proCopy and paste this command into Claude Code to install the skill
| name | nano-banana-pro |
| description | Generate or edit images via Gemini 3 Pro Image (Nano Banana Pro). |
| homepage | https://ai.google.dev/ |
| metadata | {"openclaw":{"emoji":"🍌","requires":{"bins":["uv"],"env":["GEMINI_API_KEY"]},"primaryEnv":"GEMINI_API_KEY","install":[{"id":"uv-brew","kind":"brew","formula":"uv","bins":["uv"],"label":"Install uv (brew)"}]}} |
Use the bundled script to generate or edit images.
Generate
uv run {baseDir}/scripts/generate_image.py --prompt "your image description" --filename "output.png" --resolution 1K
Edit (single image)
uv run {baseDir}/scripts/generate_image.py --prompt "edit instructions" --filename "output.png" -i "/path/in.png" --resolution 2K
Multi-image composition (up to 14 images)
uv run {baseDir}/scripts/generate_image.py --prompt "combine these into one scene" --filename "output.png" -i img1.png -i img2.png -i img3.png
API key
GEMINI_API_KEY env varskills."nano-banana-pro".apiKey / skills."nano-banana-pro".env.GEMINI_API_KEY in ~/.openclaw/openclaw.jsonSpecific aspect ratio (optional)
uv run {baseDir}/scripts/generate_image.py --prompt "portrait photo" --filename "output.png" --aspect-ratio 9:16
Notes
1K (default), 2K, 4K.1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9. Without --aspect-ratio / -a, the model picks freely - use this flag for avatars, profile pics, or consistent batch generation.yyyy-mm-dd-hh-mm-ss-name.png.MEDIA: line for OpenClaw to auto-attach on supported chat providers.Control a browser via Chrome DevTools Protocol (CDP) proxy. Use when: (1) navigating to URLs and reading page content, (2) taking screenshots, (3) executing JavaScript in the browser, (4) clicking elements or filling forms, (5) searching and installing Chrome Web Store extensions, (6) interacting with web APIs that require a real browser. NOT for: simple HTTP requests (use curl), local file operations, or when no CDP proxy is available.
Control Eight Sleep pods (status, temperature, alarms, schedules).
Gemini CLI for one-shot Q&A, summaries, and generation.
Batch-generate images via OpenAI Images API. Random prompt sampler + `index.html` gallery.
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Local speech-to-text with the Whisper CLI (no API key).