mit einem Klick
nanobanana-skill
// Generate or edit images using Google Gemini API via nanobanana. Triggers: "nanobanana", "generate image", "create image", "edit image", "AI drawing", "图片生成", "AI绘图", "图片编辑", "生成图片".
// Generate or edit images using Google Gemini API via nanobanana. Triggers: "nanobanana", "generate image", "create image", "edit image", "AI drawing", "图片生成", "AI绘图", "图片编辑", "生成图片".
Execute long-running, multi-session tasks autonomously using Claude Code headless mode or in-session hook-based loops. Supports structured task decomposition (for complex projects) and lightweight Ralph-style iteration (for TDD, bug fixing, refactoring). Use this skill whenever the user says "autonomous", "long-running task", "multi-session", "run this in the background", "keep working on this", "batch process", "iterate until done", "ralph loop", or wants any task that requires sustained, unattended execution.
Generate or edit images using OpenAI GPT Image API (gpt-image-2, gpt-image-1, etc). Triggers: "gpt image", "openai image", "generate image with openai", "draw image", "create image", "image generation", "AI drawing", "图片生成", "AI绘图", "生成图片", "画图". Use this skill whenever the user wants to generate or edit images and mentions OpenAI, GPT, or when OPENAI_API_KEY is available.
Create, refine, and benchmark agent skills. Use when building a new skill, updating an existing one, running evals, checking trigger quality, or improving a skill description.
Leverage OpenAI Codex/GPT models for autonomous code implementation. Triggers: "codex", "use gpt", "gpt-5", "let openai", "full-auto", "用codex", "让gpt实现". Use this skill whenever the user wants to delegate coding tasks to OpenAI models, run code reviews via codex, or execute tasks in a sandboxed environment.
Extract subtitles/transcripts from YouTube videos. Triggers: "youtube transcript", "extract subtitles", "video captions", "视频字幕", "字幕提取", "YouTube转文字", "提取字幕".
Create Claude Code custom slash commands with proper structure, frontmatter, and best practices. Use this skill whenever the user wants to create a new command, add a slash command, build a custom command, or mentions "create-command", "new command", "add command", or "make a command" for Claude Code. Also trigger when the user wants to turn a workflow into a reusable command.
| name | nanobanana-skill |
| description | Generate or edit images using Google Gemini API via nanobanana. Triggers: "nanobanana", "generate image", "create image", "edit image", "AI drawing", "图片生成", "AI绘图", "图片编辑", "生成图片". |
| allowed-tools | Read, Write, Glob, Grep, Task, Bash(cat:*), Bash(ls:*), Bash(tree:*), Bash(python3:*) |
Generate or edit images using Google Gemini API through the nanobanana tool.
~/.nanobanana.env or export GEMINI_API_KEY=<your-api-key>python3 -m pip install -r ./requirements.txt if not installed yet../nanobanana.pyAsk the user for:
Run the nanobanana script with appropriate parameters:
python3 ./nanobanana.py --prompt "description of image" --output "filename.png"
Show the user the saved image path when complete
Ask the user for:
Run with input images:
python3 ./nanobanana.py --prompt "editing instructions" --input image1.png image2.png --output "edited.png"
1024x1024 (1:1) - Square832x1248 (2:3) - Portrait1248x832 (3:2) - Landscape864x1184 (3:4) - Portrait1184x864 (4:3) - Landscape896x1152 (4:5) - Portrait1152x896 (5:4) - Landscape768x1344 (9:16) - Portrait (default)1344x768 (16:9) - Landscape1536x672 (21:9) - Ultra-widegemini-3.1-flash-image-preview (default) - Latest, fast generationgemini-3-pro-image-preview - Higher quality, supports thinking/reasoning1K (default)2K4K--no-search - Disable Google Search grounding (enabled by default)--no-think - Disable thinking/reasoning modepython3 ./nanobanana.py --prompt "A serene mountain landscape at sunset with a lake"
python3 ./nanobanana.py \
--prompt "Modern minimalist logo for a tech startup" \
--size 1024x1024 \
--output "logo.png"
python3 ./nanobanana.py \
--prompt "Futuristic cityscape with flying cars" \
--size 1344x768 \
--resolution 2K \
--output "cityscape.png"
python3 ./nanobanana.py \
--prompt "Add a rainbow in the sky" \
--input photo.png \
--output "photo-with-rainbow.png"
python3 ./nanobanana.py \
--prompt "Detailed portrait of a cat in watercolor style" \
--model gemini-3-pro-image-preview \
--output "cat-portrait.png"
If the script fails:
GEMINI_API_KEY is exported or set in ~/.nanobanana.env