Exécutez n'importe quel Skill dans Manus
en un clic

Exécutez n'importe quel Skill dans Manus en un clic

imagegen-nano-banana

Étoiles0

Forks0

Mis à jour1 mai 2026 à 06:22

Generate images via Google Gemini Nano Banana Pro (gemini-3-pro-image-preview), the SOTA Google image model as of 2026. Use when the user wants to create, generate, render, or produce an image with Google / Gemini (e.g. "make me an image with gemini", "use nano banana", "nano banana pro", "google image gen", "imagen alternative"). Wraps the v1beta generateContent REST API in a small bash script. Supports up to 14 reference images for blending and 4K output.

Installation

Installer avec Codex ou Claude Copiez ce prompt, collez-le dans Codex, Claude ou un autre assistant, puis laissez-le vérifier la page du skill et l'installer pour vous.

Exécuter dans Manus

Source

oysteinkrog

oysteinkrog/dotfiles

Ouvrir le dépôt GitHub Voir les dépôts du créateur

Téléchargement

Exécuter dans Manus

Métiers associésSOC

Basé sur la classification professionnelle SOC

Développeurs de logicielsProfessions informatiques et mathématiques·SOC 15-1252

Explorateur de fichiers

2 fichiers

SKILL.md

readonly

name	imagegen-nano-banana
description	Generate images via Google Gemini Nano Banana Pro (gemini-3-pro-image-preview), the SOTA Google image model as of 2026. Use when the user wants to create, generate, render, or produce an image with Google / Gemini (e.g. "make me an image with gemini", "use nano banana", "nano banana pro", "google image gen", "imagen alternative"). Wraps the v1beta generateContent REST API in a small bash script. Supports up to 14 reference images for blending and 4K output.
allowed-tools	["Bash","Read"]

imagegen-nano-banana — Gemini Nano Banana Pro from the CLI

Thin shell wrapper around POST generativelanguage.googleapis.com/v1beta/models/<model>:generateContent. Defaults to gemini-3-pro-image-preview (Nano Banana Pro). Decodes the inline base64 image straight to a file.

Tool

~/.claude/skills/imagegen-nano-banana/bin/imagegen-nano-banana

Run with --help for the full flag list. Common invocations:

# Default: 1:1, 2K, gemini-3-pro-image-preview
~/.claude/skills/imagegen-nano-banana/bin/imagegen-nano-banana \
  "a cat reading a newspaper"

# Wide 4K poster
~/.claude/skills/imagegen-nano-banana/bin/imagegen-nano-banana \
  -a 16:9 -s 4K -o /tmp/sunrise.png \
  "panoramic alpine sunrise, dramatic clouds, photorealistic"

# Blend reference images
~/.claude/skills/imagegen-nano-banana/bin/imagegen-nano-banana \
  -i ref1.png -i ref2.png -i ref3.png \
  "compose these into a single cohesive product shot, studio lighting"

# Switch to faster Nano Banana 2 (Flash variant)
~/.claude/skills/imagegen-nano-banana/bin/imagegen-nano-banana \
  -m gemini-3.1-flash-image-preview \
  "a quick concept sketch of a cyberpunk corgi"

# Use Google Search grounding for up-to-date data
~/.claude/skills/imagegen-nano-banana/bin/imagegen-nano-banana --grounding \
  "infographic of today's NASDAQ close, clean editorial style"

API key

Reads GEMINI_API_KEY in this order:

Environment variable.
$IMAGEGEN_NANO_BANANA_KEYS_FILE if set, else ~/.config/imagegen-nano-banana/keys.env — sourced as shell KEY=value lines.

Set up the file the first time:

mkdir -p ~/.config/imagegen-nano-banana && chmod 700 ~/.config/imagegen-nano-banana
printf 'GEMINI_API_KEY=...\n' > ~/.config/imagegen-nano-banana/keys.env
chmod 600 ~/.config/imagegen-nano-banana/keys.env

Get a key from https://aistudio.google.com/apikey. The script exits with a clear error if the key is missing.

Flags

Flag	Default	Notes
`-p, --prompt`	—	or pass as positional
`-o, --out`	`./imagegen-nano-banana-<ts>.<ext>`	extension follows response mime
`-m, --model`	`gemini-3-pro-image-preview`	see model table below
`-a, --aspect`	`1:1`	`16:9`, `9:16`, `4:3`, `3:4`, `21:9`, `2:3`, `3:2`
`-s, --size`	`2K`	`1K` / `2K` / `4K` (Pro supports all)
`-i, --image`	—	reference image path; repeatable
`--grounding`	off	enable Google Search grounding (Pro only)
`--thinking`	—	`low` / `medium` / `high` reasoning depth

Models

Model ID	Codename	Use for
`gemini-3-pro-image-preview`	Nano Banana Pro	Highest fidelity, 4K, complex layouts, text-in-image
`gemini-3.1-flash-image-preview`	Nano Banana 2	Fast everyday generation
`gemini-2.5-flash-image`	Nano Banana	Original; fastest, cheapest

How to use this skill

Pick a prompt. If the user wasn't specific, ask one short clarifying question about subject + style + aspect ratio rather than guessing.
Run the script via Bash. Pro at 2K typically returns in 8–12s.
The script prints the output path on success — show the image to the user (Read it so it renders inline, or mention the path).
For iteration, tweak the prompt (or add -i previous.png as a reference).

Notes

All Google-generated images carry the imperceptible SynthID watermark. Mention this if the user is producing assets where AI provenance matters.
Multilingual text-in-image is a strength — call it out for posters/ads.
--grounding adds a fixed surcharge per call but lets the model use real-time web context (e.g. "today's weather map", "current charts"). Pro only.
Reference-image blend works up to ~14 inputs; the model treats them as visual context for the prompt, not as a starting canvas to edit.
The default model often returns JPEG (not PNG). The script picks the file extension from the response mime type when no -o is supplied; if you do pass -o foo.png, the file will be written to that exact path even if the bytes are JPEG (most viewers handle the mismatch).
If the response comes back text-only (e.g. safety filter), the script surfaces the model's text reply in the error message.

Plus depuis ce dépôt

même dépôt

consult-oracles

oysteinkrog/dotfiles

Consult Fable (primary oracle) for expert second opinions; escalate to GPT-5.5-Pro only for extremely important or complex tasks (always paired with Fable). Use for complex decisions, architecture choices, debugging hard problems, or when user says "consult oracles", "ask the experts", or wants a second opinion.

2026-06-100

oracle-review

oysteinkrog/dotfiles

Run iterative oracle + agent hardening loop on any artifact (designs, plans, beads, architecture) until findings converge to near-zero. Combines /swarm-oracle with /swarm-review in alternating rounds. Use for the full hardening cycle, not just a single oracle pass. For oracle-only, use /swarm-oracle. For bead-only hardening, use /swarm-beads-quality.

2026-06-100

oracle-consensus

oysteinkrog/dotfiles

Run 2x oracle sessions (FOR + AGAINST stances) to validate design decisions, plans, or bead readiness. Default = two Fable subagents; escalate to PAL 2x GPT-Pro (always paired with Fable) for extremely important or complex validations. Use after design rounds, before implementation, or to challenge architecture decisions.

2026-06-100

sync-human

oysteinkrog/dotfiles

Act as a wise, effective teacher whose goal is to make the human deeply understand the work done in this session (a change, a bug fix, a feature, a design) — i.e. sync the human's mental model up to the agent's. Use when the user says "sync-human", "sync me up", "teach me this session", "make sure I understand", "walk me through what we did", "quiz me on this", or "I want to actually understand this PR/change", or otherwise wants Socratic, gated, incremental teaching with comprehension checks rather than a one-shot summary. Drives understanding at both high level (motivation, impact) and low level (business logic, edge cases) using a running checklist and quizzes.

2026-06-050

agent-mail

oysteinkrog/dotfiles

MCP Agent Mail for multi-agent coordination. Use when agents need file locks, messaging, inboxes, or conflict prevention. Handles macro_start_session, file_reservation_paths, send_message, threading, pre-commit guards.

2026-05-290

secret-lookup

oysteinkrog/dotfiles

Retrieve API tokens, keys, and credentials Oystein has stored locally. Use whenever code, scripts, or shell commands need a secret value: GitHub tokens, Cloudflare, HubSpot, Slack, Zendesk, Jira, Sentry, Anthropic, Apify, Browserbase, Google OAuth, Huma. Use BEFORE searching shell history, session logs, dotfiles, or the filesystem — the canonical store is documented here and the values are reachable via two fish helpers. Also use when adding, rotating, or removing a credential.

2026-05-110

name	imagegen-nano-banana
description	Generate images via Google Gemini Nano Banana Pro (gemini-3-pro-image-preview), the SOTA Google image model as of 2026. Use when the user wants to create, generate, render, or produce an image with Google / Gemini (e.g. "make me an image with gemini", "use nano banana", "nano banana pro", "google image gen", "imagen alternative"). Wraps the v1beta generateContent REST API in a small bash script. Supports up to 14 reference images for blending and 4K output.
allowed-tools	["Bash","Read"]

imagegen-nano-banana — Gemini Nano Banana Pro from the CLI

Tool

~/.claude/skills/imagegen-nano-banana/bin/imagegen-nano-banana

Run with --help for the full flag list. Common invocations:

# Default: 1:1, 2K, gemini-3-pro-image-preview
~/.claude/skills/imagegen-nano-banana/bin/imagegen-nano-banana \
  "a cat reading a newspaper"

# Wide 4K poster
~/.claude/skills/imagegen-nano-banana/bin/imagegen-nano-banana \
  -a 16:9 -s 4K -o /tmp/sunrise.png \
  "panoramic alpine sunrise, dramatic clouds, photorealistic"

# Blend reference images
~/.claude/skills/imagegen-nano-banana/bin/imagegen-nano-banana \
  -i ref1.png -i ref2.png -i ref3.png \
  "compose these into a single cohesive product shot, studio lighting"

# Switch to faster Nano Banana 2 (Flash variant)
~/.claude/skills/imagegen-nano-banana/bin/imagegen-nano-banana \
  -m gemini-3.1-flash-image-preview \
  "a quick concept sketch of a cyberpunk corgi"

# Use Google Search grounding for up-to-date data
~/.claude/skills/imagegen-nano-banana/bin/imagegen-nano-banana --grounding \
  "infographic of today's NASDAQ close, clean editorial style"

API key

Reads GEMINI_API_KEY in this order:

Environment variable.
$IMAGEGEN_NANO_BANANA_KEYS_FILE if set, else ~/.config/imagegen-nano-banana/keys.env — sourced as shell KEY=value lines.

Set up the file the first time:

mkdir -p ~/.config/imagegen-nano-banana && chmod 700 ~/.config/imagegen-nano-banana
printf 'GEMINI_API_KEY=...\n' > ~/.config/imagegen-nano-banana/keys.env
chmod 600 ~/.config/imagegen-nano-banana/keys.env

Get a key from https://aistudio.google.com/apikey. The script exits with a clear error if the key is missing.

Flags

Flag	Default	Notes
`-p, --prompt`	—	or pass as positional
`-o, --out`	`./imagegen-nano-banana-<ts>.<ext>`	extension follows response mime
`-m, --model`	`gemini-3-pro-image-preview`	see model table below
`-a, --aspect`	`1:1`	`16:9`, `9:16`, `4:3`, `3:4`, `21:9`, `2:3`, `3:2`
`-s, --size`	`2K`	`1K` / `2K` / `4K` (Pro supports all)
`-i, --image`	—	reference image path; repeatable
`--grounding`	off	enable Google Search grounding (Pro only)
`--thinking`	—	`low` / `medium` / `high` reasoning depth

Models

Model ID	Codename	Use for
`gemini-3-pro-image-preview`	Nano Banana Pro	Highest fidelity, 4K, complex layouts, text-in-image
`gemini-3.1-flash-image-preview`	Nano Banana 2	Fast everyday generation
`gemini-2.5-flash-image`	Nano Banana	Original; fastest, cheapest

How to use this skill

Pick a prompt. If the user wasn't specific, ask one short clarifying question about subject + style + aspect ratio rather than guessing.
Run the script via Bash. Pro at 2K typically returns in 8–12s.
The script prints the output path on success — show the image to the user (Read it so it renders inline, or mention the path).
For iteration, tweak the prompt (or add -i previous.png as a reference).

Notes

All Google-generated images carry the imperceptible SynthID watermark. Mention this if the user is producing assets where AI provenance matters.
Multilingual text-in-image is a strength — call it out for posters/ads.
--grounding adds a fixed surcharge per call but lets the model use real-time web context (e.g. "today's weather map", "current charts"). Pro only.
Reference-image blend works up to ~14 inputs; the model treats them as visual context for the prompt, not as a starting canvas to edit.
The default model often returns JPEG (not PNG). The script picks the file extension from the response mime type when no -o is supplied; if you do pass -o foo.png, the file will be written to that exact path even if the bytes are JPEG (most viewers handle the mismatch).
If the response comes back text-only (e.g. safety filter), the script surfaces the model's text reply in the error message.