Run any Skill in Manus with one click

$pwd:

openai-image-gen

Name: Openai Image Gen
Author: best

// Generate and edit images with the OpenAI Image API, defaulting to gpt-image-2. Requires provider fallback through OPENAI_IMAGE_CONFIG; supports text-to-image, repeated-prompt batch generation, multi-image editing/composition (up to 16 input images), mask edits, and size/quality/background/output-format control. Use when the user asks to create, draw, generate, or edit images with OpenAI / GPT Image models, especially when the built-in image tool has not exposed the latest OpenAI image model yet.

Run Skill in Manus

$ git log --oneline --stat

stars:4

forks:0

updated:May 6, 2026 at 07:26

File Explorer

3 files

SKILL.md

readonly

package.json

"author": "best"

"repository": "best/openclaw-skills"

View GitHub Repository

$ install --globalskills.sh

$ download --local

Run Skill in Manus

[HINT] Download the complete skill directory including SKILL.md and all related files

name	openai-image-gen
description	Generate and edit images with the OpenAI Image API, defaulting to gpt-image-2. Requires provider fallback through OPENAI_IMAGE_CONFIG; supports text-to-image, repeated-prompt batch generation, multi-image editing/composition (up to 16 input images), mask edits, and size/quality/background/output-format control. Use when the user asks to create, draw, generate, or edit images with OpenAI / GPT Image models, especially when the built-in image tool has not exposed the latest OpenAI image model yet.
metadata	{"version":"1.2.0"}

OpenAI Image Generation

Generate and edit images via the bundled Python script with gpt-image-2 as the default model. The script only uses provider entries from OPENAI_IMAGE_CONFIG or --config; single-provider env/CLI overrides are intentionally unsupported.

Use this skill when OpenClaw's built-in image_generate tool does not yet expose the desired OpenAI image model, but a direct OpenAI-compatible Image API endpoint is available.

How to Generate

uv run <skill_dir>/scripts/generate_image.py \
  -p "a moonlit courtyard with cherry blossoms" \
  -f "courtyard.png"

Generate multiple images:

uv run <skill_dir>/scripts/generate_image.py \
  -p "minimal poster of a red fox in snowfall" \
  -f "fox-poster.png" \
  -n 3

Control size / quality / format / background:

uv run <skill_dir>/scripts/generate_image.py \
  -p "transparent icon of a white crane in flight" \
  -f "crane.webp" \
  --size 1024x1024 \
  --quality high \
  --background transparent \
  --output-format webp \
  --output-compression 90

Generate in 4K (gpt-image-2 extended):

uv run <skill_dir>/scripts/generate_image.py \
  -p "a photorealistic UI screenshot" \
  -f "screenshot-4k.png" \
  --size 3840x2160 \
  --quality high

How to Edit

Edit one existing image:

uv run <skill_dir>/scripts/generate_image.py \
  -p "replace the background with a misty bamboo forest" \
  -f "bamboo-scene.png" \
  -i /path/to/input.png

Edit with multiple reference images (up to 16):

uv run <skill_dir>/scripts/generate_image.py \
  -p "merge these into one coherent cinematic portrait" \
  -f "merged-portrait.png" \
  -i face.png -i outfit.png -i background.png

Edit with a mask:

uv run <skill_dir>/scripts/generate_image.py \
  -p "only replace the masked area with blooming sakura branches" \
  -f "masked-edit.png" \
  -i source.png \
  --mask mask.png

Parameters

Param	Required	Description
`-p, --prompt`	Yes	Image description or edit instruction. Repeatable for parallel multi-prompt generation
`-f, --filename`	Yes	Output filename. Timestamp prefix auto-added; repeated prompts are auto-suffixed
`-i, --input-image`	No	Input image path. Repeatable, up to 16
`--mask`	No	Optional PNG mask for edit mode
`-n, --count`	No	Number of images to generate per prompt, 1-10
`-j, --parallel`	No	Max concurrent API calls for repeated prompts
`--size`	No	`auto` `1024x1024` `1536x1024` `1024x1536` `2048x2048` `2048x1152` `3840x2160` (4K) `2160x3840` (4K)
`--quality`	No	`auto` `low` `medium` `high`
`--background`	No	`auto` `transparent` `opaque`
`--output-format`	No	`png` `webp` `jpeg`
`--output-compression`	No	`0-100`, only for `webp` / `jpeg`
`--moderation`	No	Generation-only moderation level: `auto` or `low`
`--input-fidelity`	No	Edit-only fidelity: `low` or `high`
`-m, --model`	No	Override model ID across the provider chain, e.g. `gpt-image-2`
`--config`	No	Path to provider chain config JSON; defaults to `OPENAI_IMAGE_CONFIG`

Provider Fallback

The script is config-only: it requires a provider chain from OPENAI_IMAGE_CONFIG or --config. Do not use OPENAI_IMAGE_API_KEY, OPENAI_IMAGE_BASE_URL, --api-key, or --base-url single-provider overrides.

Config File Schema

{
  "providers": [
    {
      "name": "openai-direct",
      "base_url": null,
      "api_key": "<provider API key>",
      "model": "gpt-image-2"
    },
    {
      "name": "compatible-proxy",
      "base_url": "https://proxy.example.com/v1",
      "api_key": "<provider API key>",
      "model": "gpt-image-2"
    }
  ]
}

api_key may be stored directly in the private config file
api_key_env is also supported if a deployment wants the config to reference an env var name instead
Keep the config file outside the skill repo; never commit real keys
base_url: null means OpenAI direct
Providers are tried top-to-bottom; first success wins
-m, --model may override the model across the provider chain

First-Time Setup

When using this skill and no OPENAI_IMAGE_CONFIG is configured yet:

Ask which OpenAI-compatible image endpoints are available
Create a config JSON following the schema above
Save it to a persistent path outside the skill directory
Set OPENAI_IMAGE_CONFIG to the config file path
Put actual keys in each provider's api_key field, or use api_key_env if env indirection is required

Keep this skill's config separate from general-purpose OPENAI_BASE_URL / OPENAI_API_KEY setups when those are already used for other APIs or proxies.

Error Classification

Retryable → try next provider: 408 409 429 500 502 503 504, timeout, connection failures
Auth errors → skip provider and continue: 401 403
Non-retryable → fail immediately: bad request, invalid params, safety rejection

Output Handling

Prints MEDIA: lines for auto-attachment on supported chat platforms
Report the saved file path to the user; do not read generated images back into context
For permanent hosting, use the chevereto-upload skill after generation
Plain filename output goes to $OPENAI_IMAGE_OUTPUT_DIR/YYYY-MM/timestamp-name.ext

Configuration

Variable	Required	Description
`OPENAI_IMAGE_CONFIG`	Yes*	Path to provider-chain config JSON
provider `api_key` fields	Yes**	Actual provider API keys stored in the private config file
env vars named by `api_key_env`	No	Optional indirection alternative to inline `api_key`
`OPENAI_IMAGE_OUTPUT_DIR`	No	Output directory (default: `~/.openclaw/workspace/images`)

* Or pass --config /path/to/providers.json for an explicit config path. ** Each provider needs either api_key or api_key_env.

Prompt Writing

See references/prompting.md for templates and best practices.

Key principle: write one clear scene paragraph, and for edits explicitly say what must remain unchanged.

Troubleshooting

Error	Cause	Fix
`No providers configured`	Missing or empty config file	Create `OPENAI_IMAGE_CONFIG` or pass `--config`
`All providers failed`	Every provider returned errors	Check endpoint compatibility, keys, and model name
`transparent background requires png or webp`	Invalid format/background combo	Switch output format to `png` or `webp`
`API response contained no savable image data`	Proxy returned unexpected payload	Try OpenAI direct or a fully compatible endpoint
`First run slow (~10s)`	`uv` downloading dependencies	Subsequent runs use cache