원클릭으로
image-gen
AI image generation and editing. Use when users ask to generate, create, or draw images with AI, or edit and modify existing images.
Codex 또는 Claude로 설치 이 Prompt를 복사해 Codex, Claude 또는 다른 어시스턴트에 붙여 넣으면 Skill 페이지를 검토하고 설치를 진행할 수 있습니다.
메뉴
AI image generation and editing. Use when users ask to generate, create, or draw images with AI, or edit and modify existing images.
Codex 또는 Claude로 설치 이 Prompt를 복사해 Codex, Claude 또는 다른 어시스턴트에 붙여 넣으면 Skill 페이지를 검토하고 설치를 진행할 수 있습니다.
Activated when user message contains an <editor /> XML tag (injected by Max Client Video Editor). Guides how to edit Remotion project clips (maxmotion.json) and sequences (.tsx source code).
AI-powered media understanding and analysis for images, videos, and audio. Use when users ask to describe, analyze, summarize, or extract text (OCR) from media files.
AI video generation with text-to-video, image-to-video, and first/last frame control. Use when users ask to generate or create videos from text prompts or images.
Guide for creating effective skills. This skill should be used when users want to create a new skill (or update an existing skill) that extends Claude's capabilities with specialized knowledge, workflows, or tool integrations.
Speech-to-text transcription using Whisper with word-level timestamps. Use when users ask to transcribe audio or video to text, generate subtitles, or recognize speech.
Browser automation with persistent page state. Use when users ask to navigate websites, fill forms, take screenshots, extract web data, test web apps, or automate browser workflows. Trigger phrases include "go to [url]", "click on", "fill out the form", "take a screenshot", "scrape", "automate", "test the website", "log into", or any browser interaction request.
SOC 직업 분류 기준
| name | image-gen |
| description | AI image generation and editing. Use when users ask to generate, create, or draw images with AI, or edit and modify existing images. |
Generate or edit images via Maxgent FAL API proxy.
MAX_API_KEY environment variable (auto-injected by Max)fal-ai/nano-banana-profal-ai/nano-banana-pro/editbun skills/image-gen/image-gen.js \
--model MODEL --prompt "PROMPT" --aspect-ratio RATIO \
--num-images N --output-dir DIR \
[--input-image PATH] [--output-format FORMAT] \
[--seed N] [--guidance-scale X] [--steps N]
Parameters:
--model: auto (recommended), gpt-image-1.5, nano-banana-pro, flux-dev--prompt: image description or editing instruction--aspect-ratio: 1:1, 4:3, 3:4, 16:9, 9:16--num-images: number of images to generate (default 1)--output-dir: output directory — default to $MAX_PROJECT_PATH (the user's project root)--input-image: optional, for image editing mode--output-format: png, jpg, webp (default png)--seed: random seed for reproducibility--guidance-scale: guidance scale for supported models--steps: number of inference steps for supported models# Default routing, text-to-image
bun skills/image-gen/image-gen.js --model auto --prompt "a cat under the starry sky" --aspect-ratio "1:1" --output-dir "$MAX_PROJECT_PATH"
# Specify model
bun skills/image-gen/image-gen.js --model gpt-image-1.5 --prompt "modern building facade, cinematic" --aspect-ratio "16:9" --num-images 2 --output-dir "$MAX_PROJECT_PATH"
# Image editing
bun skills/image-gen/image-gen.js --model auto --prompt "change background to a beach at sunset" --aspect-ratio "1:1" --output-dir "$MAX_PROJECT_PATH" --input-image "/path/to/input.jpg"
MAX_API_KEY exists.$MAX_PROJECT_PATH.