ワンクリックで
veo-32-prompter
// Expert prompt engineering for Google Veo 3.2 (Artemis engine). Use when the user wants to generate a video with Veo 3.2, needs help crafting cinematic prompts, or mentions Veo, Google video generation, or Artemis engine.
// Expert prompt engineering for Google Veo 3.2 (Artemis engine). Use when the user wants to generate a video with Veo 3.2, needs help crafting cinematic prompts, or mentions Veo, Google video generation, or Artemis engine.
AI video generation skill with auto model selection across Seedance 2, Kling 3.0, HappyHorse, and 10+ models. Produces finished multi-shot videos (5–120s) from text, images, URLs, scripts, or audio — including AI music, lip sync, and multi-shot sequencing. No prompts to write, no models to choose. USE FOR: video production, AI video, make a video, product video, brand video, promotional clip, explainer video, short video, TikTok video, Instagram Reel, YouTube Short, product ad, text-to-video, image-to-video, video generation, AI video agent.
Tired of juggling multiple audio APIs? This skill gives you one-command access to TTS, music generation, sound effects, and voice cloning. Use when you want to generate any audio without managing multiple API keys.
AI creative director that turns a user's natural-language idea into a complete storyboard and generates all assets — images, video clips, and audio — automatically. The user only describes what they want; all prompt engineering is handled internally.
Generate short AI videos from text or images — text-to-video, image-to-video, and reference-based generation — with zero API key setup. Use when the user wants to create a video clip, animate an image, or generate video from a description.
Expert prompt engineering for Seedance 2.0. Use when the user wants to generate a video with multimodal assets (images, videos, audio) and needs the best possible prompt.
Tired of juggling 8 API keys? This skill gives you one-command access to Midjourney, Flux, Ideogram, and more, with zero setup. Use when you want to generate any image without worrying about API keys.
| name | veo-3.2-prompter |
| description | Expert prompt engineering for Google Veo 3.2 (Artemis engine). Use when the user wants to generate a video with Veo 3.2, needs help crafting cinematic prompts, or mentions Veo, Google video generation, or Artemis engine. |
| version | 0.1.0 |
| author | wells |
| tags | ["video","generation","prompt","veo","google","artemis","cinematic"] |
This skill transforms a user's scattered multimodal assets (images, videos, audio) and creative intent into a structured, executable prompt for the Google Veo 3.2 video generation model (Artemis engine). It acts as an expert prompt engineer, ensuring the highest quality output from the underlying model.
This skill analyzes all user inputs and generates a single, optimized JSON object containing the final prompt and recommended parameters. The internal workflow (Recognition, Mapping, Construction) is handled automatically and should not be exposed to the user.
atomic_element_mapping.md to classify each asset into its atomic element role(s).RawReferenceImage system.User Request: "Make a cinematic shot of this perfume bottle rotating on a dark surface, like a luxury commercial."
User uploads perfume.png
Agent using veo-3.2-prompter:
The agent internally processes the request and assets, then outputs the final JSON to the next skill in the chain.
Final Output (for internal use):
{
"final_prompt": "Hero shot, a frosted glass perfume bottle with gold cap rotating slowly on a reflective dark surface, three-point studio lighting with soft key and rim light creating subtle caustics, smooth 180-degree arc, hyper-realistic luxury commercial style with shallow depth of field. Crystalline chime, soft ambient pad.",
"reference_images": [
{
"file": "perfume.png",
"reference_type": "SUBJECT"
}
],
"recommended_parameters": {
"model": "veo-3.2-generate",
"duration_seconds": 8,
"aspect_ratio": "16:9",
"resolution": "1080p",
"generate_audio": true
}
}
| Feature | Capability |
|---|---|
| Engine | Artemis — world-model physics simulation (not pixel prediction) |
| Max duration | ~30s native continuous generation |
| Audio | Native dialogue + synchronized SFX |
| Reference images | Up to 3 (STYLE, SUBJECT, SUBJECT_FACE) |
| Video extension | Chain clips via previous video input |
| First/last frame | Specify start and/or end keyframes |
| Resolutions | 720p, 1080p, 4K (with upscaling) |
| Aspect ratios | 16:9, 9:16 |
This skill relies on an internal knowledge base to make informed decisions. The agent MUST consult these files during execution.
references/atomic_element_mapping.md: Core Knowledge. Contains the "Asset Type → Atomic Element" and "Atomic Element → Optimal Reference Method" mapping tables, adapted for Veo 3.2's reference image system.references/veo_syntax_guide.md: Veo 3.2 Gemini API syntax reference, covering RawReferenceImage, GenerateVideosConfig, video extension, and first/last frame specification.