ワンクリックで
yao-image
// Image expert. ALWAYS invoke this skill when you need to read, analyze, describe, or generate images. Use for screenshots, photos, charts, diagrams, AI-generated images, or any visual content.
// Image expert. ALWAYS invoke this skill when you need to read, analyze, describe, or generate images. Use for screenshots, photos, charts, diagrams, AI-generated images, or any visual content.
Robot management expert. ALWAYS invoke this skill when you need to list, create, update, or manage robots, check robot status, trigger robot executions, cancel tasks, or retrieve execution results. Use this skill before guessing robot APIs.
Secret management expert. ALWAYS invoke this skill when you need to read API keys, tokens, or other secrets configured by the user. Never hardcode credentials — use this skill to retrieve them securely.
Workspace file I/O expert. ALWAYS invoke this skill when you need to list workspaces, read or write files in a workspace on a remote node, or browse workspace directories. Use this for cross-node file operations — for local sandbox files, use standard filesystem tools instead.
Agent management expert. ALWAYS invoke this skill when you need to list available agents, download or reference agent source code, deploy agent code to the host, or query the LLM connector matrix. Do not guess agent structures — use this skill first.
Yao process documentation expert. ALWAYS invoke this skill when the user needs to discover available processes, read process signatures, or validate process names. Do not guess process APIs — use this skill first.
Yao process execution expert. ALWAYS invoke this skill when the user needs to call a Yao process, query data models, run scripts, or check process permissions. Do not call processes without checking this skill first.
| name | yao-image |
| description | Image expert. ALWAYS invoke this skill when you need to read, analyze, describe, or generate images. Use for screenshots, photos, charts, diagrams, AI-generated images, or any visual content. |
Use these tools when you encounter images you cannot read natively, or when you need to generate new images.
Send an image to a vision-capable model and get a text description.
tai tool image_read '{"image_path": "/path/to/image.png", "prompt": "Describe this image"}'
tai tool image_read '{"image_path": "https://example.com/photo.jpg", "prompt": "What is shown?"}'
tai tool image_read '{"image_path": "workspace://ws-id/path/to/image.png", "prompt": "Analyze"}'
tai tool image_read '{"image_path": "attach://__yao.attachment/file-id-123", "prompt": "Describe"}'
tai tool image_read '{"image_path": "/path/to/image.png", "prompt": "Describe", "provider": "llm.my-openai:gpt-4o"}'
| Parameter | Type | Required | Description |
|---|---|---|---|
| image_path | string | yes | File path, URL, workspace://, attach://, or yao:// URI |
| prompt | string | no | Analysis instruction (default: describe in detail) |
| max_size | integer | no | Max dimension in pixels for longest edge (default: 1080) |
| provider | string | no | Vision provider connector ID. If omitted, uses default vision model |
Images are automatically resized (preserving aspect ratio) before sending to the vision model. Supported formats: PNG, JPEG, GIF, WebP.
Generate an image from a text prompt and save it to a file.
tai tool image_generate '{"prompt": "A serene mountain landscape at sunset", "output": "landscape.png"}'
tai tool image_generate '{"prompt": "A futuristic city skyline", "provider": "llm.my-openai:dall-e-3", "size": "1792x1024", "output": "output/city.png"}'
| Parameter | Type | Required | Description |
|---|---|---|---|
| prompt | string | yes | Text description of the image to generate |
| output | string | yes | File path to save the generated image (parent dirs created automatically) |
| provider | string | no | Provider connector ID (use image_providers to list). Auto-selects if omitted |
| size | string | no | Image dimensions (default: 1024x1024). Common: 1024x1024, 1024x1792, 1792x1024 |
Important: Always pass output. The tool saves the image directly and returns only the file path and size. Without output, the raw base64 data is returned which may exceed output limits.
Use relative paths (e.g. "output": "fox.png") — they resolve relative to the current working directory ($WORKDIR). No need to prepend $WORKDIR manually.
List available image providers filtered by capability.
tai tool image_providers '{}'
tai tool image_providers '{"capability": "vision"}'
| Parameter | Type | Required | Description |
|---|---|---|---|
| capability | string | no | image_generation (default) or vision |
Returns a list of providers with their available models and connector IDs that can be passed to image_generate or image_read.