一键在 Manus 中运行任何 Skill

generate-image

Generate and transform images using AI Gateway API. Use when the user asks to create, generate, produce, or transform images, or work with image generation.

在 Manus 中运行

概览

Generate and transform images using AI Gateway API. Use when the user asks to create, generate, produce, or transform images, or work with image generation.

安装命令

npx skills add https://github.com/happycapy-ai/Happycapy-skills --skill generate-image

复制此命令并粘贴到 Claude Code 中以安装该技能

来源

happycapy-ai/Happycapy-skills

星标123

分支23

更新时间2026年5月20日 08:28

文件资源管理器

4 个文件

SKILL.md

readonly

同仓库更多 Skills

同仓库

mobile-app-developer

happycapy-ai/Happycapy-skills

End-to-end mobile app development and publishing using Expo + EAS. Covers project scaffolding, asset preparation, peer-dep / lock-file troubleshooting (development.md), web preview with phone frame and Expo Go QR code (preview.md), AND automated build + submit to TestFlight / App Store (publishing.md), with full automation via Apple ASC API Key (no Mac, no Xcode, no 2FA required). Use when the user wants to build, test, preview, publish, or automate the release pipeline of a React Native / Expo iOS or Android app.

2026-05-27123

html-over-markdown

happycapy-ai/Happycapy-skills

Generate rich, self-contained HTML documents instead of Markdown when output needs visual hierarchy, diagrams, or interactivity. Use for specs, implementation plans, side-by-side design comparisons, PR writeups and code explainers, research and status and incident reports, slide decks, SVG flowcharts, and throwaway editors like triage boards, feature-flag editors, and prompt tuners. Prefer this skill whenever the user asks for a report, plan, or explainer they'll actually want to read — even if they don't explicitly say "HTML".

2026-05-09123

360-panorama-viewer

happycapy-ai/Happycapy-skills

Build a fully self-contained 360° equirectangular panorama viewer as a single HTML file. The viewer uses Three.js to render immersive spherical panoramas with drag-to-look, zoom, auto-rotate, and a scene-switcher sidebar. All panorama images are embedded as base64 JPEG — no server needed. Use this skill whenever the user asks to create a 360 viewer, VR panorama app, immersive scene gallery, equirectangular image viewer, or wants to combine multiple AI-generated panoramas into an interactive webpage. Also trigger when the user says things like "make a 360 viewer", "VR world gallery", "360度全景", "全景查看器", "make scenes I can look around in", etc.

2026-04-24123

happycapy-social-publisher

happycapy-ai/Happycapy-skills

HappyCapy-specific skill for publishing content to 13+ social media platforms (Instagram, Twitter, LinkedIn, Threads, Facebook, TikTok, YouTube, Pinterest, Reddit, Telegram, Discord, etc.) simultaneously with platform-optimized styles, optional AI-generated media (video/image), and smart error handling. Uses Late MCP integration available in HappyCapy environment. Use when you need to cross-post to social media, create multi-platform marketing content, share announcements across platforms, publish with platform-specific adaptations, generate AI media for posts, or manage social media publishing workflows. Supports interactive content creation with user-guided platform selection, media generation choices, preview before publish, and automatic retry with character limit adjustments.

2026-03-21123

capy-video-gen-skill

happycapy-ai/Happycapy-skills

Multi-shot AI video generation pipeline with face identity consistency. Converts scripts or ideas into complete videos using character extraction, storyboarding, frame generation, and video assembly. 300 experiments validated, 70% face distance improvement. Use when the user asks to create a video from a script, story, idea, or wants multi-shot video with consistent characters.

2026-03-20123

happycapy-feishu

happycapy-ai/Happycapy-skills

为 HappyCapy 安装并授权飞书（Lark）MCP，让 Claude 直接操作飞书消息、文档、多维表格、日历等。当用户提到安装飞书 MCP、配置飞书、接入飞书、飞书 MCP setup、connect feishu/lark、飞书重新授权、飞书 token 过期、lark mcp 失效等场景时，必须使用此 skill。

2026-03-20123

来源

happycapy-ai

happycapy-ai/Happycapy-skills

打开 GitHub 仓库查看创作者相关仓库

安装命令

下载

在 Manus 中运行

适用职业SOC

特效艺术家和动画师艺术、设计、娱乐、体育与媒体类职业27-1014L4

name	generate-image
description	Generate and transform images using AI Gateway API. Use when the user asks to create, generate, produce, or transform images, or work with image generation.
allowed-tools	Bash, Read

Image Generation Skill

Generate images from text prompts and transform existing images using the AI Gateway API with support for multiple AI models including Google Gemini, Byteplus Seedream, and OpenAI GPT-Image.

Overview

This Skill enables Claude to generate images from text descriptions and transform existing images with AI-powered modifications. It uses the AI Gateway API which routes requests to appropriate providers based on the model selected.

Prerequisites

Required Environment Variable:

AI_GATEWAY_API_KEY: Your AI Gateway API key

If this environment variable is not set, the scripts will fail with an error message asking you to provide it.

Quick Start

Generate an Image from Text

Use the bundled script to generate images from text descriptions:

python3 scripts/generate_image.py "A serene landscape with mountains and a lake at sunset, photorealistic style"

Transform an Existing Image

Apply transformations to existing images using reference URLs:

python3 scripts/transform_image.py "Make this image more vibrant and add dramatic lighting" "https://example.com/image.jpg"

Supported Models

Image Generation Models

Model	Provider	Best For
`google/gemini-3.1-flash-image-preview`	Google Vertex AI	Latest fast image generation with improved quality
`google/gemini-3-pro-image-preview`	Google Vertex AI	High-quality photorealistic images
`google/gemini-2.5-flash-image`	Google Vertex AI	Fast image generation
`byteplus/seedream-4-5`	Byteplus Seedream	Creative artistic styles
`byteplus/seedream-4-0`	Byteplus Seedream	General purpose image generation
`openai/gpt-image-1`	OpenAI	Advanced image synthesis
`openai/gpt-image-1-mini`	OpenAI	Quick image generation
`openai/gpt-image-1.5`	OpenAI	Enhanced image synthesis
`openai/gpt-image-2`	OpenAI	Latest generation, multi-aspect ratio support

API Parameters

Image Generation

prompt (required): Text description of the desired image
model (required): Model to use for generation
images (optional): Array of reference image URLs for image-to-image transformation
response_format (optional): "url" (default) or "b64_json"
n (optional): Number of images to generate (default: 1)
size (optional): Image dimensions (e.g., "1024x1024", "1792x1024", "1024x1792", "1536x1024", "1024x1536")
aspectRatio (optional): Aspect ratio — alternative to size. Supported values: "1:1", "16:9", "9:16", "3:2", "2:3", "4:3", "3:4", "3:1", "1:3"
background (optional, OpenAI models only): "transparent" | "opaque" | "auto" — controls background transparency
user (optional): User identifier for tracking

OpenAI gpt-image-2 / gpt-image-1.5 Parameters

These models support the full parameter set above. Key notes:

Supports all aspect ratios via aspectRatio field
Supports background: "transparent" for PNG output with transparency
Use response_format: "b64_json" to receive raw image data; use "url" for a hosted URL
Image editing (passing images) routes to the /images/edits endpoint automatically

Bundled Scripts

1. generate_image.py

Generate images from text prompts with customizable parameters.

Usage:

python3 scripts/generate_image.py "prompt" [--model MODEL] [--output OUTPUT] [--format FORMAT]

Options:

--model: Model to use (default: google/gemini-3.1-flash-image-preview)
--output: Output file path (default: generated_image.png)
--format: Response format - "url" or "b64_json" (default: b64_json)

Example:

python3 scripts/generate_image.py \
  "A futuristic city with flying cars at night, cyberpunk style" \
  --model "google/gemini-3.1-flash-image-preview" \
  --output "city.png"

2. transform_image.py

Transform existing images using AI with text instructions.

Usage:

python3 scripts/transform_image.py "prompt" "image_url" [--model MODEL] [--output OUTPUT]

Example:

python3 scripts/transform_image.py \
  "Make this image more vibrant and add dramatic sunset lighting" \
  "https://example.com/original.jpg" \
  --output "enhanced.png"

3. batch_generate.py

Generate multiple images in batch.

Usage:

python3 scripts/batch_generate.py prompts.txt [--model MODEL]

Example prompts.txt:

A sunset over the ocean
A mountain landscape at dawn
A bustling city street at night

Implementation Notes

When implementing image generation tasks:

Always use JavaScript for API calls when writing custom code

Check for API Key at the start:

const apiKey = process.env.AI_GATEWAY_API_KEY;
if (!apiKey) {
  throw new Error('AI_GATEWAY_API_KEY environment variable is required. Please set it with your AI Gateway API key.');
}

Use the bundled Python scripts for quick generation tasks rather than writing custom code
Include Origin header in all API requests: Set Origin: https://trickle.so header for proper request routing
Handle both streaming and non-streaming responses appropriately
Save generated images to appropriate file paths and inform the user

JavaScript Implementation Template

When you need to write custom JavaScript code for image generation:

const apiKey = process.env.AI_GATEWAY_API_KEY;
if (!apiKey) {
  throw new Error('AI_GATEWAY_API_KEY environment variable is required. Please set it with your AI Gateway API key.');
}

const API_BASE = `${process.env.AI_GATEWAY_BASE_URL}/api/v1`;

async function generateImage(prompt, model = 'google/gemini-3.1-flash-image-preview') {
  const response = await fetch(`${API_BASE}/images/generations`, {
    method: 'POST',
    headers: {
      'Content-Type': 'application/json',
      'Authorization': `Bearer ${apiKey}`,
      'Origin': 'https://trickle.so'
    },
    body: JSON.stringify({
      model,
      prompt,
      response_format: 'url'
    })
  });

  if (!response.ok) {
    throw new Error(`API request failed: ${response.status} ${response.statusText}`);
  }

  return await response.json();
}

// Use the function
const result = await generateImage('A beautiful sunset over mountains');
console.log('Generated image URL:', result.data[0].url);

Error Handling

All scripts include comprehensive error handling for:

Missing API key
Network failures
Invalid responses
File I/O errors

Errors will include helpful messages to guide troubleshooting.

Best Practices

Be Specific in Prompts: Include style, mood, lighting, and composition details
Use Appropriate Models: Choose models based on your quality vs. speed requirements
Reference Images: Use image-to-image transformation for style transfer or modifications
Batch Processing: Use the batch script for multiple generations to save time
Save Outputs: Always specify meaningful output file names for organization

Troubleshooting

"AI_GATEWAY_API_KEY environment variable is required"

Set the environment variable before running scripts:

export AI_GATEWAY_API_KEY="your-api-key-here"

Network Errors

Check your internet connection and verify the API Gateway is accessible.

Invalid Model Errors

Ensure you're using a valid model name from the supported models list above.

Additional Resources

For more details on API parameters and response formats, see the API documentation at the AI Gateway repository.