تشغيل أي مهارة في Manus بنقرة واحدة

$pwd:

corespeed-nanobanana

Name: Corespeed Nanobanana
Author: corespeed-io

// Generate and edit images using Gemini via Corespeed AI Gateway. Use when a user asks to create, draw, design, or generate an image, picture, illustration, icon, logo, banner, thumbnail, or screenshot mockup. Also triggers on image editing (remove background, resize, recolor, combine photos), image analysis (describe, compare, OCR), and text generation with Gemini models (gemini-2.5-flash-image, gemini-2.5-flash, gemini-2.5-pro). Trigger phrases include: "画一个", "生成图片", "做一张图", "帮我P图", "make me an image", "generate a picture", "edit this photo", "what's in this image".

تشغيل في Manus

$ git log --oneline --stat

stars:١

forks:٠

updated:٢٧ مارس ٢٠٢٦ في ٠١:٠٤

مستكشف الملفات

2 ملفات

SKILL.md

readonly

name

corespeed-nanobanana

description

Generate and edit images using Gemini via Corespeed AI Gateway. Use when a user asks to create, draw, design, or generate an image, picture, illustration, icon, logo, banner, thumbnail, or screenshot mockup. Also triggers on image editing (remove background, resize, recolor, combine photos), image analysis (describe, compare, OCR), and text generation with Gemini models (gemini-2.5-flash-image, gemini-2.5-flash, gemini-2.5-pro). Trigger phrases include: "画一个", "生成图片", "做一张图", "帮我P图", "make me an image", "generate a picture", "edit this photo", "what's in this image".

Corespeed NanoBanana — Gemini Image & Text Generation

Auth

Requires CS_AI_GATEWAY_BASE_URL and CS_AI_GATEWAY_API_TOKEN environment variables. These are often already configured — check with echo $CS_AI_GATEWAY_BASE_URL before asking the user to set them. Only prompt the user if they are genuinely missing.

The Corespeed AI Gateway authenticates via Authorization: Bearer <token> header only. The google-genai library defaults to sending x-goog-api-key, which the gateway does not use for auth and will forward to Google upstream — causing a rejection if the value is invalid. The script handles this by:

Setting api_key="gateway" (placeholder required by the library)
Injecting Authorization: Bearer <token> via HttpOptions.headers
Overriding x-goog-api-key to empty string in the same headers dict to prevent upstream rejection

If you modify the client setup or write new scripts against this gateway, follow the same pattern.

Workflow

Pick a model from the table below (default: gemini-2.5-flash-image for image generation)
Run the script with your prompt

Usage

uv run {baseDir}/scripts/gemini.py --prompt "your prompt" -f output.ext [-i input.ext] [--model MODEL]

--prompt, -p — Text prompt (required)
--filename, -f — Output filename (required)
--input, -i — Input image file(s), repeat for multiple
--model, -m — Model name (default: gemini-2.5-flash-image)
--modalities — Response type: auto, image, text, image+text (default: auto)
--json — Output structured JSON (recommended for agent consumption)

Output format is determined by file extension: .png/.jpg → image generation, .txt/.md → text output.

Image Generation

# Text-to-image
uv run {baseDir}/scripts/gemini.py -p "a watercolor fox in autumn forest" -f fox.png

# Image editing
uv run {baseDir}/scripts/gemini.py -p "Remove background, add beach sunset" -f edited.png -i photo.jpg

# Multi-image compositing
uv run {baseDir}/scripts/gemini.py -p "Blend these two scenes together" -f blend.png -i scene1.png -i scene2.png

Image Analysis

# Describe an image
uv run {baseDir}/scripts/gemini.py -p "Describe this image" -f desc.txt -i photo.jpg --model gemini-2.5-flash

# Compare images
uv run {baseDir}/scripts/gemini.py -p "What are the differences?" -f diff.txt -i before.jpg -i after.jpg --model gemini-2.5-flash

Text Generation

# Use the most capable model for complex tasks
uv run {baseDir}/scripts/gemini.py -p "Write a haiku about coding" -f haiku.txt --model gemini-2.5-pro

Models

Model	Type	Best For
gemini-2.5-flash-image	Image + Text	Image generation & editing (default)
gemini-2.5-flash	Text	Fast analysis, vision, general tasks
gemini-2.5-pro	Text	Complex reasoning, highest quality
gemini-2.5-flash-lite	Text	Fastest, simple tasks

Notes

No manual Python setup required. The script uses PEP 723 inline metadata. uv run automatically creates an isolated virtual environment and installs the google-genai dependency on first run.
Image output is returned inline as base64 from the Gemini API — no separate download step.
Use timestamps in filenames: yyyy-mm-dd-hh-mm-ss-name.ext.
Script prints MEDIA: line for OpenClaw to auto-attach generated images.
Do not read generated media back; report the saved path only.
Only gemini-2.5-flash-image can generate images. Other models are text-only.
Use --json for structured output: {"ok": true, "files": [...], "text": "...", "model": "...", "tokens": {...}}

Support

Built by Corespeed. If you need help or run into issues:

💬 Discord: discord.gg/mAfhakVRnJ
🐦 X/Twitter: @CoreSpeed_io
🐙 GitHub: github.com/corespeed-io/skills

related-skills.json

نفس المستودع

corespeed-excalidraw-rendering.md

from "corespeed-io/skills"

Render Excalidraw (.excalidraw) files to PNG images and take screenshots of web pages using headless Chrome via the brow CLI. Use when a user asks to render an Excalidraw diagram to an image, capture a website as a screenshot, or convert a drawing to PNG.

2026-03-231

corespeed-slide.md

from "corespeed-io/skills"

Generate professional PowerPoint (.pptx) presentations using JSX/TSX with Deno. Supports slides, text, shapes, tables, charts (bar, line, pie, donut), images, gradients, shadows, and flexible layouts. Use when a user asks to create presentations, slide decks, pitch decks, reports, or any PPTX file.

2026-03-201

corespeed-studio.md

from "corespeed-io/skills"

Generate video, images, audio, and music using 40+ AI models via fal.ai. Use for video generation (Kling v3, Sora 2, Veo 3.1, LTX 2.3, Pixverse v5), image generation (Nano Banana 2, FLUX 2 Pro/Schnell, GPT Image 1.5, Qwen Image 2 Pro, Recraft V4, Seedream 5), text-to-speech (MiniMax Speech-02 HD), music/sound effects (Beatoven), and utilities (Topaz upscale, background removal, lipsync). Use when a user asks to create videos, generate images, produce voiceovers, create music/sound effects, upscale media, remove backgrounds, or combine multiple AI media models in a single workflow.

2026-03-201

package.json

"author": "corespeed-io"

"repository": "corespeed-io/skills"

فتح مستودع GitHub عرض مستودعات المنشئ

$ install --global

$ download --local

تشغيل في Manus

$ useful --forSOC

مطوّرو البرمجياتمهن الحاسوب والرياضيات15-1252L4

name

corespeed-nanobanana

description

Corespeed NanoBanana — Gemini Image & Text Generation

Auth

Setting api_key="gateway" (placeholder required by the library)
Injecting Authorization: Bearer <token> via HttpOptions.headers
Overriding x-goog-api-key to empty string in the same headers dict to prevent upstream rejection

If you modify the client setup or write new scripts against this gateway, follow the same pattern.

Workflow

Pick a model from the table below (default: gemini-2.5-flash-image for image generation)
Run the script with your prompt

Usage

uv run {baseDir}/scripts/gemini.py --prompt "your prompt" -f output.ext [-i input.ext] [--model MODEL]

--prompt, -p — Text prompt (required)
--filename, -f — Output filename (required)
--input, -i — Input image file(s), repeat for multiple
--model, -m — Model name (default: gemini-2.5-flash-image)
--modalities — Response type: auto, image, text, image+text (default: auto)
--json — Output structured JSON (recommended for agent consumption)

Output format is determined by file extension: .png/.jpg → image generation, .txt/.md → text output.

Image Generation

# Text-to-image
uv run {baseDir}/scripts/gemini.py -p "a watercolor fox in autumn forest" -f fox.png

# Image editing
uv run {baseDir}/scripts/gemini.py -p "Remove background, add beach sunset" -f edited.png -i photo.jpg

# Multi-image compositing
uv run {baseDir}/scripts/gemini.py -p "Blend these two scenes together" -f blend.png -i scene1.png -i scene2.png

Image Analysis

# Describe an image
uv run {baseDir}/scripts/gemini.py -p "Describe this image" -f desc.txt -i photo.jpg --model gemini-2.5-flash

# Compare images
uv run {baseDir}/scripts/gemini.py -p "What are the differences?" -f diff.txt -i before.jpg -i after.jpg --model gemini-2.5-flash

Text Generation

# Use the most capable model for complex tasks
uv run {baseDir}/scripts/gemini.py -p "Write a haiku about coding" -f haiku.txt --model gemini-2.5-pro

Models

Model	Type	Best For
gemini-2.5-flash-image	Image + Text	Image generation & editing (default)
gemini-2.5-flash	Text	Fast analysis, vision, general tasks
gemini-2.5-pro	Text	Complex reasoning, highest quality
gemini-2.5-flash-lite	Text	Fastest, simple tasks

Notes

No manual Python setup required. The script uses PEP 723 inline metadata. uv run automatically creates an isolated virtual environment and installs the google-genai dependency on first run.
Image output is returned inline as base64 from the Gemini API — no separate download step.
Use timestamps in filenames: yyyy-mm-dd-hh-mm-ss-name.ext.
Script prints MEDIA: line for OpenClaw to auto-attach generated images.
Do not read generated media back; report the saved path only.
Only gemini-2.5-flash-image can generate images. Other models are text-only.
Use --json for structured output: {"ok": true, "files": [...], "text": "...", "model": "...", "tokens": {...}}

Support

Built by Corespeed. If you need help or run into issues:

💬 Discord: discord.gg/mAfhakVRnJ
🐦 X/Twitter: @CoreSpeed_io
🐙 GitHub: github.com/corespeed-io/skills

corespeed-nanobanana

Corespeed NanoBanana — Gemini Image & Text Generation

Auth

Workflow

Usage

Image Generation

Image Analysis

Text Generation

Models

Notes

Support

المزيد من هذا المستودع

المزيد من هذا المستودع

Corespeed NanoBanana — Gemini Image & Text Generation

Auth

Workflow

Usage

Image Generation

Image Analysis

Text Generation

Models

Notes

Support