원클릭으로 Manus에서 모든 스킬 실행

generate-image

스타10

포크0

업데이트2026년 5월 30일 21:43

Generate an original image from a text description, locally (Bonsai-Image 4B, ternary-quantized, on a long-lived studio server). Use when the user wants a picture, illustration, avatar, or face created from a description that does not already exist on the web. For existing photos of real things, prefer image search instead; for simple diagrams or line drawings, prefer authoring inline SVG.

설치

Codex 또는 Claude로 설치 이 Prompt를 복사해 Codex, Claude 또는 다른 어시스턴트에 붙여 넣으면 Skill 페이지를 검토하고 설치를 진행할 수 있습니다.

Manus에서 실행

출처

bdambrosio

bdambrosio/Cognitive_workbench

GitHub 저장소 열기 Creator 저장소 보기

다운로드

Manus에서 실행

generate-image

Produces an original image from a prompt using Bonsai-Image 4B (ternary 1.58-bit) running locally on the GPU via the prism-image-studio backend. The backend is a long-lived server; the tool is a thin HTTP client that POSTs the prompt and saves the returned PNG.

The tool talks to the FastAPI backend directly: POST {base}/generate, with a {base}/healthz readiness probe. Base URL defaults to http://localhost:4001 (override with GENERATE_IMAGE_URL). If the backend isn't already running, the tool launches it via the Bonsai-Image-Demo serve.sh and waits for /healthz (a cold boot prewarms the model and can take a couple of minutes). To start it by hand:

BACKEND_PORT=4001 FRONTEND_PORT=3100 ./scripts/serve.sh

4001 is the backend port here. The tool bypasses the Next.js studio frontend (/api/generate), so its port is irrelevant and its prompt-moderation gate is skipped.

Best for invented/illustrative imagery — characters, faces, scenes, styled graphics. It is a generator, not a search: it cannot reproduce a specific real photograph or a named existing image. There is no separate negative prompt; describe exactly what you want (including what to leave out) in the prompt.

Examples

{"thought": "the user wants a friendly assistant face drawn", "tool": "generate-image", "prompt": "a friendly cartoon robot assistant face, large round eyes, gentle smile, flat vector illustration, pastel background, centered"}

{"thought": "generate a calm scene with no people or text", "tool": "generate-image", "prompt": "a quiet misty lake at dawn, soft pastel sky, minimalist, no people, no text", "seed": 7}

이 저장소의 다른 Skills

같은 저장소

get-financial-statements

bdambrosio/Cognitive_workbench

Fetch standardized financial statements (income, balance sheet, cash flow, earnings, company overview) for a ticker from Alpha Vantage. Returns combined annual+quarterly JSON for analysis.

2026-06-1910

look-at-target

bdambrosio/Cognitive_workbench

Aim the ChatterBot head to find and center a target in view — the user, a person, an object, an animal. Runs a closed visual loop (capture, judge where the target is, nudge pan/tilt, repeat) until the target is centered, or reports it could not find the target after searching. Use when the user says point at me, look at me, turn to face someone, find the cat, center on the person. For a one-off snapshot without re-aiming use camera-capture; for a manual fixed angle use head-move.

2026-06-1310

camera-capture

bdambrosio/Cognitive_workbench

Capture a still photo from the ChatterBot head camera. The captured frame is attached to your own visual input, so you can SEE it and answer questions about what is in view — whether the user is present, whether there is a cat, what the scene looks like. The camera rides the pan/tilt head, so it shows whatever the head is currently aimed at; aim first with head-move if needed. To also show the photo to the user on screen, follow with the display tool (the observation includes a ready <img> URL). Use when the user asks what you see, to take a picture or snapshot, or to check whether something or someone is in view.

2026-06-1310

head-move

bdambrosio/Cognitive_workbench

Move the ChatterBot head — aim the pan/tilt camera or play an expressive gesture. The bot is a stationary companion head; this points its gaze, it does NOT drive or navigate. Use when the user asks you to look somewhere, turn toward/away, look up/down, re-center, or nod/shake/scan. Angles are degrees 0-180 with 90 centered (pan 0=full right, 180=full left; tilt 0=down, 180=up, mounting-dependent). Give pan and/or tilt for absolute aim, OR a gesture (not both). Returns the confirmed pose once the head settles.

2026-06-1310

ramana-saying

bdambrosio/Cognitive_workbench

Return one genuine saying of Ramana Maharshi, drawn verbatim from his recorded talks, with source attribution. Use when delivering an authentic Ramana quote with attribution — not a paraphrase or a synthesized reflection. The returned text is a raw quote; add your own brief framing before presenting it.

2026-06-0910

assess

bdambrosio/Cognitive_workbench

Check whether text matches a natural-language condition. Returns "true" or "false". Auto-chunks long texts and short-circuits on first match.

2026-05-2510

name	generate-image
description	Generate an original image from a text description, locally (Bonsai-Image 4B, ternary-quantized, on a long-lived studio server). Use when the user wants a picture, illustration, avatar, or face created from a description that does not already exist on the web. For existing photos of real things, prefer image search instead; for simple diagrams or line drawings, prefer authoring inline SVG.
args	{"prompt":"required string — what to depict, e.g. \"a friendly cartoon robot face, soft smile, flat vector style\". State qualities you want directly; the model follows prompts literally.","steps":"optional int (default 4) — denoising steps; Bonsai is distilled for 4, more buys little","size":"optional int (default 512) — square side length in pixels. 512 is the fast preset; 1024 is the quality preset (slower).","seed":"optional int — fix for reproducible output; omit for variety"}

generate-image

BACKEND_PORT=4001 FRONTEND_PORT=3100 ./scripts/serve.sh

4001 is the backend port here. The tool bypasses the Next.js studio frontend (/api/generate), so its port is irrelevant and its prompt-moderation gate is skipped.

Examples

{"thought": "the user wants a friendly assistant face drawn", "tool": "generate-image", "prompt": "a friendly cartoon robot assistant face, large round eyes, gentle smile, flat vector illustration, pastel background, centered"}

{"thought": "generate a calm scene with no people or text", "tool": "generate-image", "prompt": "a quiet misty lake at dawn, soft pastel sky, minimalist, no people, no text", "seed": 7}