sn-image-base

Name: Sn Image Base
Author: OpenSenseNova

Base-layer skill for the SenseNova-Skills project, providing low-level APIs for image generation, recognition (VLM), and text optimization (LLM). This skill does not preprocess inputs; it only calls backend services and returns results. This skill is not user-facing and is intended for upper-layer skills only.

stars: 2,899

forks: 215

updated: May 27, 2026 at 02:22

related-skills.json

same repository

sn-infographic.md

from "OpenSenseNova/SenseNova-Skills"

Generates professional infographics with various layout types and visual styles. Analyzes content, recommends layout and style, and generates publication-ready infographics. Use when user asks to create "infographic", "信息图", "visual summary", or "可视化".

2026-05-27 · 2.9k

sn-ppt-creative.md

from "OpenSenseNova/SenseNova-Skills"

Creative-mode PPT pipeline. One full-page 16:9 PNG per slide. LLM / VLM calls go through sn-ppt-standard/lib/model_client.py (shared thin client). Text-to-image (the actual png rendering) goes through sn-image-base/scripts/sn_agent_runner.py. Expects task_pack.json + info_pack.json already written by sn-ppt-entry.

2026-05-25 · 2.9k

sn-ppt-doctor.md

from "OpenSenseNova/SenseNova-Skills"

Environment diagnostic for the PPT family. Validates sn-image-base, API keys, Node runtime, and optional deps; interactively writes .env for required vars. Runs before sn-ppt-entry; does not modify sn-image-* skills.

2026-05-25 · 2.9k

sn-ppt-standard.md

from "OpenSenseNova/SenseNova-Skills"

Standard-mode PPT pipeline. All LLM / VLM / T2I calls are wrapped in a single CLI entry (scripts/run_stage.py). The main agent's job is simple: emit ONE shell command per stage, never write loops, never write prompts.

2026-05-25 · 2.9k

sn-update.md

from "OpenSenseNova/SenseNova-Skills"

Update SenseNova Skills (the sn-* bundle) inside an OpenClaw or hermes-agent install. ALWAYS use this skill when the user says any of: "update SenseNova skills", "update SN skills", "更新 sensenova skills", "更新 sn skills", "刷新 sn-*", "升级 sn-* skills", or names a specific sn-* skill to update (e.g. "更新 sn-ppt-standard", "refresh sn-image-base"). Default scope is the whole sn-* bundle; if the user names specific skills, update ONLY those.

2026-05-07 · 2.9k

sn-da-excel-workflow.md

from "OpenSenseNova/SenseNova-Skills"

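Multi-step orchestrator for Excel data analysis. Covers: (1) reading multi-sheet Excel files and counting rows, (2) large-file detection (files with ≥10k rows are automatically optimized via Parquet), (3) data cleaning (missing values, text normalization, invalid characters), (4) conditional filtering and category extraction, (5) cross-sheet statistical aggregation, (6) exporting Excel/CSV and providing download links. Covers the full flow from data loading to report generation, orchestrating capability sub-skills step by step. **Use this skill proactively in any of the following cases instead of answering with a few lines of ad hoc pandas**: ① the user uses a trigger phrase: Excel 分析 / 表格分析 / 数据分析 / 数据清洗 / 数据统计 / 数据筛选 / 数据可视化 / 数据导出 / 汇总统计 / 透视表 / 分组统计 / 交叉分析 / 趋势分析 / 对比分析 / 异常值检测 / 去重 / 缺失值处理 / Excel 报告 / 生成报表 / analyze Excel / data analysis / data cleaning / pivot table; ② the user uploads or specifies an .xlsx / .xls / .csv file and asks for analysis, cleaning, statistics, or visualization; ③ the task involves any of multi-sheet reading, conditional filtering, grouped summaries, or chart generation; ④ the user asks to export a formatted Excel report or a download link. Do not use it for: pure text processing that involves no tabular data, image analysis (use sn-da-image-caption), or simple Q&A about a single formula calculation.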

2026-05-07 · 2.9k

package.json

"author": "OpenSenseNova"

"repository": "OpenSenseNova/SenseNova-Skills"

name	sn-image-base
description	Base-layer skill for the SenseNova-Skills project, providing low-level APIs for image generation, recognition (VLM), and text optimization (LLM). This skill does not preprocess inputs; it only calls backend services and returns results. This skill is not user-facing and is intended for upper-layer skills only.
triggers	["SenseNova-Skills Image Generation","SenseNova-Skills 图像基础工具","sn 图像基础工具","SenseNova 图像基础工具","SenseNova Image Generation","sn-image-base"]
metadata	{"project":"SenseNova-Skills","tier":0,"category":"infrastructure","user_visible":false}

sn-image-base

Dependency Installation

pip install -r requirements.txt

Overview

sn-image-base is the base-layer skill (tier 0) of the SenseNova-Skills project and provides three low-level tools:

sn-image-generate: image generation (calls the text-to-image-no-enhance API)
sn-image-recognize: image recognition (uses a VLM to analyze image content)
sn-text-optimize: text optimization (uses an LLM to process text)

This skill performs no input preprocessing; it only calls backend services and returns their results.

Tools List

sn-image-generate

Image generation tool that calls the text-to-image-no-enhance API.

--prompt is required; all other parameters are optional:

Parameter	Type	Default	Description
`--prompt`	string	Required	Prompt text for image generation
`--negative-prompt`	string	`""`	Negative prompt
`--image-size`	string	`2k`	Image size preset (case-insensitive). `2k` is recommended. `4k` is optional and requires model support (SenseNova rejects it and returns `status=failed`). Any other value also returns `status=failed`.
`--aspect-ratio`	string	`16:9`	Aspect ratio, e.g. `1:1`, `16:9`, `9:16`
`--seed`	int	`None`	Random seed for reproducible generation
`--unet-name`	string	`None`	Specify a UNet model name
`--api-key`	string	`SN_IMAGE_GEN_API_KEY` -> `SN_API_KEY`	API key (CLI argument has priority; `MissingApiKeyError` is raised when all are empty)
`--base-url`	string	`SN_IMAGE_GEN_BASE_URL` -> `SN_BASE_URL`	API base URL (CLI argument has priority)
`--poll-interval`	float	`5.0`	Polling interval (seconds)
`--timeout`	float	`300.0`	Timeout (seconds)
`--insecure`	flag	`False`	Disable TLS verification
`--save-path`	Path	Auto-generated	Save path
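
The table above can be turned into a small argument builder. The following is a minimal sketch, not part of the skill: `build_generate_argv` and `KNOWN_SIZES` are hypothetical names, and the client-side size check is an assumption added for illustration (the tool itself reports `status=failed` for unsupported sizes).

```python
import sys
from typing import List, Optional

RUNNER = "scripts/sn_agent_runner.py"  # entrypoint path as given in the Usage section
KNOWN_SIZES = {"2k", "4k"}  # presets named in the table; 4k depends on backend support


def build_generate_argv(
    prompt: str,
    image_size: str = "2k",
    aspect_ratio: str = "16:9",
    seed: Optional[int] = None,
    negative_prompt: str = "",
) -> List[str]:
    """Build an argv list for sn-image-generate (hypothetical helper)."""
    if not prompt:
        raise ValueError("--prompt is required")
    size = image_size.lower()  # presets are documented as case-insensitive
    if size not in KNOWN_SIZES:
        # Assumption: fail early on the client instead of waiting for status=failed.
        raise ValueError(f"unsupported image size: {image_size!r}")
    argv = [sys.executable, RUNNER, "sn-image-generate",
            "--prompt", prompt,
            "--image-size", size,
            "--aspect-ratio", aspect_ratio]
    if negative_prompt:
        argv += ["--negative-prompt", negative_prompt]
    if seed is not None:
        argv += ["--seed", str(seed)]
    return argv
```

Passing the list to `subprocess.run` avoids shell quoting issues with prompts that contain spaces or quotes.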

sn-image-recognize

Image recognition tool that uses a VLM (vision-language model) to analyze image content. Supports multiple image inputs.

--images and --user-prompt (or --user-prompt-path) are required. All other parameters use three-level defaults (CLI > env var > built-in default):

Parameter	Type	Built-in Default	Env Var	Description
`--api-key`	string	No hardcoded default	`SN_VISION_API_KEY` -> `SN_CHAT_API_KEY` -> `SN_API_KEY`	Chat runtime API key; raises `MissingApiKeyError` when all are unset
`--base-url`	string	`https://token.sensenova.cn/v1`	`SN_VISION_BASE_URL` -> `SN_CHAT_BASE_URL` -> `SN_BASE_URL`	Vision provider base URL; falls back to shared chat/global provider
`--model`	string	`sensenova-6.7-flash-lite`	`SN_VISION_MODEL` -> `SN_CHAT_MODEL`	Vision-capable model name
`--vlm-type`	string	`openai-completions`	`SN_VISION_TYPE` -> `SN_CHAT_TYPE`	Chat protocol type override
`--user-prompt-path`	string	`None`	-	Local file path, mutually exclusive with `--user-prompt`
`--system-prompt-path`	string	`None`	-	Local file path, mutually exclusive with `--system-prompt`

Available values for --vlm-type:

openai-completions: OpenAI-compatible /v1/chat/completions interface
anthropic-messages: Anthropic Messages /v1/messages interface

sn-text-optimize

Text optimization tool that uses an LLM (large language model) to optimize text content. Does not accept image inputs.

--user-prompt (or --user-prompt-path) is required. All other parameters use three-level defaults (CLI > env var > built-in default):

Parameter	Type	Built-in Default	Env Var	Description
`--api-key`	string	No hardcoded default	`SN_TEXT_API_KEY` -> `SN_CHAT_API_KEY` -> `SN_API_KEY`	Chat runtime API key; raises `MissingApiKeyError` when all are unset
`--base-url`	string	`https://token.sensenova.cn/v1`	`SN_TEXT_BASE_URL` -> `SN_CHAT_BASE_URL` -> `SN_BASE_URL`	Text provider base URL; falls back to shared chat/global provider
`--model`	string	`sensenova-6.7-flash-lite`	`SN_TEXT_MODEL` -> `SN_CHAT_MODEL`	Text model name
`--llm-type`	string	`openai-completions`	`SN_TEXT_TYPE` -> `SN_CHAT_TYPE`	Chat protocol type override
`--user-prompt-path`	string	`None`	-	Local file path, mutually exclusive with `--user-prompt`
`--system-prompt-path`	string	`None`	-	Local file path, mutually exclusive with `--system-prompt`

Available values for --llm-type:

openai-completions: OpenAI-compatible /v1/chat/completions interface
anthropic-messages: Anthropic Messages /v1/messages interface

VLM vs LLM

Tool	Model Type	Image Input	Interface Type Parameter
`sn-image-recognize`	VLM (Vision Language Model)	Yes, supports multiple images	`--vlm-type`
`sn-text-optimize`	LLM (Language Model)	No, text only	`--llm-type`

Usage

All tools are called through the unified sn_agent_runner.py entrypoint:

# Image generation (only --prompt is required; api-key and base-url fall back to environment variables)
python scripts/sn_agent_runner.py sn-image-generate \
    --prompt "..."

# Image generation (override base-url)
python scripts/sn_agent_runner.py sn-image-generate \
    --prompt "..." \
    --base-url "https://custom-endpoint.com/v1"

# Image generation (explicitly override api-key)
python scripts/sn_agent_runner.py sn-image-generate \
    --prompt "..." \
    --api-key "sk-xxx"

# Image recognition (VLM) - minimal call (uses built-in SenseNova defaults)
python scripts/sn_agent_runner.py sn-image-recognize \
    --user-prompt "Describe the image" \
    --images "path/to/image.png"

# Image recognition (VLM) - override to an Anthropic-compatible endpoint (Messages interface)
python scripts/sn_agent_runner.py sn-image-recognize \
    --user-prompt "Describe the image" \
    --images "path/to/image.png" \
    --api-key "sk-ant-xxx" \
    --base-url "https://api.anthropic.com" \
    --model "claude-sonnet-4-6" \
    --vlm-type "anthropic-messages"

# Text optimization (LLM) - minimal call (uses built-in SenseNova defaults)
python scripts/sn_agent_runner.py sn-text-optimize \
    --user-prompt "Optimize the text: ..."

# Text optimization (LLM) - override to an Anthropic-compatible endpoint (Messages interface)
python scripts/sn_agent_runner.py sn-text-optimize \
    --user-prompt "Optimize the text: ..." \
    --api-key "sk-ant-xxx" \
    --base-url "https://api.anthropic.com" \
    --model "claude-sonnet-4-6" \
    --llm-type "anthropic-messages"
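
An upper-layer skill written in Python could wrap these shell calls as follows. This is a sketch under stated assumptions: `run_tool` is a hypothetical helper, and it assumes that with `--output-format json` the runner prints a single JSON object to stdout and nothing else.

```python
import json
import subprocess
import sys
from typing import Dict, List


def run_tool(tool: str, flags: List[str],
             runner: str = "scripts/sn_agent_runner.py",
             timeout: float = 600.0) -> Dict:
    """Run one runner subcommand with --output-format json and parse stdout."""
    cmd = [sys.executable, runner, tool, *flags, "--output-format", "json"]
    proc = subprocess.run(cmd, capture_output=True, text=True, timeout=timeout)
    try:
        return json.loads(proc.stdout)
    except json.JSONDecodeError:
        # Assumption: if stdout is not JSON, report a failure-shaped dict instead.
        message = proc.stderr.strip() or proc.stdout.strip()
        return {"status": "failed", "error": message}
```

For example, `run_tool("sn-text-optimize", ["--user-prompt", "Optimize the text: ..."])` corresponds to the minimal text optimization call above, with JSON output requested.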

Default Parameter Behavior

The connection and authentication parameters for sn-image-generate resolve as follows:

Parameter	Default	Override	Description
`--base-url`	`SN_IMAGE_GEN_BASE_URL` -> `SN_BASE_URL`	`--base-url "..."`	CLI argument has priority
`--api-key`	`SN_IMAGE_GEN_API_KEY` -> `SN_API_KEY`	`--api-key "..."`	CLI argument has priority; raises `MissingApiKeyError` if all values are empty

sn-image-recognize and sn-text-optimize resolve each parameter in this priority order: CLI argument > command-specific env var > shared SN_CHAT_* env var > global SN_* env var > built-in default.

Parameter	Built-in Default	Vision Env Var	Text Env Var
`--api-key`	None (must be provided)	`SN_VISION_API_KEY` -> `SN_CHAT_API_KEY` -> `SN_API_KEY`	`SN_TEXT_API_KEY` -> `SN_CHAT_API_KEY` -> `SN_API_KEY`
`--base-url`	`https://token.sensenova.cn/v1`	`SN_VISION_BASE_URL` -> `SN_CHAT_BASE_URL` -> `SN_BASE_URL`	`SN_TEXT_BASE_URL` -> `SN_CHAT_BASE_URL` -> `SN_BASE_URL`
`--model`	`sensenova-6.7-flash-lite`	`SN_VISION_MODEL` -> `SN_CHAT_MODEL`	`SN_TEXT_MODEL` -> `SN_CHAT_MODEL`
`--vlm-type` / `--llm-type`	`openai-completions`	`SN_VISION_TYPE` -> `SN_CHAT_TYPE`	`SN_TEXT_TYPE` -> `SN_CHAT_TYPE`

api_key resolution order (high to low): CLI --api-key > command-specific key (SN_VISION_API_KEY/SN_TEXT_API_KEY) > SN_CHAT_API_KEY > SN_API_KEY. If all are unset, MissingApiKeyError is raised.

Only --api-key must be provided via CLI or environment; base URL, model, and interface type fall back to the built-in defaults listed above.
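
The resolution order described above can be sketched in a few lines. This is an illustration for the vision tool only, not the runner's actual code: `first_set` and `resolve_vision_settings` are hypothetical names, `MissingApiKeyError` is a local stand-in for the error named in the text, and treating an empty string as unset is an assumption.

```python
import os
from typing import Mapping, Optional, Sequence


class MissingApiKeyError(RuntimeError):
    """Local stand-in for the error named in the text."""


def first_set(cli_value: Optional[str], env_names: Sequence[str],
              env: Mapping[str, str], default: Optional[str] = None) -> Optional[str]:
    """Return the CLI value, else the first non-empty env var, else the default."""
    if cli_value:
        return cli_value
    for name in env_names:
        value = env.get(name)
        if value:  # assumption: an empty string counts as unset
            return value
    return default


def resolve_vision_settings(cli: Mapping[str, Optional[str]],
                            env: Mapping[str, str] = os.environ) -> dict:
    """Resolve sn-image-recognize settings: CLI > specific > SN_CHAT_* > SN_* > default."""
    api_key = first_set(cli.get("api_key"),
                        ["SN_VISION_API_KEY", "SN_CHAT_API_KEY", "SN_API_KEY"], env)
    if api_key is None:
        raise MissingApiKeyError("no API key on the CLI or in the environment")
    return {
        "api_key": api_key,
        "base_url": first_set(cli.get("base_url"),
                              ["SN_VISION_BASE_URL", "SN_CHAT_BASE_URL", "SN_BASE_URL"],
                              env, "https://token.sensenova.cn/v1"),
        "model": first_set(cli.get("model"), ["SN_VISION_MODEL", "SN_CHAT_MODEL"],
                           env, "sensenova-6.7-flash-lite"),
        "interface_type": first_set(cli.get("vlm_type"),
                                    ["SN_VISION_TYPE", "SN_CHAT_TYPE"],
                                    env, "openai-completions"),
    }
```

The text tool would follow the same pattern with the `SN_TEXT_*` variables in place of `SN_VISION_*`.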

Agent Configuration Integration

The agent can automatically read parameters from openclaw.json without manual input:

CLI Parameter	openclaw.json Field	Example
`--base-url`	`providers.<name>.baseUrl`	`https://api.anthropic.com`
`--llm-type`	`providers.<name>.api`	`anthropic-messages` / `openai-completions`
`--vlm-type`	`providers.<name>.api`	`anthropic-messages` / `openai-completions`
`--model`	`providers.<name>.models[].id`	`claude-sonnet-4-6`
`--api-key`	`providers.<name>.apiKey` or env var	`sk-cp-...`

Note: --llm-type and --vlm-type share the same providers.<name>.api field and are used by LLM and VLM tools respectively.

Mapping between providers.<name>.api and interface type:

api Value	Corresponding `--llm-type` / `--vlm-type`	Endpoint Path
`anthropic-messages`	`anthropic-messages`	`/v1/messages`
`openai-completions`	`openai-completions`	`/v1/chat/completions`
`openai-responses`	(future extension)	`/responses`
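
The two mapping tables above can be combined into one translation step. The sketch below is illustrative: `provider_to_flags` is a hypothetical helper that takes the `providers` mapping directly (where that mapping sits inside openclaw.json is not specified here), and picking the first entry of `models` is an assumption.

```python
from typing import Dict, List

# api values that currently map to an interface type (openai-responses is a future extension)
SUPPORTED_APIS = {"anthropic-messages", "openai-completions"}


def provider_to_flags(providers: Dict[str, dict], name: str, tool: str) -> List[str]:
    """Translate one providers.<name> entry into runner CLI flags."""
    provider = providers[name]
    api = provider["api"]
    if api not in SUPPORTED_APIS:
        raise ValueError(f"api {api!r} has no matching interface type yet")
    # The same api field feeds --vlm-type for the VLM tool and --llm-type for the LLM tool.
    type_flag = "--vlm-type" if tool == "sn-image-recognize" else "--llm-type"
    flags = ["--base-url", provider["baseUrl"], type_flag, api]
    models = provider.get("models") or []
    if models:
        flags += ["--model", models[0]["id"]]  # assumption: use the first listed model
    if provider.get("apiKey"):
        flags += ["--api-key", provider["apiKey"]]  # otherwise rely on env vars
    return flags
```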

Mapping Between base-url and Interface Type

Different API types have different requirements for base-url format:

Type	`--llm-type` / `--vlm-type`	Recommended base-url	Code Appended Path	Final URL Example
LLM	`openai-completions`	`https://token.sensenova.cn/v1`	`/chat/completions`	`https://token.sensenova.cn/v1/chat/completions`
LLM	`anthropic-messages`	`https://api.anthropic.com/v1`	`/messages`	`https://api.anthropic.com/v1/messages`
VLM	`openai-completions`	`https://token.sensenova.cn/v1`	`/chat/completions`	`https://token.sensenova.cn/v1/chat/completions`
VLM	`anthropic-messages`	`https://api.anthropic.com/v1`	`/messages`	`https://api.anthropic.com/v1/messages`

Note:

Recommended chat base URLs include the provider API version path, for example /v1.
For compatibility, if the configured chat base URL has no path, the runner appends /v1/chat/completions or /v1/messages.
If the configured chat base URL already has a path such as /v1, the runner appends only /chat/completions or /messages.
Some providers use versioned paths other than /v1, such as Gemini's /v1beta/openai.
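
The path-joining rules in the note can be expressed as a short function. This is a sketch of the described behavior, not the runner's source: `build_endpoint` and `SUFFIX` are hypothetical names, and `example.com` stands in for a provider with a non-/v1 versioned path.

```python
from urllib.parse import urlsplit, urlunsplit

# Path suffix appended for each interface type, per the table above
SUFFIX = {
    "openai-completions": "/chat/completions",
    "anthropic-messages": "/messages",
}


def build_endpoint(base_url: str, interface_type: str) -> str:
    """Join a chat base URL and the interface-specific path."""
    parts = urlsplit(base_url)
    path = parts.path.rstrip("/")
    if not path:
        path = "/v1"  # bare host: add the default version segment
    return urlunsplit((parts.scheme, parts.netloc, path + SUFFIX[interface_type],
                       parts.query, parts.fragment))


print(build_endpoint("https://api.anthropic.com", "anthropic-messages"))
# prints https://api.anthropic.com/v1/messages
print(build_endpoint("https://example.com/v1beta/openai", "openai-completions"))
# prints https://example.com/v1beta/openai/chat/completions
```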

Output Format

All tools support two output formats:

--output-format text (default): outputs plain text result
--output-format json: outputs JSON, including status and elapsed_seconds (runtime in seconds, rounded to 2 decimals)

JSON output for sn-image-recognize and sn-text-optimize also includes model, base_url, and interface_type to verify the effective runtime configuration:

{
  "status": "ok",
  "result": "...",
  "model": "sensenova-6.7-flash-lite",
  "base_url": "https://token.sensenova.cn/v1",
  "interface_type": "openai-completions",
  "elapsed_seconds": 1.23
}

On failure:

{
  "status": "failed",
  "error": "error message",
  "elapsed_seconds": 0.05
}
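
A caller can branch on `status` to separate the two payload shapes shown above. A minimal sketch, assuming the `result` and `error` keys appear as in these examples; `ToolFailed` and `unwrap_result` are hypothetical names.

```python
import json


class ToolFailed(RuntimeError):
    """Hypothetical exception raised for a status=failed payload."""


def unwrap_result(stdout: str) -> str:
    """Return the result text from a JSON payload, or raise on failure."""
    payload = json.loads(stdout)
    if payload.get("status") != "ok":
        raise ToolFailed(payload.get("error", "unknown error"))
    return payload["result"]
```

Upper-layer skills that need to confirm the effective configuration can read `model`, `base_url`, and `interface_type` from the same parsed payload.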

Input/Output Specification

See references/api_spec.md for details.
