Ejecuta cualquier Skill en Manus
con un clic

Ejecuta cualquier Skill en Manus con un clic

$pwd:

glm-vision

Name: Glm Vision
Author: archibate

// This skill should be used when the user sends an image and asks to "analyze this image", "describe this picture", "what's in this image", or any request requiring visual understanding of images. Provides image analysis using Zhipu GLM-4.6V multimodal model.

Ejecutar en Manus

$ git log --oneline --stat

stars:102

forks:19

updated:2 de abril de 2026, 01:56

Explorador de archivos

3 archivos

SKILL.md

readonly

name	glm-vision
description	This skill should be used when the user sends an image and asks to "analyze this image", "describe this picture", "what's in this image", or any request requiring visual understanding of images. Provides image analysis using Zhipu GLM-4.6V multimodal model.
version	0.1.0

GLM Vision - 图片分析技能

概述

使用智谱 GLM-4.6V 多模态模型分析图片内容。支持：

图片内容描述和理解
图片中的文字提取（OCR）
图片元素识别和分析
多图对比分析
视频内容理解

使用场景

当用户发送图片并提出以下类型问题时触发：

"这张图片里有什么？"
"请描述这张图片"
"分析这个截图"
"这是什么？"
"图片里的文字是什么？"
"比较这两张图片"

API 配置

环境变量

需要设置 ZHIPU_API_KEY 环境变量：

export ZHIPU_API_KEY="your-api-key-here"

或者在 ~/.claude/settings.json 中配置：

{
  "env": {
    "ZHIPU_API_KEY": "your-api-key-here"
  }
}

模型选择

模型	用途	价格
`glm-4.6v-flash`	免费版，日常使用	免费
`glm-4.6v`	旗舰版，复杂推理	付费
`glm-4.6v-flashx`	轻量高速版	付费

默认使用 glm-4.6v-flash 免费模型。

调用方式

方式一：使用辅助脚本（推荐）

python3 ~/.claude/skills/glm-vision/scripts/analyze_image.py \
  --image "/path/to/image.jpg" \
  --prompt "请描述这张图片" \
  [--model "glm-4.6v-flash"]

参数说明：

--image / -i: 图片路径（支持 URL 或本地文件）
--prompt / -p: 分析提示词
--model / -m: 模型名称（可选，默认免费版）
--detail / -d: 启用详细思考模式

方式二：直接 API 调用

import base64
from openai import OpenAI

client = OpenAI(
    api_key=os.environ.get("ZHIPU_API_KEY"),
    base_url="https://open.bigmodel.cn/api/paas/v4"
)

# 读取本地图片
with open("image.jpg", "rb") as f:
    img_base64 = base64.b64encode(f.read()).decode("utf-8")

response = client.chat.completions.create(
    model="glm-4.6v-flash",
    messages=[{
        "role": "user",
        "content": [
            {"type": "image_url", "image_url": {"url": img_base64}},
            {"type": "text", "text": "请描述这张图片"}
        ]
    }]
)
print(response.choices[0].message.content)

方式三：URL 图片

response = client.chat.completions.create(
    model="glm-4.6v-flash",
    messages=[{
        "role": "user",
        "content": [
            {"type": "image_url", "image_url": {"url": "https://example.com/image.jpg"}},
            {"type": "text", "text": "分析这张图片"}
        ]
    }]
)

工作流程

接收图片: 用户发送图片，保存到临时目录
构建请求: 将图片转为 base64 或使用 URL
调用 API: 发送到 GLM-4.6V 模型
返回结果: 解析并展示分析结果

支持的图片格式

JPEG / JPG
PNG
GIF（静态）
WebP
BMP

示例用法

基础图片描述

python3 ~/.claude/skills/glm-vision/scripts/analyze_image.py \
  -i "/tmp/screenshot.png" \
  -p "描述这张截图的内容"

详细分析

python3 ~/.claude/skills/glm-vision/scripts/analyze_image.py \
  -i "/tmp/photo.jpg" \
  -p "分析这张图片的构图、色彩和主题" \
  --detail

OCR 文字提取

python3 ~/.claude/skills/glm-vision/scripts/analyze_image.py \
  -i "/tmp/document.png" \
  -p "提取图片中的所有文字，保持原有格式"

URL 图片分析

python3 ~/.claude/skills/glm-vision/scripts/analyze_image.py \
  -i "https://example.com/image.jpg" \
  -p "这张图片展示的是什么？"

注意事项

API Key 安全: 不要在代码中硬编码 API Key
图片大小: 建议图片小于 10MB，过大的图片会增加延迟
网络连接: 需要稳定的网络访问智谱 API
并发限制: 免费用户并发数为 5

参考资源

references/api-reference.md - 完整 API 参考文档
scripts/analyze_image.py - 图片分析辅助脚本

related-skills.json

mismo repositorio

agent-browser.md

from "archibate/dotfiles-opencode"

Browser automation CLI for AI agents. Use when needs to interact with websites, including navigating pages, filling forms, clicking buttons, taking screenshots, extracting data, testing web apps, or automating any browser task. TRIGGER when user requests to "open a website", "fill out a form", "click a button", "take a screenshot", "debug this in browser", "scrape data from a page", "test this web app", "login to a site", "frontend UI/UX aesthetics", "automate browser actions", or any task requiring programmatic web interaction.

2026-04-02102

frontend-design.md

from "archibate/dotfiles-opencode"

Design and implement distinctive, production-ready frontend interfaces with strong aesthetic direction. Use when asked to create or restyle web pages, components, or applications (HTML/CSS/JS, React, Vue, etc.).

2026-04-02102

jina-ai.md

from "archibate/dotfiles-opencode"

Use Jina AI APIs for converting URLs to LLM-friendly Markdown (Reader) and searching the web (Search).

2026-04-02102

mcp-duckgo.md

from "archibate/dotfiles-opencode"

This skill should be used for web search and content scraping via DuckDuckGo MCP Server.

2026-04-02102

openscad.md

from "archibate/dotfiles-opencode"

Create and render OpenSCAD 3D models. Generate preview images from multiple angles, extract customizable parameters, validate syntax, and export STL files for 3D printing.

2026-04-02102

pueue.md

from "archibate/dotfiles-opencode"

This skill should be used before running non-interactive long-running tasks, computation intensive tasks, background tasks, or needs guidance on the pueue CLI tool usage. TRIGGER when user says "use pueue", "run in background", "queue this task", or when about to run any long-running (>2 min) task.

2026-04-02102

package.json

"author": "archibate"

"repository": "archibate/dotfiles-opencode"

Abrir repositorio de GitHub Ver repositorios del creador

$ install --global

$ download --local

Ejecutar en Manus

$ useful --forSOC

Desarrolladores de softwareOcupaciones informáticas y matemáticas15-1252L4

name	glm-vision
description	This skill should be used when the user sends an image and asks to "analyze this image", "describe this picture", "what's in this image", or any request requiring visual understanding of images. Provides image analysis using Zhipu GLM-4.6V multimodal model.
version	0.1.0

GLM Vision - 图片分析技能

概述

使用智谱 GLM-4.6V 多模态模型分析图片内容。支持：

图片内容描述和理解
图片中的文字提取（OCR）
图片元素识别和分析
多图对比分析
视频内容理解

使用场景

当用户发送图片并提出以下类型问题时触发：

"这张图片里有什么？"
"请描述这张图片"
"分析这个截图"
"这是什么？"
"图片里的文字是什么？"
"比较这两张图片"

API 配置

环境变量

需要设置 ZHIPU_API_KEY 环境变量：

export ZHIPU_API_KEY="your-api-key-here"

或者在 ~/.claude/settings.json 中配置：

{
  "env": {
    "ZHIPU_API_KEY": "your-api-key-here"
  }
}

模型选择

模型	用途	价格
`glm-4.6v-flash`	免费版，日常使用	免费
`glm-4.6v`	旗舰版，复杂推理	付费
`glm-4.6v-flashx`	轻量高速版	付费

默认使用 glm-4.6v-flash 免费模型。

调用方式

方式一：使用辅助脚本（推荐）

python3 ~/.claude/skills/glm-vision/scripts/analyze_image.py \
  --image "/path/to/image.jpg" \
  --prompt "请描述这张图片" \
  [--model "glm-4.6v-flash"]

参数说明：

--image / -i: 图片路径（支持 URL 或本地文件）
--prompt / -p: 分析提示词
--model / -m: 模型名称（可选，默认免费版）
--detail / -d: 启用详细思考模式

方式二：直接 API 调用

import base64
from openai import OpenAI

client = OpenAI(
    api_key=os.environ.get("ZHIPU_API_KEY"),
    base_url="https://open.bigmodel.cn/api/paas/v4"
)

# 读取本地图片
with open("image.jpg", "rb") as f:
    img_base64 = base64.b64encode(f.read()).decode("utf-8")

response = client.chat.completions.create(
    model="glm-4.6v-flash",
    messages=[{
        "role": "user",
        "content": [
            {"type": "image_url", "image_url": {"url": img_base64}},
            {"type": "text", "text": "请描述这张图片"}
        ]
    }]
)
print(response.choices[0].message.content)

方式三：URL 图片

response = client.chat.completions.create(
    model="glm-4.6v-flash",
    messages=[{
        "role": "user",
        "content": [
            {"type": "image_url", "image_url": {"url": "https://example.com/image.jpg"}},
            {"type": "text", "text": "分析这张图片"}
        ]
    }]
)

工作流程

接收图片: 用户发送图片，保存到临时目录
构建请求: 将图片转为 base64 或使用 URL
调用 API: 发送到 GLM-4.6V 模型
返回结果: 解析并展示分析结果

支持的图片格式

JPEG / JPG
PNG
GIF（静态）
WebP
BMP

示例用法

基础图片描述

python3 ~/.claude/skills/glm-vision/scripts/analyze_image.py \
  -i "/tmp/screenshot.png" \
  -p "描述这张截图的内容"

详细分析

python3 ~/.claude/skills/glm-vision/scripts/analyze_image.py \
  -i "/tmp/photo.jpg" \
  -p "分析这张图片的构图、色彩和主题" \
  --detail

OCR 文字提取

python3 ~/.claude/skills/glm-vision/scripts/analyze_image.py \
  -i "/tmp/document.png" \
  -p "提取图片中的所有文字，保持原有格式"

URL 图片分析

python3 ~/.claude/skills/glm-vision/scripts/analyze_image.py \
  -i "https://example.com/image.jpg" \
  -p "这张图片展示的是什么？"

注意事项

API Key 安全: 不要在代码中硬编码 API Key
图片大小: 建议图片小于 10MB，过大的图片会增加延迟
网络连接: 需要稳定的网络访问智谱 API
并发限制: 免费用户并发数为 5

参考资源

references/api-reference.md - 完整 API 参考文档
scripts/analyze_image.py - 图片分析辅助脚本

glm-vision

GLM Vision - 图片分析技能

概述

使用场景

API 配置

环境变量

模型选择

调用方式

方式一：使用辅助脚本（推荐）

方式二：直接 API 调用

方式三：URL 图片

工作流程

支持的图片格式

示例用法

基础图片描述

详细分析

OCR 文字提取

URL 图片分析

注意事项

参考资源

Más de este repositorio

Más de este repositorio

GLM Vision - 图片分析技能

概述

使用场景

API 配置

环境变量

模型选择

调用方式

方式一：使用辅助脚本（推荐）

方式二：直接 API 调用

方式三：URL 图片

工作流程

支持的图片格式

示例用法

基础图片描述

详细分析

OCR 文字提取

URL 图片分析

注意事项

参考资源