Skip to main content
Run any Skill in Manus
with one click
$pwd:

glmv-caption

// Generate captions (descriptions) for images, videos, and documents using ZhiPu GLM-V multimodal model series. Use this skill whenever the user wants to describe, caption, summarize, or interpret the content of images, videos, or files. Supports single/multiple inputs, URLs, local paths, and base64 (images only).

$ git log --oneline --stat
stars:2,319
forks:169
updated:March 30, 2026 at 08:42
File Explorer
2 files
SKILL.md
readonly