$pwd:

glmv-grounding

// A skill built on GLM-V's native grounding capabilities, covering coordinate conversion, bounding-box visualization, and more. GLM-V native grounding locates any target specified by the prompt in an image and outputs relative coordinates normalized to the 0-1000 range based on the image size. Supported coordinate formats are 2D bounding boxes (default), 2D points, and 3D bounding boxes. GLM-V also supports spatiotemporal localization and tracking of multiple prompt-specified targets in videos, outputting 2D bounding boxes for each second of video.
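
To make the coordinate conversion mentioned in the description concrete, here is a minimal sketch of mapping a 0-1000 normalized 2D bounding box back to pixel coordinates. The function name `to_pixel_box` and the `[x1, y1, x2, y2]` corner ordering are assumptions for illustration, not taken from the skill's own code; check the model output format before relying on the ordering.

```python
def to_pixel_box(box, width, height, scale=1000):
    """Convert a box normalized to 0..scale into pixel coordinates.

    Assumes the box is ordered [x1, y1, x2, y2] (hypothetical; verify
    against the actual model output).
    """
    x1, y1, x2, y2 = box

    def convert(value, size):
        # Scale to pixels, then clamp to the image bounds.
        pixel = round(value * size / scale)
        return max(0, min(size, pixel))

    return (
        convert(x1, width),
        convert(y1, height),
        convert(x2, width),
        convert(y2, height),
    )


if __name__ == "__main__":
    # A box covering 10%-50% of the width and 20%-80% of the height
    # of a 1920x1080 image.
    print(to_pixel_box([100, 200, 500, 800], 1920, 1080))
```

Because the normalized values are relative to the image size, the same model output can be mapped onto any resized copy of the image by passing that copy's width and height.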

$ git log --oneline --stat
stars: 2,319
forks: 169
updated: March 30, 2026 at 08:42