一键导入
grok-image-to-video
// Animate a local image into a short mp4 video through Hermes Web UI using xAI Grok Imagine.
// Animate a local image into a short mp4 video through Hermes Web UI using xAI Grok Imagine.
Generate or edit images through Hermes Web UI using the selected/requested profile's fun-codex provider from config.yaml.
Create rich diagrams, data visualizations, technical architecture views, and editorial content cards directly in Markdown using the Markdown Viewer Agent Skills pack. Use for Mermaid-like diagram requests, PlantUML architecture diagrams, Vega charts, JSON Canvas maps, infographics, UML, cloud/network/security/data/IoT diagrams, and polished Markdown documentation visuals.
Create editable AI video projects with Remotion and React, then preview and render them to MP4. Use for vertical short videos, product demos, story-driven animations, HUD/tech visuals, feed ads, tutorial videos, subtitles, voiceover, sound effects, and code-based video iteration.
Create AI videos with HyperFrames in Hermes using HTML, CSS, and JavaScript compositions, then validate and render them to MP4. Use for short video intros, cinematic trailers, product promos, subtitle animations, HUD/tech visuals, web-to-video work, and motion graphics.
| name | grok-image-to-video |
| description | Animate a local image into a short mp4 video through Hermes Web UI using xAI Grok Imagine. |
| version | 1.0.0 |
| author | Ekko |
| license | MIT |
| platforms | ["linux","macos","windows"] |
| metadata | {"hermes":{"tags":["xAI","Grok","image-to-video","video-generation","media"]}} |
| prerequisites | {"commands":["curl"]} |
Use this skill when the user wants to animate a local image into a short video with xAI Grok Imagine.
Do not use any built-in image or video generation tool as a fallback. If the Hermes Web UI endpoint returns 401, 403, connection failure, or any other error, stop and report the Hermes Web UI error to the user.
Call the local Hermes Web UI media endpoint. Pass a local image path; the server will check for xAI credentials, read the file, convert it to a base64 data URI, call xAI, poll until completion, and optionally save the generated mp4.
Endpoint:
POST <Hermes Web UI base URL>/api/hermes/media/grok-image-to-video
Resolve the Hermes Web UI base URL in this order:
HERMES_WEB_UI_URL environment variable, if set.http://127.0.0.1:${PORT}, if PORT is set.http://127.0.0.1:8648 for local development.When Hermes Web UI is running from the provided Docker Compose setup, the default external URL is http://127.0.0.1:6060.
Authentication:
The endpoint is protected by Hermes Web UI auth. Always send the Hermes Web UI server bearer token. This token is accepted only by Hermes Web UI media generation endpoints for agent skills; it is not a general Web UI login token.
Resolve the token in this order:
AUTH_TOKEN environment variable, if set.${HERMES_WEB_UI_HOME}/.token, if HERMES_WEB_UI_HOME is set.${HERMES_WEBUI_STATE_DIR}/.token, if HERMES_WEBUI_STATE_DIR is set.~/.hermes-web-ui/.token.Profile selection:
Use the current Hermes profile from the run instructions by sending X-Hermes-Profile.
If the run instructions include [Current Hermes profile: <name>], include:
-H "X-Hermes-Profile: <name>"
Replace <name> with the exact profile name from the run instructions. Never send a placeholder value such as <name> or <current-hermes-profile>.
If no current profile is provided, omit the header and let the server fall back to the current Hermes active profile.
Required JSON fields:
image_path: local path to a png, jpeg, or webp image.prompt: motion and style instructions for the generated video.Optional JSON fields:
duration: seconds, 1 to 15. Defaults to 8.output_path: local path where the server should save the mp4. If omitted, the server saves to ${HERMES_WEB_UI_HOME:-~/.hermes-web-ui}/media/<request_id>.mp4 and creates the media directory if needed.timeout_ms: maximum wait time. Defaults to 600000.Example:
TOKEN="${AUTH_TOKEN:-}"
if [ -z "$TOKEN" ] && [ -n "${HERMES_WEB_UI_HOME:-}" ] && [ -f "$HERMES_WEB_UI_HOME/.token" ]; then
TOKEN="$(cat "$HERMES_WEB_UI_HOME/.token")"
fi
if [ -z "$TOKEN" ] && [ -n "${HERMES_WEBUI_STATE_DIR:-}" ] && [ -f "$HERMES_WEBUI_STATE_DIR/.token" ]; then
TOKEN="$(cat "$HERMES_WEBUI_STATE_DIR/.token")"
fi
if [ -z "$TOKEN" ] && [ -f "$HOME/.hermes-web-ui/.token" ]; then
TOKEN="$(cat "$HOME/.hermes-web-ui/.token")"
fi
if [ -z "$TOKEN" ]; then
echo "Missing Hermes Web UI token. Check AUTH_TOKEN, HERMES_WEB_UI_HOME, HERMES_WEBUI_STATE_DIR, or ~/.hermes-web-ui/.token." >&2
exit 1
fi
BASE_URL="${HERMES_WEB_UI_URL:-}"
if [ -z "$BASE_URL" ]; then
BASE_URL="http://127.0.0.1:${PORT:-8648}"
fi
BASE_URL="${BASE_URL%/}"
curl -sS -X POST "$BASE_URL/api/hermes/media/grok-image-to-video" \
-H "Authorization: Bearer $TOKEN" \
-H 'Content-Type: application/json' \
-d '{
"image_path": "/absolute/path/to/input.png",
"prompt": "Animate the subject with a slow cinematic push-in and subtle natural motion.",
"duration": 8,
"output_path": "/absolute/path/to/output.mp4"
}'
If the response has code: "missing_xai_token", tell the user to set XAI_API_KEY or complete xAI OAuth login in Hermes Web UI before retrying.
Return the generated output_path.