Skip to main content
Run any Skill in Manus
with one click

ai-multimodal

Analyze images/audio/video with Gemini API (better vision than Claude). Generate images (Imagen 4), videos (Veo 3). Use for vision analysis, transcription, OCR, design extraction, multimodal AI.

Overview

Analyze images/audio/video with Gemini API (better vision than Claude). Generate images (Imagen 4), videos (Veo 3). Use for vision analysis, transcription, OCR, design extraction, multimodal AI.

Install command
npx skills add https://github.com/hoanghd218/landing-page-ai-kientruc-a-Ninh --skill ai-multimodal

Copy and paste this command into Claude Code to install the skill

Stars0
Forks0
UpdatedJanuary 28, 2026 at 14:46
File Explorer
19 files
SKILL.md
readonly