تشغيل أي مهارة في Manus
بنقرة واحدة
بنقرة واحدة
تشغيل أي مهارة في Manus بنقرة واحدة
ابدأ الآن$pwd:
$ git log --oneline --stat
stars:٣١٢
forks:٣٦
updated:٢ مايو ٢٠٢٦ في ١٧:٢٦
مستكشف الملفات
SKILL.md
Extract text from PDF documents
Extract transcripts from YouTube videos via the YouTube caption system
Search the web and ingest results as wiki pages
Fetch and extract text from web URLs
Extract text from Microsoft Word documents
Extract data from Excel and CSV files
| name | image |
| version | 1.0 |
| description | Extract text from images using a vision LLM |
| entry | {"script":"scripts/main.py","class":"ImageSkill"} |
| triggers | {"extensions":[".png",".jpg",".jpeg",".webp",".gif",".tiff"],"intents":["image","screenshot","diagram","photo"]} |
| requires | [] |
| author | axoviq.com |
| license | AGPL-3.0-or-later |
Base64-encodes the image and returns it in metadata for the IngestAgent to
process via a vision-capable LLM. The text field is left empty at extract
time and filled in by the agent.
.png, .jpg, .jpeg, .webp, .gif, or .tiffimage, screenshot, diagram, photo