Skip to main content
تشغيل أي مهارة في Manus
بنقرة واحدة

deepseek-vision

// Use whenever the user references an image (local file path or http/https URL — screenshot, photo, diagram, UI capture, chart, error dialog) and you need to know what's in it to answer or act. Calls a vision model (Qwen3.6-Flash by default) via DashScope and returns a text description you can reason over. Especially important when running on a text-only backend like DeepSeek V4, but also useful as a dedicated OCR / detail extractor even when the main model is multimodal.

$ git log --oneline --stat
stars:٤٩
forks:٦
updated:٢٩ أبريل ٢٠٢٦ في ٠٢:٠٨
مستكشف الملفات
2 ملفات
SKILL.md
readonly