Name: Mmskills
Author: DeepExperience


Stars: 316
Forks: 22
Updated: May 18, 2026 at 12:06


name	mmskills
description	Use MMSkills for GUI, computer-use, visual-agent, OSWorld, Ubuntu desktop, macOS, Minecraft, or Mario tasks where reusable multimodal skill packages can guide planning, state recognition, or verification. Also use when installing, downloading, searching, or adapting MMSkills packages for Codex, OpenClaw, Claude Code, or other agent products.

MMSkills Agent Adapter

MMSkills Agent Adapter is a unified adapter for locating and using multimodal skill packages from the public MMSkills dataset. The same skill package schema should be used across Codex, OpenClaw, Claude Code, and other agent products; product-specific integration should stay in thin adapters, not in duplicated skill content.

Asset Source

The full public skill library is hosted on Hugging Face:

Dataset: https://huggingface.co/datasets/zhangkangning/mmskills
Paper page: https://huggingface.co/papers/2605.13527
Project website: https://deepexperience.github.io/MMSkills/skills.html

The dataset currently exposes 515 packages across Ubuntu, macOS, VAB-Minecraft, and Mario. Do not assume all packages are installed locally. Search and download only the relevant packages when needed.

Quick Commands

Run these commands from this skill directory:

python scripts/search_skills.py "chrome bookmark" --package ubuntu
python scripts/download_skill.py ubuntu/chrome/CHROME_Manage_Bookmarks_Reading_List_And_Shortcuts
python scripts/inspect_skill.py ~/.cache/mmskills/skills/ubuntu/chrome/CHROME_Manage_Bookmarks_Reading_List_And_Shortcuts

Use MMSKILLS_HOME=/path/to/cache to store downloaded packages outside the default ~/.cache/mmskills.
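
As a rough illustration of the cache override, the sketch below resolves the cache root and the expected local path of a package. The function names are hypothetical and are not taken from the MMSkills scripts. It also assumes that the skills/ subdirectory seen in the inspect_skill.py example is kept when MMSKILLS_HOME is set, which the source does not confirm.

```python
import os
from pathlib import Path

# Default cache root named in the text above.
DEFAULT_HOME = Path("~/.cache/mmskills")


def resolve_cache_home(env=None) -> Path:
    """Return MMSKILLS_HOME when it is set and non-empty, else the default root."""
    env = os.environ if env is None else env
    override = env.get("MMSKILLS_HOME", "").strip()
    return Path(override).expanduser() if override else DEFAULT_HOME.expanduser()


def local_skill_path(skill_id: str, env=None) -> Path:
    """Map an id such as 'ubuntu/chrome/<skill>' to its expected cache location.

    Assumption: packages live under '<cache root>/skills/', mirroring the
    path used in the inspect_skill.py example.
    """
    return resolve_cache_home(env) / "skills" / Path(skill_id)


def needs_download(skill_id: str, env=None) -> bool:
    """Treat a package as missing when its SKILL.md is not on disk."""
    return not (local_skill_path(skill_id, env) / "SKILL.md").is_file()
```

A caller could check needs_download(...) first and only then invoke scripts/download_skill.py, which keeps the "download only when needed" rule in one place.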

Use Workflow

1. Infer the task surface: package, platform, app, and operation family.
2. Search the dataset index with scripts/search_skills.py.
3. Download the best-matching package with scripts/download_skill.py if it is not already local.
4. Read the package in this order:
   - SKILL.md for procedure, applicability, preconditions, and failure modes.
   - runtime_state_cards.json for compact state cues and verification signals.
   - Images/ only when visual grounding, state comparison, or UI verification is needed.
5. Use the package as guidance, not as a hard coordinate script. Transfer the operational pattern, visible cues, and verification logic to the live task state.
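
The reading order above could be sketched roughly as follows. This is a minimal illustration, not the project's loader: load_skill_guidance and the keys of the returned dictionary are hypothetical, and the JSON file is parsed generically because its schema is not described here.

```python
import json
from pathlib import Path


def load_skill_guidance(skill_dir, need_visuals: bool = False) -> dict:
    """Read a downloaded package in the recommended order.

    Every file is treated as optional, and Images/ is only listed
    when the caller explicitly asks for visual references.
    """
    skill_dir = Path(skill_dir)
    guidance = {"procedure": None, "state_cards": None, "images": []}

    # 1. Procedure, applicability, preconditions, failure modes.
    skill_md = skill_dir / "SKILL.md"
    if skill_md.is_file():
        guidance["procedure"] = skill_md.read_text(encoding="utf-8")

    # 2. Compact state cues and verification signals (schema not assumed).
    cards = skill_dir / "runtime_state_cards.json"
    if cards.is_file():
        guidance["state_cards"] = json.loads(cards.read_text(encoding="utf-8"))

    # 3. Visual references, deferred until they are actually needed.
    images = skill_dir / "Images"
    if need_visuals and images.is_dir():
        guidance["images"] = sorted(p for p in images.iterdir() if p.is_file())

    return guidance
```

Deferring the Images/ listing keeps text-only runs cheap, and the caller can call the function again with need_visuals=True once a visual check becomes necessary.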

Cross-Agent Contract

MMSkills packages are product-neutral. Each package may contain:

<package>/<domain>/<skill>/
├── SKILL.md
├── runtime_state_cards.json
├── plan.json
└── Images/

Codex, OpenClaw, and Claude Code should share this package format. The only product-specific pieces should be:

- where packages are cached;
- which tool opens images or screenshots;
- how the agent routes retrieved guidance back into its planner;
- whether the product can use visual references directly or should fall back to text-only cues.

Read references/integration_targets.md before implementing a new product adapter. Read references/asset_sources.md when changing dataset URLs, cache locations, or download behavior.
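
One way to picture such a thin adapter is a small record that holds only the four product-specific pieces listed above. The sketch is illustrative: ProductAdapter, deliver, and all field names are hypothetical and do not come from references/integration_targets.md.

```python
from dataclasses import dataclass
from pathlib import Path
from typing import Callable, List


@dataclass
class ProductAdapter:
    """Holds only what differs between agent products (hypothetical shape)."""
    cache_dir: Path                            # where packages are cached
    open_image: Callable[[Path], object]       # tool that opens images or screenshots
    route_to_planner: Callable[[str], None]    # how guidance reaches the planner
    supports_visuals: bool                     # False means text-only fallback


def deliver(adapter: ProductAdapter, procedure_text: str, image_paths: List[Path]) -> list:
    """Send text guidance to the planner, then open images if the product can use them."""
    adapter.route_to_planner(procedure_text)
    if not adapter.supports_visuals:
        return []  # fall back to text-only cues
    return [adapter.open_image(path) for path in image_paths]
```

Because the package content never passes through product-specific code paths other than these four hooks, the same package can serve every product unchanged.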

When Visual Evidence Is Useful

Load visual references when:

- the task depends on recognizing a specific UI state, dialog, control, or result surface;
- text-only procedure is ambiguous;
- the package has a verification state that matches the current screenshot;
- the agent needs a recovery cue after a failed or uncertain GUI action.

Prefer focus crops for local control recognition and full frames for global state recognition.
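
The conditions above amount to a simple any-of check, followed by a choice between crop and full frame. The sketch below encodes that reading. The class, field, and return-value names are hypothetical, and how an agent detects each signal is left open.

```python
from dataclasses import dataclass


@dataclass
class TaskSignals:
    """One flag per condition in the list above (hypothetical names)."""
    needs_ui_state_recognition: bool = False
    text_procedure_ambiguous: bool = False
    verification_state_matches_screenshot: bool = False
    recovering_from_uncertain_action: bool = False


def should_load_visuals(signals: TaskSignals) -> bool:
    """Load visual references when at least one condition holds."""
    return any((
        signals.needs_ui_state_recognition,
        signals.text_procedure_ambiguous,
        signals.verification_state_matches_screenshot,
        signals.recovering_from_uncertain_action,
    ))


def preferred_reference(scope: str) -> str:
    """Pick a focus crop for a local control, a full frame for global state."""
    if scope == "local":
        return "focus_crop"
    if scope == "global":
        return "full_frame"
    raise ValueError(f"unknown scope: {scope!r}")
```

With all flags left at their defaults, the check returns False, so a plain text-guided step never triggers image loading.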

Related skills from the same repository (DeepExperience/MMSkills):

- configure-default-search-engine-and-search-preferences.md (2026-05-07): Use Chrome search-engine settings and Google Search Settings surfaces to change providers or inspect search-display preferences without inventing controls that are not visible.
- manage-bookmarks-reading-list-and-shortcuts.md (2026-05-07): Save the current Chrome page, choose or create a destination, and verify the saved result. Use the image cards only when the current UI genuinely matches the bookmark states they show.
- search-web-and-open-target-result.md (2026-05-07): Use Google search in Chrome to submit a query, choose the intended result, and verify that the browser has opened the requested destination page instead of a similarly named result. This skill stops at the landing page and does not cover deep in-site browsing or long-page extraction.
- gimp-gimp-save-projects-and-export-edited-images.md (2026-05-07): Save editable GIMP workfiles and preserve layered project state before handing off to export when needed.
- create-chart-on-target-sheet-with-exact-title-and-type.md (2026-05-07): Create a chart on the requested destination sheet while honoring an exact chart title and chart family.
- sort-and-filter-calc-tables.md (2026-05-07): Apply sorts, auto-filters, and range filters to Calc tables and verify the visible filtered rows or ordering on the sheet.

Repository: DeepExperience/MMSkills (author: DeepExperience)


