Name: Mineru
Author: xpert-ai

Convert documents (PDF, Word, PPT, images, HTML) to Markdown using the MinerU cloud API. Use this skill whenever the user wants to parse, extract, or convert a document into Markdown or other text formats, whether from a local file or a URL. Also trigger when the user mentions MinerU, asks to read or extract a PDF/doc/ppt, wants OCR on a scanned document, or needs structured text from any supported file type.

Stars: 6

Forks: 10

Updated: March 25, 2026 at 06:13

MinerU Document Converter

Convert documents to Markdown (and optionally Docx/HTML/LaTeX) via the MinerU cloud API. This skill wraps a Python CLI script that handles the full workflow: submit, poll, and download results.

Supported file types

PDF, DOC, DOCX, PPT, PPTX, PNG, JPG, JPEG, WebP, GIF, BMP, HTML

How to use

Run the CLI script through sandbox_shell with Python. The script path inside the sandbox is:

python3 /workspace/.xpert/skills/mineru-cli/scripts/mineru.py

Converting a URL

python3 /workspace/.xpert/skills/mineru-cli/scripts/mineru.py --url "https://example.com/paper.pdf"

Converting a local file

python3 /workspace/.xpert/skills/mineru-cli/scripts/mineru.py --file /path/to/document.pdf

Choosing the right model

The --model flag selects the parsing engine (only applies to the precise API):

Model	Best for	Notes
`pipeline`	General documents (default)	Fast, good baseline
`vlm`	Complex layouts, mixed content	Recommended for best quality
`MinerU-HTML`	HTML-heavy documents	Specialized for web content

The script defaults to pipeline. For best results, pass --model vlm unless the user has a reason to prefer speed over quality.

Common options

Flag	Purpose	Example
`--url URL`	Parse a document from a URL	`--url "https://..."`
`--file PATH`	Parse a local file	`--file ./report.pdf`
`--model MODEL`	Select model (`pipeline`, `vlm`, `MinerU-HTML`)	`--model vlm`
`--ocr`	Enable OCR for scanned documents	`--ocr`
`--pages RANGE`	Parse specific pages only	`--pages 1-10`
`--language LANG`	Document language (default: `ch`)	`--language en`
`--formats FMT...`	Additional output formats (precise API only)	`--formats docx html`
`--agent`	Force using the lightweight API	`--agent`
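
When the command is assembled programmatically rather than typed by hand, the flags in the table above can be mapped onto an argument list. The following is a minimal sketch, not part of the skill itself: the helper name `build_mineru_command` is hypothetical, and treating `--url` and `--file` as mutually exclusive is an assumption the document does not state.

```python
import shlex
from typing import List, Optional, Sequence

# Script path as given in this document.
MINERU_SCRIPT = "/workspace/.xpert/skills/mineru-cli/scripts/mineru.py"
# Model names as listed in this document.
MODELS = ("pipeline", "vlm", "MinerU-HTML")


def build_mineru_command(
    url: Optional[str] = None,
    file: Optional[str] = None,
    model: Optional[str] = None,
    ocr: bool = False,
    pages: Optional[str] = None,
    language: Optional[str] = None,
    formats: Sequence[str] = (),
    agent: bool = False,
) -> List[str]:
    """Assemble an argv list using only the flags documented above."""
    # Assumption: exactly one input source is expected per run.
    if (url is None) == (file is None):
        raise ValueError("pass exactly one of url or file")
    if model is not None and model not in MODELS:
        raise ValueError(f"unknown model: {model!r}")

    argv = ["python3", MINERU_SCRIPT]
    argv += ["--url", url] if url is not None else ["--file", file]
    if model:
        argv += ["--model", model]
    if ocr:
        argv.append("--ocr")
    if pages:
        argv += ["--pages", pages]
    if language:
        argv += ["--language", language]
    if formats:
        argv += ["--formats", *formats]
    if agent:
        argv.append("--agent")
    return argv


# Render the command as a single shell-safe string for sandbox_shell.
cmd = build_mineru_command(file="./book.pdf", pages="1-5", formats=["docx"])
print(shlex.join(cmd))
```

Building a list and quoting it with `shlex.join` keeps paths or URLs that contain spaces or shell metacharacters from being split or interpreted by the shell.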

Examples

# Best quality conversion of a local PDF
python3 /workspace/.xpert/skills/mineru-cli/scripts/mineru.py --file ./report.pdf --model vlm

# OCR a scanned document
python3 /workspace/.xpert/skills/mineru-cli/scripts/mineru.py --file ./scan.pdf --ocr --model vlm

# Convert first 5 pages only, also produce a docx
python3 /workspace/.xpert/skills/mineru-cli/scripts/mineru.py --file ./book.pdf --pages 1-5 --formats docx

# Parse from URL with English language hint
python3 /workspace/.xpert/skills/mineru-cli/scripts/mineru.py --url "https://arxiv.org/pdf/xxx" --language en --model vlm

# Force lightweight API (no token needed, but 10MB / 20 page limit)
python3 /workspace/.xpert/skills/mineru-cli/scripts/mineru.py --file ./small.pdf --agent

API selection logic

The script uses two MinerU APIs with automatic fallback:

Precise Parsing API (primary). Uses a MinerU token. The script checks MINERU_TOKEN, then MINERU_TOKEN_FILE, then the middleware-managed secret file. Supports files up to 200MB / 600 pages, formula and table detection, multiple output formats, and the VLM model.
Agent Lightweight API (fallback). No token needed, but limited to 10MB / 20 pages, Markdown-only output, and no formula or table detection.

If this middleware has an API token configured, it securely provisions the token inside the sandbox and the script reads it automatically. Do not hardcode secrets in the command. If no token is configured, let the user know the script will fall back to the lightweight API and its limits.
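
The lookup order and fallback described above can be illustrated with a short sketch. This is not the script's actual code: the function names are hypothetical, the location of the middleware-managed secret file is not given in this document (so it is passed in by the caller), and reading "10MB" as 10 * 1024 * 1024 bytes is an assumption.

```python
import os
from pathlib import Path
from typing import Mapping, Optional

# Lightweight API limits as stated in this document.
# Assumption: "10MB" means 10 * 1024 * 1024 bytes.
AGENT_MAX_BYTES = 10 * 1024 * 1024
AGENT_MAX_PAGES = 20


def read_token_file(path: Path) -> Optional[str]:
    """Return the stripped file content, or None if unreadable or empty."""
    try:
        token = path.read_text(encoding="utf-8").strip()
    except (OSError, UnicodeDecodeError):
        return None
    return token or None


def resolve_token(
    env: Mapping[str, str], secret_file: Optional[Path] = None
) -> Optional[str]:
    """Check MINERU_TOKEN, then MINERU_TOKEN_FILE, then the secret file."""
    token = env.get("MINERU_TOKEN", "").strip()
    if token:
        return token
    token_file = env.get("MINERU_TOKEN_FILE", "").strip()
    if token_file:
        token = read_token_file(Path(token_file))
        if token:
            return token
    # secret_file stands in for the middleware-managed file (path unknown here).
    return read_token_file(secret_file) if secret_file is not None else None


def choose_api(token: Optional[str], force_agent: bool = False) -> str:
    """Return "precise" when a token is available, otherwise "agent"."""
    return "agent" if force_agent or not token else "precise"


def within_agent_limits(size_bytes: int, pages: Optional[int] = None) -> bool:
    """True if the input fits the lightweight API limits (pages optional)."""
    if size_bytes > AGENT_MAX_BYTES:
        return False
    return pages is None or pages <= AGENT_MAX_PAGES


print("API:", choose_api(resolve_token(os.environ)))
```

A check like `within_agent_limits` is useful before a run without a token, so the user can be told about the lightweight limits up front instead of after a failed request.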

Output location

All results are saved under a per-run directory in the current working directory:

Local files use mineru_{original file name without extension} such as mineru_report/
URL inputs use the URL file name when available
If a URL has no usable file name, the script falls back to mineru_{task_id}/
If the directory already exists, the script appends _2, _3, and so on

After conversion completes, the script prints the actual saved directory and file paths.
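
The naming rules above can be sketched as follows. This is an illustration under stated assumptions, not the script's implementation: the function names are hypothetical, and it assumes that URL-derived names also get the mineru_ prefix and drop their extension, which the document does not spell out.

```python
from pathlib import Path
from typing import Optional
from urllib.parse import urlparse


def output_dir_name(
    file: Optional[str] = None, url: Optional[str] = None, task_id: str = ""
) -> str:
    """Derive the per-run directory name from the input source."""
    if file is not None:
        return f"mineru_{Path(file).stem}"
    # Assumption: URL file names are handled like local names (prefix, no extension).
    name = Path(urlparse(url or "").path).stem
    return f"mineru_{name}" if name else f"mineru_{task_id}"


def unique_dir(parent: Path, name: str) -> Path:
    """Append _2, _3, ... until the path does not exist yet."""
    candidate = parent / name
    counter = 2
    while candidate.exists():
        candidate = parent / f"{name}_{counter}"
        counter += 1
    return candidate


print(output_dir_name(file="./report.pdf"))  # mineru_report
```

Because of the numeric suffixes, the directory of a given run cannot be predicted from the input name alone, which is why the path printed by the script should be treated as authoritative.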

After conversion

Check the script output for the actual mineru_* output directory and saved file paths
Read the resulting .md file and present the content to the user
If the result is a .zip file from the precise API, mention where it was saved
If extra formats were requested, mention those files too
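
One way to carry out the checklist above is to scan the reported output directory and group what was saved. The sketch below is only an illustration: `summarize_output` is a hypothetical helper, and the layout inside the directory is not described in this document, so the scan is recursive.

```python
from pathlib import Path
from typing import Dict, List


def summarize_output(out_dir: Path) -> Dict[str, List[Path]]:
    """Group saved files into Markdown, zip archives, and everything else."""
    files = sorted(p for p in out_dir.rglob("*") if p.is_file())
    summary: Dict[str, List[Path]] = {"markdown": [], "archives": [], "other": []}
    for path in files:
        suffix = path.suffix.lower()
        if suffix == ".md":
            summary["markdown"].append(path)
        elif suffix == ".zip":
            summary["archives"].append(path)
        else:
            # Extra formats such as .docx or .html end up here.
            summary["other"].append(path)
    return summary
```

Typical use would be to call `summarize_output(Path("mineru_report"))` with the directory printed by the script, read the first entry under "markdown", and mention any paths under "archives" and "other" to the user.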

Troubleshooting

MINERU_TOKEN not set: the precise API needs a token, otherwise the script falls back to the lightweight API
Warning: unable to read MINERU_TOKEN_FILE: the explicit token file path is unreadable, so the script continues without a token
Timeout: large documents can take several minutes; the script polls for up to 10 minutes
File too large for lightweight API: ask the user to configure MINERU_TOKEN or narrow --pages
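
The timeout behavior mentioned above follows a common poll-with-deadline pattern, sketched here for orientation. This is not the script's code: `check` is a hypothetical callable standing in for whatever status request the script makes, and the 5-second interval is an assumption. Only the 10-minute ceiling comes from this document.

```python
import time
from typing import Callable, Optional


def poll_until_done(
    check: Callable[[], Optional[str]],
    timeout_s: float = 600.0,  # 10 minutes, as stated in this document
    interval_s: float = 5.0,   # assumption: the real interval is not documented
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> str:
    """Call check() until it returns a result or the deadline passes."""
    deadline = clock() + timeout_s
    while True:
        result = check()
        if result is not None:
            return result
        if clock() >= deadline:
            raise TimeoutError(f"no result after {timeout_s:.0f} seconds")
        sleep(interval_s)
```

Using a monotonic clock for the deadline keeps the loop correct even if the system time is adjusted while a large document is being processed.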

package.json

"author": "xpert-ai"

"repository": "xpert-ai/xpert-plugins"
