with one click
markitdown-skill
// OpenClaw agent skill for converting documents to Markdown. Documentation and utilities for Microsoft's MarkItDown library. Supports PDF, Word, PowerPoint, Excel, images (OCR), audio (transcription), HTML, YouTube.
// OpenClaw agent skill for converting documents to Markdown. Documentation and utilities for Microsoft's MarkItDown library. Supports PDF, Word, PowerPoint, Excel, images (OCR), audio (transcription), HTML, YouTube.
| name | markitdown-skill |
| description | OpenClaw agent skill for converting documents to Markdown. Documentation and utilities for Microsoft's MarkItDown library. Supports PDF, Word, PowerPoint, Excel, images (OCR), audio (transcription), HTML, YouTube. |
| metadata | {"openclaw":{"emoji":"📄","homepage":"https://github.com/karmanverma/markitdown-skill","requires":{"bins":["python3","pip","markitdown"]},"install":[{"id":"markitdown","kind":"pip","package":"markitdown[all]","bins":["markitdown"],"label":"Install MarkItDown CLI (pip)"}]}} |
Documentation and utilities for converting documents to Markdown using Microsoft's MarkItDown library.
Note: This skill provides documentation and a batch script. The actual conversion is done by the
markitdownCLI/library installed via pip.
Use markitdown for:
# Convert file to markdown
markitdown document.pdf -o output.md
# Convert URL
markitdown https://example.com/docs -o docs.md
| Format | Features |
|---|---|
| Text extraction, structure | |
| Word (.docx) | Headings, lists, tables |
| PowerPoint | Slides, text |
| Excel | Tables, sheets |
| Images | OCR + EXIF metadata |
| Audio | Speech transcription |
| HTML | Structure preservation |
| YouTube | Video transcription |
The skill requires Microsoft's markitdown CLI:
pip install 'markitdown[all]'
Or install specific formats only:
pip install 'markitdown[pdf,docx,pptx]'
markitdown https://github.com/user/repo/blob/main/README.md -o readme.md
markitdown document.pdf -o document.md
# Using included script
python ~/.openclaw/skills/markitdown/scripts/batch_convert.py docs/*.pdf -o markdown/ -v
# Or shell loop
for file in docs/*.pdf; do
markitdown "$file" -o "${file%.pdf}.md"
done
from markitdown import MarkItDown
md = MarkItDown()
result = md.convert("document.pdf")
print(result.text_content)
pip install 'markitdown[all]'
# Ubuntu/Debian
sudo apt-get install tesseract-ocr
# macOS
brew install tesseract
| Component | Source |
|---|---|
markitdown CLI | Microsoft's pip package |
markitdown Python API | Microsoft's pip package |
scripts/batch_convert.py | This skill (utility) |
| Documentation | This skill |
Manage QQ Mail (QQ邮箱) via IMAP/SMTP. Use when the user wants to read, send, search, or manage emails from their QQ mailbox. Supports reading inbox, sending emails with attachments, searching by subject/sender/date, and listing folders.
用于查询国内城市实时天气,当用户询问天气相关问题时自动触发