一键在 Manus 中运行任何 Skill

camscanner-pdf2markdown

星标2

分支0

更新时间2026年3月30日 08:28

Use CamScanner to convert PDF documents to Markdown format, powered by a high-precision document parsing engine that intelligently decomposes paragraphs, precisely recognizes tables and multiple element types, and outputs structured results in reading order, empowering large language models to accurately understand document content. Use when the user wants to convert PDF files to Markdown, extract content, summarize, or process PDFs. Triggers on "PDF to Markdown", "convert PDF to md", "extract PDF content as Markdown", or when the user has a PDF and needs it as Markdown for further editing or processing.

安装

用 Codex 或 Claude 帮你安装复制这段 Prompt，粘贴到 Codex、Claude 或其他助手里，让它检查 Skill 页面并帮你完成安装。

在 Manus 中运行

来源

camscanner-ai

camscanner-ai/CamScanner-skills

打开 GitHub 仓库查看创作者相关仓库

下载

在 Manus 中运行

相关职业SOC

基于 SOC 职业分类

综合办公文员办公室与行政支持类职业·SOC 43-9061

SKILL.md

readonly

同仓库更多 Skills

同仓库

camscanner-extract-formula

camscanner-ai/CamScanner-skills

Use CamScanner to extract formulas from images. Powered by OCR recognition engine that detects formula regions in images, crops them, and stitches into a single clean PNG image. Use when the user wants to extract mathematical formulas, equations, or expressions from images (photos, screenshots, scanned documents). Triggers on "extract formula", "get formulas from image", "crop formulas", "formula extraction", or when the user has an image containing math formulas and needs them extracted as a clean image.

2026-04-302

camscanner-image-detect-aigc

camscanner-ai/CamScanner-skills

Use CamScanner to detect whether an image was generated by an AI model (e.g. Stable Diffusion, Midjourney, DALL·E). Powered by an AIGC-detection engine that classifies an image as genuine, suspected AI-generated, or AI-generated, with a confidence score. Returns a JSON result containing `ai_check_result` (1/2/3), `confidence`, and `result_text`. Use when the user asks whether a photo is AI-generated, wants to verify an image's authenticity against AI generation, or asks "is this AI art / Stable Diffusion / Midjourney?". Triggers on "检测AI生成", "是不是AI画的", "AIGC检测", "AI图片识别", "detect AI-generated image", "is this AI art", "is this diffusion / midjourney", or when the user shares an image and asks whether it was produced by AI.

2026-04-302

camscanner-image-detect-tampering

camscanner-ai/CamScanner-skills

Use CamScanner to detect whether an image has been PS-edited, manipulated, or tampered with. Powered by a manipulation-detection engine that identifies photo-editing traces, splicing, retouching, and other signs of tampering. Returns a JSON result with `is_tampered` (boolean) and `result_text` (human-readable). Use when the user asks whether a photo is genuine, wants to verify an image's authenticity, or asks "is this PS-ed / photoshopped / edited". Triggers on "检测图片是否PS", "是否被篡改", "图片验真", "PS检测", "detect image tampering", "is this photoshopped", "check if image was edited", or when the user shares an image and asks whether it has been modified.

2026-04-302

camscanner-image-erase-handwriting

camscanner-ai/CamScanner-skills

Use CamScanner to erase handwriting from images while preserving the printed content and original layout. Powered by a high-precision image enhancement engine that intelligently detects and removes handwritten strokes, annotations, and signatures, leaving the underlying document clean and legible. Use when the user wants to remove handwriting from a photo or scan, clean up annotated documents, or restore a blank form filled by hand. Triggers on "erase handwriting", "remove handwriting from image", "clean handwritten notes", "remove annotations from scan", or when the user has an image with handwritten content that needs to be removed.

2026-04-302

camscanner-image-hd

camscanner-ai/CamScanner-skills

Use CamScanner to enhance image clarity and quality. Applies auto-crop then HD super filter to produce a cleaner, sharper image. Optionally removes moire patterns when explicitly requested. Use when the user wants to make an image clearer, sharper, higher quality, or HD. Triggers on "HD image", "enhance image clarity", "make image clearer", "sharpen image", "high definition", "improve image quality", "super filter", or when the user wants to improve the visual quality of a photo or scanned document. Only use demoire mode when the user explicitly mentions removing moire patterns.

2026-04-302

camscanner-image-remove-watermark

camscanner-ai/CamScanner-skills

Use CamScanner to remove watermarks from images while preserving the underlying content and original layout. Powered by a high-precision image enhancement engine that intelligently detects and erases overlaid watermarks, stamps, and translucent logos, leaving the underlying document clean and legible. Use when the user wants to remove watermarks from a photo or scan, clean up stamped documents, or recover a clean copy of a watermarked image. Triggers on "remove watermark", "erase watermark from image", "delete watermark", "clean watermarked scan", "unwatermark", or when the user has an image with a watermark that needs to be removed.

2026-04-302

name	camscanner-pdf2markdown
description	Use CamScanner to convert PDF documents to Markdown format, powered by a high-precision document parsing engine that intelligently decomposes paragraphs, precisely recognizes tables and multiple element types, and outputs structured results in reading order, empowering large language models to accurately understand document content. Use when the user wants to convert PDF files to Markdown, extract content, summarize, or process PDFs. Triggers on "PDF to Markdown", "convert PDF to md", "extract PDF content as Markdown", or when the user has a PDF and needs it as Markdown for further editing or processing.
metadata	{"author":"CamScanner","version":"1.0","openclaw":{"emoji":"📄","requires":{"bins":["curl","jq"]}},"homepage":"https://www.camscanner.com"}

CamScanner PDF to Markdown

Overview

CamScanner provides a high-precision document parsing engine that converts PDF documents to Markdown format. It intelligently decomposes document paragraphs, precisely recognizes tables and multiple element types, and outputs structured results in reading order — empowering large language models to accurately understand document content. The workflow is a 3-step pipeline: upload the PDF, convert it, then download the result.

When to Use

User wants to convert a PDF to Markdown
User wants to extract text/content from a PDF as Markdown
User has a PDF and needs it as Markdown for further editing or processing

Privacy & Data

Important: Privacy & Data Flow Notice

Third-party service: This skill sends your files to CamScanner's official servers (ai-tools.camscanner.com) for processing.

Data retention: CamScanner servers process your files in real-time. Files are not permanently stored on the server.

Local files: Output files are saved to your local filesystem at the path you specify.

API Reference

Base URL: https://ai-tools.camscanner.com

Supported Conversions

source_type	target_type	Output
pdf	md	.md

Step 1: Upload PDF

BASE="https://ai-tools.camscanner.com"

IN_FILE_ID=$(curl -sS -X POST "$BASE/v1/tools/upload_file/execute" \
  -H "Content-Type: application/octet-stream" \
  --data-binary "@/path/to/document.pdf" | jq -r '.tool_result.data.file_id')

Response:

{
  "code": 200,
  "tool": "upload_file",
  "tool_result": {
    "success": true,
    "data": {
      "file_id": "file_1741857600_ab12cd34ef56",
      "size": 24576
    }
  }
}

Step 2: Convert PDF to Markdown

OUT_FILE_ID=$(curl -sS -X POST "$BASE/v1/tools/convert_pdf/execute" \
  -H "Content-Type: application/json" \
  -d "{\"file_id\":\"$IN_FILE_ID\",\"source_type\":\"pdf\",\"target_type\":\"md\",\"output_mode\":\"file_id\"}" \
  | jq -r '.tool_result.data.file_id')

Response:

{
  "code": 200,
  "tool": "convert_pdf",
  "tool_result": {
    "success": true,
    "data": {
      "file_id": "file_1741857701_9988aabbccdd",
      "target_type": "md"
    }
  }
}

Step 3: Download Result

curl -sS -X POST "$BASE/v1/tools/download_file/execute?response_mode=raw" \
  -H "Content-Type: application/json" \
  -d "{\"file_id\":\"$OUT_FILE_ID\"}" \
  -o /path/to/output.md

Critical: The response_mode=raw query parameter is required to get the binary file. Without it, the response is JSON.

Quick Reference: Complete Pipeline

BASE="https://ai-tools.camscanner.com"
INPUT_PDF="/path/to/document.pdf"
OUTPUT_FILE="/path/to/output.md"

# Upload
IN_FILE_ID=$(curl -sS -X POST "$BASE/v1/tools/upload_file/execute" \
  -H "Content-Type: application/octet-stream" \
  --data-binary "@$INPUT_PDF" | jq -r '.tool_result.data.file_id')

# Convert
OUT_FILE_ID=$(curl -sS -X POST "$BASE/v1/tools/convert_pdf/execute" \
  -H "Content-Type: application/json" \
  -d "{\"file_id\":\"$IN_FILE_ID\",\"source_type\":\"pdf\",\"target_type\":\"md\",\"output_mode\":\"file_id\"}" \
  | jq -r '.tool_result.data.file_id')

# Download
curl -sS -X POST "$BASE/v1/tools/download_file/execute?response_mode=raw" \
  -H "Content-Type: application/json" \
  -d "{\"file_id\":\"$OUT_FILE_ID\"}" \
  -o "$OUTPUT_FILE"

Common Mistakes

Mistake	Fix
Forgetting `response_mode=raw` on download	Always append `?response_mode=raw` to the download URL
Wrong Content-Type on upload	Upload uses `application/octet-stream`, not `multipart/form-data`
Using GET instead of POST	All three endpoints use POST
Missing `source_type` in convert request	Always include `"source_type": "pdf"`
Missing `output_mode` in convert request	Always include `"output_mode": "file_id"` to get a downloadable file_id

Error Handling

Check each step before proceeding:

# After upload
if [ -z "$IN_FILE_ID" ] || [ "$IN_FILE_ID" = "null" ]; then
  echo "Upload failed"; exit 1
fi

# After convert
if [ -z "$OUT_FILE_ID" ] || [ "$OUT_FILE_ID" = "null" ]; then
  echo "Conversion failed"; exit 1
fi