reading-pdfs

Use when reading a PDF — converts to markdown via Mistral OCR with local caching.

Install command

npx skills add https://github.com/dzackgarza/ai --skill reading-pdfs

Copy and paste this command into Claude Code to install the skill

Source

dzackgarza/ai

Stars: 0

Forks: 0

Updated: May 31, 2026 at 12:36

SKILL.md

name	reading-pdfs
description	Use when reading a PDF — converts to markdown via Mistral OCR with local caching.
metadata	{"author":"dzack","version":"0.1.0"}

Reading PDFs with Mistral OCR

Overview

Use the Mistral OCR API as the first attempt when converting PDFs to markdown. The free tier includes a limited amount of OCR usage.

Important: Before extracting any PDF, check if it already exists in the local collection at ~/pdfs/.

PDF Storage Structure

~/pdfs/
├── arxiv/
│   └── {arxiv_key}/
│       ├── paper.pdf        # Original PDF
│       └── paper.md         # Extracted markdown
└── other/
    └── {filename}/
        ├── content.pdf      # Original PDF
        └── content.md       # Extracted markdown

Always save the original PDF alongside the extracted markdown. Name them paper.pdf and paper.md for arXiv papers, and content.pdf and content.md for everything else.

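For arXiv papers ({arxiv_key} is the arXiv ID, e.g. 0704.0001):

Download URL: https://arxiv.org/pdf/{arxiv_id}.pdf
Store the PDF as: ~/pdfs/arxiv/{arxiv_id}/paper.pdf
Store the markdown as: ~/pdfs/arxiv/{arxiv_id}/paper.md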
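The storage layout above could be captured in a small path helper. This is only a sketch: the function names `arxiv_paths` and `other_paths` are hypothetical, and it assumes that `{filename}` means the PDF's file name without its extension, which the layout does not state explicitly.

```python
from pathlib import Path

# Assumed to correspond to the ~/pdfs/ collection described above.
PDF_ROOT = Path.home() / "pdfs"


def arxiv_paths(arxiv_id: str, root: Path = PDF_ROOT) -> tuple[Path, Path]:
    """Return (pdf_path, markdown_path) for an arXiv paper."""
    paper_dir = root / "arxiv" / arxiv_id
    return paper_dir / "paper.pdf", paper_dir / "paper.md"


def other_paths(pdf_file: str, root: Path = PDF_ROOT) -> tuple[Path, Path]:
    """Return (pdf_path, markdown_path) for a non-arXiv PDF.

    Assumption: the directory is keyed by the file stem (name without ".pdf").
    """
    doc_dir = root / "other" / Path(pdf_file).stem
    return doc_dir / "content.pdf", doc_dir / "content.md"
```

Keeping the path logic in one place makes the "check before extracting" rule a single `is_file()` call on the returned markdown path.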

Workflow

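1. Check whether the PDF has already been extracted: look for ~/pdfs/arxiv/{arxiv_key}/paper.md (or ~/pdfs/other/{filename}/content.md).
2. If it does not exist: download the PDF, extract it with OCR, and save both files to the appropriate location.
3. Return the markdown content.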
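The workflow above is a cache-aside pattern, and it can be sketched independently of which extractor is used. The helper name `read_cached_markdown` and its `extract` callback are hypothetical; in practice the callback could be an OCR function or a local extractor.

```python
from pathlib import Path
from typing import Callable


def read_cached_markdown(
    pdf_path: Path,
    markdown_path: Path,
    extract: Callable[[Path], str],
) -> str:
    """Return cached markdown if present; otherwise extract, save, and return it."""
    # Step 1: reuse an earlier extraction when one exists.
    if markdown_path.is_file():
        return markdown_path.read_text(encoding="utf-8")

    # Step 2: extract and store the result next to the original PDF.
    markdown = extract(pdf_path)
    markdown_path.parent.mkdir(parents=True, exist_ok=True)
    markdown_path.write_text(markdown, encoding="utf-8")

    # Step 3: hand the content back to the caller.
    return markdown
```

Passing the extractor in as a callback keeps the caching logic testable without network access or an API key.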

Using Mistral OCR

Installation

pip install mistralai

Basic OCR Extraction

import os
from mistralai import Mistral

# Read the key from the environment; never hardcode API keys.
# os.environ[...] raises KeyError immediately if the variable is unset.
api_key = os.environ["MISTRAL_API_KEY"]
client = Mistral(api_key=api_key)

# Upload PDF
with open("/path/to/file.pdf", "rb") as f:
    uploaded = client.files.upload(
        file={"file_name": "file.pdf", "content": f.read()},
        purpose="ocr"
    )

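# Get a signed URL for the uploaded file (expiry is given in hours)
signed_url = client.files.get_signed_url(file_id=uploaded.id, expiry=1)

# Process with OCR; the explicit "type" follows Mistral's documented example
response = client.ocr.process(
    document={"type": "document_url", "document_url": signed_url.url},
    model="mistral-ocr-latest"
)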

# Extract markdown
markdown = "\n\n".join(page.markdown for page in response.pages)

Utility Function

import os
from mistralai import Mistral

def extract_pdf_to_markdown(pdf_path: str) -> str:
    """Extract PDF to markdown using Mistral OCR."""
    # Fail fast with KeyError if the API key is not set
    api_key = os.environ["MISTRAL_API_KEY"]
    client = Mistral(api_key=api_key)

    # Upload the PDF so the OCR endpoint can read it
    with open(pdf_path, "rb") as f:
        uploaded = client.files.upload(
            file={"file_name": os.path.basename(pdf_path), "content": f.read()},
            purpose="ocr"
        )

    # Signed URL for the uploaded file (expiry is given in hours)
    signed_url = client.files.get_signed_url(file_id=uploaded.id, expiry=1)

    response = client.ocr.process(
        document={"type": "document_url", "document_url": signed_url.url},
        model="mistral-ocr-latest"
    )

    # Join the per-page markdown into a single document
    return "\n\n".join(page.markdown for page in response.pages)

Downloading and Extracting an ArXiv Paper

import os
import urllib.request

def get_arxiv_paper(arxiv_id: str, base_dir: str = os.path.expanduser("~/pdfs/arxiv")) -> str:
    """
    Download arXiv paper and extract to markdown if not already cached.

    Args:
        arxiv_id: e.g., "0704.0001"
        base_dir: Base directory for PDF storage

    Returns:
        Extracted markdown content of the paper
    """
    # Check if already extracted
    paper_dir = os.path.join(base_dir, arxiv_id)
    paper_md = os.path.join(paper_dir, "paper.md")

    if os.path.exists(paper_md):
        with open(paper_md, "r", encoding="utf-8") as f:
            return f.read()

    # Create directory
    os.makedirs(paper_dir, exist_ok=True)

    # Download PDF (skipped if the original is already stored)
    pdf_path = os.path.join(paper_dir, "paper.pdf")
    if not os.path.exists(pdf_path):
        url = f"https://arxiv.org/pdf/{arxiv_id}.pdf"
        urllib.request.urlretrieve(url, pdf_path)

    # Extract to markdown (uses extract_pdf_to_markdown defined above)
    markdown = extract_pdf_to_markdown(pdf_path)

    # Save markdown next to the PDF, using the same encoding as the cache read
    with open(paper_md, "w", encoding="utf-8") as f:
        f.write(markdown)

    return markdown

Example Usage

# Get a paper (downloads and caches if not exists)
paper = get_arxiv_paper("0704.0001")
print(paper[:1000])  # First 1000 chars

Local Extraction (justfile recipes)

For extracting PDFs locally without the Mistral API, use the managed recipes in ~/pdf-extraction. These handle environment setup automatically via uv sync.

# From any directory
just -f ~/pdf-extraction/justfile -d ~/pdf-extraction <recipe>

Recipe	Purpose
`sample-pdf`	Regenerate the smoke-test PDF
`docling`	Extract with Docling
`mineru`	Extract with MinerU
`smoke`	Run both extraction checks

Outputs appear under ~/pdf-extraction/artifacts/ and ~/pdf-extraction/outputs/.
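When a recipe needs to be launched from Python rather than a shell, the same `just` invocation could be assembled as below. This is a sketch only: `just_command` and `run_recipe` are hypothetical helpers, the recipe set is copied from the table above, and nothing here assumes the recipes accept extra arguments.

```python
import subprocess
from pathlib import Path

# Assumed location of the managed recipes, as described above.
EXTRACTION_DIR = Path.home() / "pdf-extraction"
# Recipe names taken from the table above.
KNOWN_RECIPES = {"sample-pdf", "docling", "mineru", "smoke"}


def just_command(recipe: str, project: Path = EXTRACTION_DIR) -> list[str]:
    """Build the argv for `just -f <justfile> -d <dir> <recipe>`."""
    if recipe not in KNOWN_RECIPES:
        raise ValueError(f"unknown recipe: {recipe!r}")
    return ["just", "-f", str(project / "justfile"), "-d", str(project), recipe]


def run_recipe(recipe: str) -> None:
    """Run a recipe and raise CalledProcessError if it fails."""
    subprocess.run(just_command(recipe), check=True)
```

Calling `run_recipe("smoke")` would then run both extraction checks through the managed environment instead of an ad hoc one.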

Do not create a separate venv or install ad hoc — let the recipes manage the environment.

When only structured extraction data is needed, prefer a recipe that emits the minimal MinerU JSON artifacts (middle.json and content_list.json) without generating extra rendered PDFs or Markdown. The recipe should own that mode; do not run private one-off extraction scripts. After extraction, verify the expected output files and keep the run log with the artifacts.

Zotero and MinerU Artifacts

MinerU markdown/JSON are external research artifacts, not repository source. Preserve that separation:

Original PDFs belong under ~/pdfs or Zotero storage, not in agent/code repos.
Extraction artifacts belong under ~/pdf-extraction outputs or the relevant Zotero attachment path, not in Git LFS.
When Zotero already has a PDF, prefer resolving the local attachment path via the zotero-api skill before downloading a duplicate.
When attaching existing MinerU output back to Zotero, verify against the running Zotero local API and Better BibTeX key; do not infer matches from filenames alone.

Notes

The OCR handles complex documents, including tables, math equations, and multi-column layouts.
The free tier includes some OCR usage (check the Mistral dashboard for current limits).
The OCR response reports pages_processed in its usage info.
For very large documents, consider processing in batches.
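One way to read "processing in batches" is to split a document's page indices into fixed-size groups and send one group per OCR request. The helper below is a hypothetical sketch of the splitting step only. It assumes the OCR call can be restricted to a list of 0-based page indices, which should be checked against the current Mistral SDK documentation before relying on it.

```python
def page_batches(total_pages: int, batch_size: int) -> list[list[int]]:
    """Split 0-based page indices into consecutive batches of at most batch_size."""
    if total_pages < 0:
        raise ValueError("total_pages must be non-negative")
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    return [
        list(range(start, min(start + batch_size, total_pages)))
        for start in range(0, total_pages, batch_size)
    ]
```

For a 5-page document and a batch size of 2, this yields `[[0, 1], [2, 3], [4]]`. The markdown from each batch can then be joined in order.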