Run any Skill in Manus with one click

pdf-toolkit

Structured `.pdf` operations: extract text/tables, merge pages from multiple PDFs, split a PDF by page ranges, fill PDF form fields, and generate fresh PDFs from JSON. Trigger when the user wants programmatic PDF work without natural-language rewriting — examples: pull tables from a report, combine three PDFs, extract pages 5-12, fill a tax form, or build a new PDF from data. Distinct from `nano-pdf`, which uses an LLM to rewrite a page from a sentence; this skill is deterministic byte-level work via pypdf, pdfplumber, and reportlab.

Run Skill in Manus

Stars3,700

Forks288

UpdatedMay 6, 2026 at 22:15

Source

opensquilla

opensquilla/opensquilla

View GitHub Repository View Creator Repositories

Install command

Download

Run Skill in Manus

Useful forSOC

Software DevelopersComputer and Mathematical Occupations15-1252L4

File Explorer

8 files

SKILL.md

readonly

More from this repository

same repository

nano-banana-pro

opensquilla/opensquilla

Generate or edit a single image via OpenRouter (google/gemini-3.1-flash-image-preview by default). Accepts a text prompt and optional --input-image for image-to-image editing. Trigger when the user asks for an AI image, illustration, concept art, product render, or wants to modify an existing image.

2026-06-043.7k

seedance-2-prompt

opensquilla/opensquilla

Render a single 3-15s video clip via Seedance 2.0. Supports two backends: OpenRouter (default, model bytedance/seedance-2.0) and the official Volcengine ARK / BytePlus ModelArk endpoint (model doubao-seedance-2-0-260128 / dreamina-seedance-2-0-260128). Accepts a structured English video prompt, optional first-frame image, and optional identity/style reference image. Trigger when the user asks for AI video clip generation, 分镜视频, seedance, or wants a short cinematic shot from a prompt + frame.

2026-06-043.7k

meta-paper-write

opensquilla/opensquilla

Use this meta-skill instead of answering directly when the current user asks to draft, repair, compile, or produce an academic/research paper or LaTeX manuscript. It uses multi-skill orchestration for manuscript workflows that need source search, citation planning, experiment or figure/table placeholders, drafting, length checks, citation integrity, and LaTeX/PDF compilation. Ordinary paper requests use a compact draft path; explicit full/PDF/long-form requests use the full manuscript path. Do not use it for web research reports, slide decks, document decisions, or generic plotting.

2026-06-033.7k

advanced-dubbing-studio

opensquilla/opensquilla

Submit audio or video for multilingual dubbing, poll status, and download dubbed audio. Use when the user asks for dubbing, 多语言配音, 视频翻译配音, 译制片, or wants a source clip dubbed into another language.

2026-06-023.7k

music-and-singing-studio

opensquilla/opensquilla

Generate instrumental music, background beds, jingles, or sung songs with lyrics through OpenSquilla audio tools. Use when the user asks for BGM, music generation, 唱歌, 生成歌曲, lyrics to song, or a playable music audio artifact.

2026-06-023.7k

voice-clone-lab

opensquilla/opensquilla

Create and register cloned voices for later TTS only when the speaker has explicit consent. Use when the user asks for voice clone, clone voice, 克隆音色, 复刻声音, or wants a reusable voice_id.

2026-06-023.7k

name	pdf-toolkit
description	Structured `.pdf` operations: extract text/tables, merge pages from multiple PDFs, split a PDF by page ranges, fill PDF form fields, and generate fresh PDFs from JSON. Trigger when the user wants programmatic PDF work without natural-language rewriting — examples: pull tables from a report, combine three PDFs, extract pages 5-12, fill a tax form, or build a new PDF from data. Distinct from `nano-pdf`, which uses an LLM to rewrite a page from a sentence; this skill is deterministic byte-level work via pypdf, pdfplumber, and reportlab.
homepage	https://pypdf.readthedocs.io/
provenance	{"origin":"clawhub-mit0","license":"MIT-0","upstream_url":"https://clawhub.ai/pdf","maintained_by":"OpenSquilla"}
metadata	{"platform":{"emoji":"📕","requires":{"anyBins":["python","python3"]},"install":[{"id":"pypdf","kind":"uv","package":"pypdf","label":"Install pypdf (uv pip)"},{"id":"reportlab","kind":"uv","package":"reportlab","label":"Install reportlab (uv pip)"}]}}

pdf-toolkit

Deterministic, structural PDF operations. Use this skill for programmatic work where you know exactly what you want done. Use the sibling nano-pdf skill instead when the task is "rewrite this page to say X" — nano-pdf applies a natural-language edit; pdf-toolkit applies an explicit operation.

Decide the operation

Goal	Script
Get text or tables out of a PDF	`extract.py`
Combine pages from multiple PDFs	`merge.py`
Split a PDF by page ranges	`split.py`
Fill `/Tx` form fields in a PDF	`form_fill.py`
Build a new PDF from data	inline `reportlab` snippet, see Path C below

Path A: Extract

python {baseDir}/scripts/extract.py /path/to/doc.pdf --json

Output:

{
  "pages": 12,
  "metadata": {"title": "...", "author": "..."},
  "text": [
    {"page": 1, "content": "..."},
    {"page": 2, "content": "..."}
  ],
  "tables": [
    {"page": 3, "rows": [["..."], ["..."]]}
  ]
}

Text uses pdfplumber (already in default dependencies) which preserves column layout better than naive PDF text extraction. Tables use pdfplumber.extract_tables() with default settings; for tricky layouts pass --tables-strategy lines|text|explicit to switch detection mode.

For OCR (scanned PDFs), this skill does not include Tesseract — use the sibling skill that wraps an OCR engine (out of scope here).

Path B: Merge / Split

Merge full files:

python {baseDir}/scripts/merge.py a.pdf b.pdf c.pdf --out combined.pdf

Or merge specific page ranges with the manifest form:

python {baseDir}/scripts/merge.py manifest.json --out combined.pdf

manifest.json:

[
  {"file": "a.pdf", "pages": "1-3"},
  {"file": "b.pdf", "pages": "5,7,9-11"},
  {"file": "c.pdf"}
]

Page ranges are 1-based, comma-separated, hyphen for ranges. Omit pages to include the whole file. Splits use the same syntax in reverse:

python {baseDir}/scripts/split.py input.pdf --pages "1-3,7,10-12" --out output_dir/

Each range writes one output file: output_dir/input_001.pdf, output_dir/input_002.pdf, …

Path C: Form fill

python {baseDir}/scripts/form_fill.py form.pdf data.json --out filled.pdf

data.json maps field name → string value:

{
  "applicant_name": "Wei E.",
  "submission_date": "2026-05-06",
  "agreed": "Yes"
}

The script discovers fields via pypdf.PdfReader.get_fields() and updates them with update_page_form_field_values(). Fields not present in the JSON are left untouched. Run with --list-fields to enumerate the form's fields without filling.

Caveats:

/Btn checkbox fields take the export value (often Yes, On, or 1) rather than true — inspect with --list-fields to discover.
AcroForm fills only. XFA forms (used by some legal templates) require Adobe-specific tooling and are out of scope.
Some signed PDFs invalidate the signature when fields change. Strip signatures explicitly with --clear-signatures if that is intended.

Path D: Generate from scratch

Use reportlab directly when you need a new PDF:

from reportlab.pdfgen import canvas
from reportlab.lib.pagesizes import LETTER
from pathlib import Path

c = canvas.Canvas(str(Path("out.pdf")), pagesize=LETTER)
c.setFont("Helvetica-Bold", 18)
c.drawString(72, 720, "Q3 Review")
c.setFont("Helvetica", 11)
c.drawString(72, 696, "Revenue grew 18% year over year.")
c.showPage()
c.save()

For tables, headers/footers, and multi-column layouts, switch to reportlab.platypus (SimpleDocTemplate, Paragraph, Table, PageBreak). See references/reportlab.md.

Boundary with `nano-pdf`

nano-pdf (sibling bundled skill) wraps an LLM that takes a page index and a natural-language instruction. Use it when the change is "fix the typo on page 1" or "make the title shorter". Use this skill when the change is "merge these three PDFs", "extract the tables", or "fill the form". The two do not overlap: if you find yourself reaching for nano-pdf to do a merge, switch to pdf-toolkit; if you reach here to "rewrite page 5 to be friendlier", switch back.

Common pitfalls

Symptom	Cause	Fix
Extracted text is empty	Scanned PDF, no text layer	OCR is out of scope; use a separate OCR skill
Garbled characters in extract	PDF uses a custom font encoding	Try `pdfplumber.open(path, laparams={...})` with `char_margin` adjustments
Merged PDF is huge	Underlying PDFs include large embedded fonts	Subset fonts via `pypdf` `compress_content_streams()`
Form fill silently no-ops	Field name in JSON does not match PDF field name	Run with `--list-fields` first to see exact names
Pages out of order after split	Range overlap collapsed unexpectedly	Use disjoint ranges, e.g. `1-3,4-6` not `1-5,3-6`

Boundaries

This skill works with text-based and form-based PDFs. Scanned image PDFs need OCR before any text path produces results.
Encrypted PDFs are read-only here. Decryption requires the user-supplied password and is out of scope for this skill.
For PDF-to-image rendering, use a separate skill that wraps Poppler or PyMuPDF.
Digital signature operations (signing, verifying, revoking) are out of scope.

pdf-toolkit

More from this repository

More from this repository

pdf-toolkit

Decide the operation

Path A: Extract

Path B: Merge / Split

Path C: Form fill

Path D: Generate from scratch

Boundary with nano-pdf

Common pitfalls

Boundaries

pdf-toolkit

Decide the operation

Path A: Extract

Path B: Merge / Split

Path C: Form fill

Path D: Generate from scratch

Boundary with nano-pdf

Common pitfalls

Boundaries

Boundary with `nano-pdf`

Boundary with `nano-pdf`