pdf-processing
Extract text and tables from PDF files, fill PDF forms, merge and split documents. Use when the user mentions PDFs, forms, document extraction, or needs to combine multiple PDF files.
| Field | Value |
|---|---|
| name | pdf-processing |
| description | Extract text and tables from PDF files, fill PDF forms, merge and split documents. Use when the user mentions PDFs, forms, document extraction, or needs to combine multiple PDF files. |
| license | Apache-2.0 |
| metadata | {"author":"example-org","version":"1.0"} |
This skill enables working with PDF documents: extracting text, filling forms, merging files, and splitting pages.
Activate this skill when the user mentions PDFs, forms, or document extraction, or needs to combine multiple PDF files.
Determine which operation is needed:
| Task | Tool |
|---|---|
| Extract text | pdfplumber |
| Fill forms | pypdf |
| Merge files | pypdf |
| Split pages | pypdf |
pip install pdfplumber pypdf
import pdfplumber

# Open the PDF and print the extracted text of each page in order
with pdfplumber.open("document.pdf") as pdf:
    for page in pdf.pages:
        text = page.extract_text()
        print(text)
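The skill description also mentions tables, which the text-extraction snippet does not cover. A minimal sketch, assuming pdfplumber's `page.extract_tables()`, which returns each table as a list of rows with `None` for empty cells. The helper names `table_to_csv` and `extract_tables_as_csv` are hypothetical, chosen only for illustration:

```python
import csv
import io


def table_to_csv(table):
    """Convert one table (a list of rows) to CSV text, mapping None cells to ''."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    for row in table:
        writer.writerow(["" if cell is None else cell for cell in row])
    return buffer.getvalue()


def extract_tables_as_csv(path):
    """Return a list of CSV strings, one per table found in the PDF."""
    import pdfplumber  # imported lazily so table_to_csv works without it

    results = []
    with pdfplumber.open(path) as pdf:
        for page in pdf.pages:
            for table in page.extract_tables():
                results.append(table_to_csv(table))
    return results
```

Calling `extract_tables_as_csv("document.pdf")` would yield one CSV string per detected table. How well detection works depends on the PDF's layout, so the output is worth checking by eye.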
from pypdf import PdfWriter

writer = PdfWriter()
# Append every page of each input file, in the order listed
for filename in ["file1.pdf", "file2.pdf"]:
    writer.append(filename)
# Write the combined document to disk
with open("merged.pdf", "wb") as output:
    writer.write(output)
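The task table also lists splitting pages and filling forms with pypdf, but only merging is shown. A rough sketch of both, assuming pypdf's `PdfReader`, `PdfWriter.add_page`, `PdfWriter.append`, and `PdfWriter.update_page_form_field_values`. The function names, the output naming scheme, and the field names passed in `values` are hypothetical:

```python
def page_filename(stem, page_number):
    """Build an output name such as 'report_page_003.pdf' (1-based page number)."""
    return f"{stem}_page_{page_number:03d}.pdf"


def split_pages(path, stem):
    """Write every page of `path` to its own single-page PDF; return the file names."""
    from pypdf import PdfReader, PdfWriter  # lazy import: only needed here

    reader = PdfReader(path)
    written = []
    for number, page in enumerate(reader.pages, start=1):
        writer = PdfWriter()
        writer.add_page(page)
        name = page_filename(stem, number)
        with open(name, "wb") as output:
            writer.write(output)
        written.append(name)
    return written


def fill_form(path, output_path, values):
    """Copy `path` to `output_path` with form fields set from the `values` dict."""
    from pypdf import PdfReader, PdfWriter

    reader = PdfReader(path)
    writer = PdfWriter()
    writer.append(reader)  # copies the pages together with the form definition
    for page in writer.pages:
        # Assumes the input actually contains form fields matching `values`
        writer.update_page_form_field_values(page, values)
    with open(output_path, "wb") as output:
        writer.write(output)
```

For example, `split_pages("document.pdf", "document")` would produce `document_page_001.pdf`, `document_page_002.pdf`, and so on. The field names for `fill_form` can be discovered beforehand with `PdfReader(path).get_fields()`.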
OCR (for scanned PDFs): pytesseract with pdf2image

Always close file handles after processing. Use context managers (`with` statements) wherever possible.
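A minimal sketch of pairing pytesseract with pdf2image, on the assumption that they are meant as an OCR fallback for scanned pages that have no text layer. `needs_ocr` and `ocr_pdf` are hypothetical helper names. Both libraries also rely on system dependencies (the Tesseract binary and Poppler) being installed:

```python
def needs_ocr(text):
    """Treat a page as scanned when text extraction yields nothing but whitespace."""
    return text is None or not text.strip()


def ocr_pdf(path, dpi=300):
    """Render each page to an image and run Tesseract on it; return one string per page."""
    from pdf2image import convert_from_path  # requires Poppler
    import pytesseract  # requires the Tesseract binary

    images = convert_from_path(path, dpi=dpi)
    return [pytesseract.image_to_string(image) for image in images]
```

One way to combine the two helpers is to call `page.extract_text()` first and run `ocr_pdf` only when `needs_ocr` reports an empty result, since OCR is considerably slower than reading an existing text layer.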