원클릭으로 Manus에서 모든 스킬 실행

liteparse

스타141

포크24

업데이트2026년 6월 16일 23:16

Provides fast document to markdown extraction. Use this skill when the user asks to parse, perform multi-format document conversion or spatially extract text from an unstructured file (PDF, DOCX, PPTX, XLSX, images, etc.) locally.

설치

Codex 또는 Claude로 설치 이 Prompt를 복사해 Codex, Claude 또는 다른 어시스턴트에 붙여 넣으면 Skill 페이지를 검토하고 설치를 진행할 수 있습니다.

Manus에서 실행

출처

sammcj

sammcj/agentic-coding

GitHub 저장소 열기 Creator 저장소 보기

다운로드

Manus에서 실행

LiteParse Skill

Parse unstructured documents (PDF, DOCX, PPTX, XLSX, images, and more) locally with LiteParse: fast, lightweight, no cloud dependencies or LLM required.

Step 0 - Use via npx, or install LiteParse

NOTE: Rather than installing liteparse globally, you can instead run it directly with npx, substituting lit <args> with npx -y @llamaindex/liteparse <args> in the commands below.

npx -y @llamaindex/liteparse

Otherwise if installing globally, use pnpm install -g @llamaindex/liteparse and then run lit <args>.

Step 1 - Produce the CLI Command or Script

Parse a Single File

# Basic text extraction
lit parse document.pdf

# JSON output saved to a file
lit parse document.pdf --format json -o output.json

# Specific page range
lit parse document.pdf --target-pages "1-5,10,15-20"

# Disable OCR (faster, text-only PDFs)
lit parse document.pdf --no-ocr

# Use an external HTTP OCR server for higher accuracy
lit parse document.pdf --ocr-server-url http://localhost:8828/ocr

# Higher DPI for better quality
lit parse document.pdf --dpi 300

Batch Parse a Directory

lit batch-parse ./input-directory ./output-directory

# Only process PDFs, recursively
lit batch-parse ./input ./output --extension .pdf --recursive

Generate Page Screenshots

Screenshots are useful for LLM agents that need to see visual layout.

# All pages
lit screenshot document.pdf -o ./screenshots

# Specific pages
lit screenshot document.pdf --pages "1,3,5" -o ./screenshots

# High-DPI PNG
lit screenshot document.pdf --dpi 300 --format png -o ./screenshots

# Page range
lit screenshot document.pdf --pages "1-10" -o ./screenshots

Step 3 - Key Options Reference

OCR Options

Option	Description
(default)	Tesseract.js - zero setup, built-in
`--ocr-language fra`	Set OCR language (ISO code)
`--ocr-server-url <url>`	Use external HTTP OCR server (EasyOCR, PaddleOCR, custom)
`--no-ocr`	Disable OCR entirely

Output Options

Option	Description
`--format json`	Structured JSON with bounding boxes
`--format text`	Plain text (default)
`-o <file>`	Save output to file

Performance / Quality Options

Option	Description
`--dpi <n>`	Rendering DPI (default: 150; use 300 for high quality)
`--max-pages <n>`	Limit pages parsed
`--target-pages <pages>`	Parse specific pages (e.g. `"1-5,10"`)
`--no-precise-bbox`	Disable precise bounding boxes (faster)
`--skip-diagonal-text`	Ignore rotated/diagonal text
`--preserve-small-text`	Keep very small text that would otherwise be dropped

Step 4 - Using a Config File

For repeated use with consistent options, generate a liteparse.config.json:

{
  "ocrLanguage": "en",
  "ocrEnabled": true,
  "maxPages": 1000,
  "dpi": 150,
  "outputFormat": "json",
  "preciseBoundingBox": true,
  "skipDiagonalText": false,
  "preserveVerySmallText": false
}

For an HTTP OCR server:

{
  "ocrServerUrl": "http://localhost:8828/ocr",
  "ocrLanguage": "en",
  "outputFormat": "json"
}

Use with:

lit parse document.pdf --config liteparse.config.json

Step 5 - HTTP OCR Server API (Advanced)

If the user wants to plug in a custom OCR backend, the server must implement:

Endpoint: POST /ocr
Accepts: file (multipart) and language (string) parameters
Returns:

{
  "results": [
    { "text": "Hello", "bbox": [x1, y1, x2, y2], "confidence": 0.98 }
  ]
}

Ready-to-use wrappers exist for EasyOCR and PaddleOCR in the LiteParse repo.

Supported Input Formats

Category	Formats
PDF	`.pdf`
Word	`.doc`, `.docx`, `.docm`, `.odt`, `.rtf`
PowerPoint	`.ppt`, `.pptx`, `.pptm`, `.odp`
Spreadsheets	`.xls`, `.xlsx`, `.xlsm`, `.ods`, `.csv`, `.tsv`
Images	`.jpg`, `.jpeg`, `.png`, `.gif`, `.bmp`, `.tiff`, `.webp`, `.svg`

Office documents require LibreOffice; images require ImageMagick. LiteParse auto-converts these formats to PDF before parsing.

이 저장소의 다른 Skills

같은 저장소

skill-creator-primer

sammcj/agentic-coding

Foundational skill-authoring knowledge to use alongside the skill-creator skill. You MUST always load this skill before loading the skill-creator skill, when creating or updating skills.

2026-06-23141

iterative-refinement

sammcj/agentic-coding

Disciplined, measurable iteration for a substantial refinement or investigation: loop against verifiable pass/fail conditions, fan work out to subagents, and keep the main context lean. Use when improving something measurable over repeated cycles (tuning a metric or detector, refactoring against a regression bar), chasing a surprising or suspicious number, or driving a long multi-step task where delegation and context discipline matter. Not for one-shot edits or quick lookups that don't warrant a loop.

2026-06-20141

llm-wiki

sammcj/agentic-coding

Use when building or maintaining a self-contained personal knowledge base (an LLM wiki) as plain markdown, optionally opened as an Obsidian vault. Triggers: ingesting sources into a wiki, querying wiki knowledge, linting wiki health, auditing article claims against their sources, critiquing the reasoning in a source or article, superseding stale knowledge, 'add to wiki', or any mention of 'LLM wiki' or 'Karpathy wiki'.

2026-06-18141

code-review

sammcj/agentic-coding

Use this skill after completing multiple, complex software development tasks before informing the user that work is complete.

2026-06-16141

find-docs

sammcj/agentic-coding

Retrieves up-to-date documentation, API references, and code examples for any developer technology. Use this skill whenever the user asks about, or when you need to lookup documentation or usage reference a specific library, framework, SDK, CLI tool, or cloud service - even if well-known as your training data may be outdated. Prefer this over web search for library documentation and API details.

2026-06-16141

fable-mode

sammcj/agentic-coding

Activates explicit multi-stage planning, aggressive sub-agent delegation, and mandatory self-verification at each stage. Use this skill for complex tasks that benefit from systematic decomposition - large software projects, multi-source research, long-running analyses, scientific investigation, or any task where correctness and thoroughness matter more than raw speed.

2026-06-14141

name	liteparse
description	Provides fast document to markdown extraction. Use this skill when the user asks to parse, perform multi-format document conversion or spatially extract text from an unstructured file (PDF, DOCX, PPTX, XLSX, images, etc.) locally.

LiteParse Skill

Parse unstructured documents (PDF, DOCX, PPTX, XLSX, images, and more) locally with LiteParse: fast, lightweight, no cloud dependencies or LLM required.

Step 0 - Use via npx, or install LiteParse

NOTE: Rather than installing liteparse globally, you can instead run it directly with npx, substituting lit <args> with npx -y @llamaindex/liteparse <args> in the commands below.

npx -y @llamaindex/liteparse

Otherwise if installing globally, use pnpm install -g @llamaindex/liteparse and then run lit <args>.

Step 1 - Produce the CLI Command or Script

Parse a Single File

# Basic text extraction
lit parse document.pdf

# JSON output saved to a file
lit parse document.pdf --format json -o output.json

# Specific page range
lit parse document.pdf --target-pages "1-5,10,15-20"

# Disable OCR (faster, text-only PDFs)
lit parse document.pdf --no-ocr

# Use an external HTTP OCR server for higher accuracy
lit parse document.pdf --ocr-server-url http://localhost:8828/ocr

# Higher DPI for better quality
lit parse document.pdf --dpi 300

Batch Parse a Directory

lit batch-parse ./input-directory ./output-directory

# Only process PDFs, recursively
lit batch-parse ./input ./output --extension .pdf --recursive

Generate Page Screenshots

Screenshots are useful for LLM agents that need to see visual layout.

# All pages
lit screenshot document.pdf -o ./screenshots

# Specific pages
lit screenshot document.pdf --pages "1,3,5" -o ./screenshots

# High-DPI PNG
lit screenshot document.pdf --dpi 300 --format png -o ./screenshots

# Page range
lit screenshot document.pdf --pages "1-10" -o ./screenshots

Step 3 - Key Options Reference

OCR Options

Option	Description
(default)	Tesseract.js - zero setup, built-in
`--ocr-language fra`	Set OCR language (ISO code)
`--ocr-server-url <url>`	Use external HTTP OCR server (EasyOCR, PaddleOCR, custom)
`--no-ocr`	Disable OCR entirely

Output Options

Option	Description
`--format json`	Structured JSON with bounding boxes
`--format text`	Plain text (default)
`-o <file>`	Save output to file

Performance / Quality Options

Option	Description
`--dpi <n>`	Rendering DPI (default: 150; use 300 for high quality)
`--max-pages <n>`	Limit pages parsed
`--target-pages <pages>`	Parse specific pages (e.g. `"1-5,10"`)
`--no-precise-bbox`	Disable precise bounding boxes (faster)
`--skip-diagonal-text`	Ignore rotated/diagonal text
`--preserve-small-text`	Keep very small text that would otherwise be dropped

Step 4 - Using a Config File

For repeated use with consistent options, generate a liteparse.config.json:

{
  "ocrLanguage": "en",
  "ocrEnabled": true,
  "maxPages": 1000,
  "dpi": 150,
  "outputFormat": "json",
  "preciseBoundingBox": true,
  "skipDiagonalText": false,
  "preserveVerySmallText": false
}

For an HTTP OCR server:

{
  "ocrServerUrl": "http://localhost:8828/ocr",
  "ocrLanguage": "en",
  "outputFormat": "json"
}

Use with:

lit parse document.pdf --config liteparse.config.json

Step 5 - HTTP OCR Server API (Advanced)

If the user wants to plug in a custom OCR backend, the server must implement:

Endpoint: POST /ocr
Accepts: file (multipart) and language (string) parameters
Returns:

{
  "results": [
    { "text": "Hello", "bbox": [x1, y1, x2, y2], "confidence": 0.98 }
  ]
}

Ready-to-use wrappers exist for EasyOCR and PaddleOCR in the LiteParse repo.

Supported Input Formats

Category	Formats
PDF	`.pdf`
Word	`.doc`, `.docx`, `.docm`, `.odt`, `.rtf`
PowerPoint	`.ppt`, `.pptx`, `.pptm`, `.odp`
Spreadsheets	`.xls`, `.xlsx`, `.xlsm`, `.ods`, `.csv`, `.tsv`
Images	`.jpg`, `.jpeg`, `.png`, `.gif`, `.bmp`, `.tiff`, `.webp`, `.svg`

Office documents require LibreOffice; images require ImageMagick. LiteParse auto-converts these formats to PDF before parsing.