Skip to main content
在 Manus 中运行任何 Skill
一键导入

doc-to-vector-dataset-generator

Converts documents into clean, chunked datasets suitable for embeddings and vector search. Produces chunked JSONL files with metadata, deduplication logic, and quality checks. Use when preparing "training data", "vector datasets", "document processing", or "embedding data".

概览

Converts documents into clean, chunked datasets suitable for embeddings and vector search. Produces chunked JSONL files with metadata, deduplication logic, and quality checks. Use when preparing "training data", "vector datasets", "document processing", or "embedding data".

安装命令
npx skills add https://github.com/patricio0312rev/skillset --skill doc-to-vector-dataset-generator

复制此命令并粘贴到 Claude Code 中以安装该技能

星标5
分支0
更新时间2025年12月31日 05:05
SKILL.md
readonly