ワンクリックでManusで任意のスキルを実行

$pwd:

import

Name: Import
Author: ZimoLiao

// Use when the user wants to import papers from Endnote XML/RIS, Zotero Web API or local SQLite, attach PDFs, match PDFs to records, or supplement records with PDF content.

Manusで実行

$ git log --oneline --stat

stars:493

forks:68

updated:2026年5月23日 13:16

SKILL.md

readonly

name	import
description	Use when the user wants to import papers from Endnote XML/RIS, Zotero Web API or local SQLite, attach PDFs, match PDFs to records, or supplement records with PDF content.

导入外部文献管理工具数据 / 补充 PDF

支持从 Endnote / Zotero 批量导入，或为已入库论文补充/重拉 PDF（attach-pdf、fetch-pdf）。

Endnote 导入

支持 Endnote 导出的 XML 和 RIS 格式文件。

# 完整导入：元数据 + PDF 匹配 + MinerU 批量转换 + enrich (toc/l3/abstract) + embed + index
scholaraio import-endnote <file.xml>

# 多文件导入
scholaraio import-endnote file1.xml file2.ris

# 仅导入元数据和 PDF，跳过 MinerU 转换和 enrich
scholaraio import-endnote <file.xml> --no-convert

# 预览模式
scholaraio import-endnote <file.xml> --dry-run

# 离线模式
scholaraio import-endnote <file.xml> --no-api

PDF 自动匹配

对 Endnote XML 文件，自动解析 internal-pdf:// 链接，从 <library>.Data/PDF/ 目录匹配 PDF：

多个 PDF 时自动排除 SI/补充材料
默认通过 MinerU 批量转换为 paper.md

导入后自动处理

默认行为（不带 --no-convert）下，导入完成后自动执行完整 pipeline：

批量 PDF→MD：云端模式使用 convert_pdfs_cloud_batch() 批量转换（批次大小由 config.yaml ingest.mineru_batch_size 控制，默认 20）
Abstract 补全：从 markdown 中提取缺失的摘要
TOC + L3 提取：LLM 提取目录结构和结论段
Embed + Index：更新语义向量和全文索引

使用 --no-convert 跳过以上全部后处理（仅导入元数据 + PDF 复制 + embed + index）。

Zotero 导入

支持 Web API 和本地 SQLite 两种模式。

Web API 模式

# 列出 collections
scholaraio import-zotero --api-key KEY --library-id ID --list-collections

# 完整导入
scholaraio import-zotero --api-key KEY --library-id ID

# 仅导入指定 collection
scholaraio import-zotero --api-key KEY --library-id ID --collection COLLECTION_KEY

# 导入后将 collections 创建为工作区
scholaraio import-zotero --api-key KEY --library-id ID --import-collections

本地 SQLite 模式

scholaraio import-zotero --local /path/to/zotero.sqlite

配置文件（可选）

在 config.local.yaml 中配置 Zotero 凭据：

zotero:
  api_key: "your-zotero-api-key"
  library_id: "your-library-id"

补充 PDF（单篇）

已有本地 PDF 文件时：

scholaraio attach-pdf <paper-id> <path/to/paper.pdf> [--force]

自动把原始 PDF 保存到论文目录中（与 paper.md 同级，使用论文目录同名 stem），调用 MinerU 转换 PDF → markdown，补全缺失的 abstract，增量更新 embed + index。若目标目录已有 canonical PDF，默认拒绝覆盖；确认要替换时使用 --force。

当前网络环境有正版访问权限、希望从出版社页面或 DOI 拉取 PDF 时：

# 拉取新论文 PDF 到 configured inbox；校园网直连时加 --direct 可绕过代理环境变量
scholaraio fetch-pdf 10.xxxx/example --direct

# 拉取后立刻走普通论文入库流程
scholaraio fetch-pdf 10.xxxx/example --direct --ingest

# 为已入库单篇论文重新拉取 canonical PDF；已有 PDF 时需要 --force
scholaraio fetch-pdf --paper <paper-id> --direct --force

# 为指定多篇论文批量重新拉取 PDF
scholaraio fetch-pdf --paper <paper-id-1> <paper-id-2> --direct --force

# 批量为全库论文重新拉取 PDF
scholaraio fetch-pdf --all --direct --force

fetch-pdf 只做 PDF 获取，不做访问绕过；能否下载取决于用户当前网络、机构权限和 publisher 返回的 PDF 链接。它会优先使用 source_url，否则使用 DOI。重拉已有论文不会自动重新转换 paper.md。

批量补转 PDF（已入库论文）

对已入库但缺少 paper.md 的论文（如首次导入时用了 --no-convert），可通过 Python 调用批量转换：

from scholaraio.core.config import load_config
from scholaraio.services.ingest.pipeline import batch_convert_pdfs

cfg = load_config()
stats = batch_convert_pdfs(cfg, enrich=True)

自动扫描 configured papers library 中有 PDF 无 paper.md 的论文，云端模式使用批量 API 转换，完成后运行 abstract backfill + toc + l3 + embed + index。

related-skills.json

同じリポジトリ

index.md

from "ZimoLiao/scholaraio"

Use when the user wants to rebuild or refresh ScholarAIO keyword, full-text, FTS5, FAISS, or semantic search indexes after data or metadata changes.

2026-05-23493

search.md

from "ZimoLiao/scholaraio"

Use when the user wants to find academic papers, search the local library, run keyword or semantic search, search by author, explore topics, or federate across library, explore databases, and arXiv.

2026-05-23493

ingest.md

from "ZimoLiao/scholaraio"

Use when the user wants to process new papers, patents, theses, documents, or proceedings from inbox queues into the knowledge base, run the ingest pipeline, or rebuild indexes.

2026-05-23493

draw.md

from "ZimoLiao/scholaraio"

Use when the user wants diagrams, flowcharts, architecture visuals, data relationships, timelines, concept maps, Mermaid, Graphviz, drawio, or polished paper figures generated from structured text or IR.

2026-05-06493

setup.md

from "ZimoLiao/scholaraio"

Use when the user wants to install, configure, diagnose, or troubleshoot ScholarAIO, including setup check, dependency status, API keys, and bilingual setup flow.

2026-05-06493

paper2any.md

from "ZimoLiao/scholaraio"

Use when the user wants Paper2Any-based paper-to-figure, PPT, poster, video, citation, rebuttal, DrawIO, mindmap, image, PDF-to-PPT, or KB workflows through the ScholarAIO Paper2Any MCP sidecar.

2026-05-06493

package.json

"author": "ZimoLiao"

"repository": "ZimoLiao/scholaraio"

GitHub リポジトリを開く Creator のリポジトリを見る

$ install --global

$ download --local

Manusで実行

$ useful --forSOC

ソフトウェア開発者コンピュータ・数学職15-1252L4

name	import
description	Use when the user wants to import papers from Endnote XML/RIS, Zotero Web API or local SQLite, attach PDFs, match PDFs to records, or supplement records with PDF content.

导入外部文献管理工具数据 / 补充 PDF

支持从 Endnote / Zotero 批量导入，或为已入库论文补充/重拉 PDF（attach-pdf、fetch-pdf）。

Endnote 导入

支持 Endnote 导出的 XML 和 RIS 格式文件。

# 完整导入：元数据 + PDF 匹配 + MinerU 批量转换 + enrich (toc/l3/abstract) + embed + index
scholaraio import-endnote <file.xml>

# 多文件导入
scholaraio import-endnote file1.xml file2.ris

# 仅导入元数据和 PDF，跳过 MinerU 转换和 enrich
scholaraio import-endnote <file.xml> --no-convert

# 预览模式
scholaraio import-endnote <file.xml> --dry-run

# 离线模式
scholaraio import-endnote <file.xml> --no-api

PDF 自动匹配

对 Endnote XML 文件，自动解析 internal-pdf:// 链接，从 <library>.Data/PDF/ 目录匹配 PDF：

多个 PDF 时自动排除 SI/补充材料
默认通过 MinerU 批量转换为 paper.md

导入后自动处理

默认行为（不带 --no-convert）下，导入完成后自动执行完整 pipeline：

批量 PDF→MD：云端模式使用 convert_pdfs_cloud_batch() 批量转换（批次大小由 config.yaml ingest.mineru_batch_size 控制，默认 20）
Abstract 补全：从 markdown 中提取缺失的摘要
TOC + L3 提取：LLM 提取目录结构和结论段
Embed + Index：更新语义向量和全文索引

使用 --no-convert 跳过以上全部后处理（仅导入元数据 + PDF 复制 + embed + index）。

Zotero 导入

支持 Web API 和本地 SQLite 两种模式。

Web API 模式

# 列出 collections
scholaraio import-zotero --api-key KEY --library-id ID --list-collections

# 完整导入
scholaraio import-zotero --api-key KEY --library-id ID

# 仅导入指定 collection
scholaraio import-zotero --api-key KEY --library-id ID --collection COLLECTION_KEY

# 导入后将 collections 创建为工作区
scholaraio import-zotero --api-key KEY --library-id ID --import-collections

本地 SQLite 模式

scholaraio import-zotero --local /path/to/zotero.sqlite

配置文件（可选）

在 config.local.yaml 中配置 Zotero 凭据：

zotero:
  api_key: "your-zotero-api-key"
  library_id: "your-library-id"

补充 PDF（单篇）

已有本地 PDF 文件时：

scholaraio attach-pdf <paper-id> <path/to/paper.pdf> [--force]

当前网络环境有正版访问权限、希望从出版社页面或 DOI 拉取 PDF 时：

# 拉取新论文 PDF 到 configured inbox；校园网直连时加 --direct 可绕过代理环境变量
scholaraio fetch-pdf 10.xxxx/example --direct

# 拉取后立刻走普通论文入库流程
scholaraio fetch-pdf 10.xxxx/example --direct --ingest

# 为已入库单篇论文重新拉取 canonical PDF；已有 PDF 时需要 --force
scholaraio fetch-pdf --paper <paper-id> --direct --force

# 为指定多篇论文批量重新拉取 PDF
scholaraio fetch-pdf --paper <paper-id-1> <paper-id-2> --direct --force

# 批量为全库论文重新拉取 PDF
scholaraio fetch-pdf --all --direct --force

批量补转 PDF（已入库论文）

对已入库但缺少 paper.md 的论文（如首次导入时用了 --no-convert），可通过 Python 调用批量转换：

from scholaraio.core.config import load_config
from scholaraio.services.ingest.pipeline import batch_convert_pdfs

cfg = load_config()
stats = batch_convert_pdfs(cfg, enrich=True)

自动扫描 configured papers library 中有 PDF 无 paper.md 的论文，云端模式使用批量 API 转换，完成后运行 abstract backfill + toc + l3 + embed + index。

import

导入外部文献管理工具数据 / 补充 PDF

Endnote 导入

PDF 自动匹配

导入后自动处理

Zotero 导入

Web API 模式

本地 SQLite 模式

配置文件（可选）

补充 PDF（单篇）

批量补转 PDF（已入库论文）

このリポジトリの他の Skills

このリポジトリの他の Skills

导入外部文献管理工具数据 / 补充 PDF

Endnote 导入

PDF 自动匹配

导入后自动处理

Zotero 导入

Web API 模式

本地 SQLite 模式

配置文件（可选）

补充 PDF（单篇）

批量补转 PDF（已入库论文）