mit einem Klick
setup
// Use when the user wants to install, configure, diagnose, or troubleshoot ScholarAIO, including setup check, dependency status, API keys, and bilingual setup flow.
// Use when the user wants to install, configure, diagnose, or troubleshoot ScholarAIO, including setup check, dependency status, API keys, and bilingual setup flow.
Use when the user wants to rebuild or refresh ScholarAIO keyword, full-text, FTS5, FAISS, or semantic search indexes after data or metadata changes.
Use when the user wants to find academic papers, search the local library, run keyword or semantic search, search by author, explore topics, or federate across library, explore databases, and arXiv.
Use when the user wants to import papers from Endnote XML/RIS, Zotero Web API or local SQLite, attach PDFs, match PDFs to records, or supplement records with PDF content.
Use when the user wants to process new papers, patents, theses, documents, or proceedings from inbox queues into the knowledge base, run the ingest pipeline, or rebuild indexes.
Use when the user wants diagrams, flowcharts, architecture visuals, data relationships, timelines, concept maps, Mermaid, Graphviz, drawio, or polished paper figures generated from structured text or IR.
Use when the user wants Paper2Any-based paper-to-figure, PPT, poster, video, citation, rebuttal, DrawIO, mindmap, image, PDF-to-PPT, or KB workflows through the ScholarAIO Paper2Any MCP sidecar.
| name | setup |
| description | Use when the user wants to install, configure, diagnose, or troubleshoot ScholarAIO, including setup check, dependency status, API keys, and bilingual setup flow. |
当用户需要配置、安装、初始化 ScholarAIO 时,按以下流程操作:
scholaraio setup check --lang zh
阅读输出,了解哪些组件已就绪、哪些缺失。 如果用户明确是在让 agent 代为配置,而不是自己逐步操作:
scholaraio setup check --lang zh默认把 setup 分成两层:
config.yaml、LLM key、PDF 解析器选择、MinerU token、contact_emailSemantic Scholar API key、Zotero API key、非默认 LLM backend / base_url 等按需项目执行顺序要求:
对每个附加项,agent 必须按同一模板说明:
embed: 语义向量检索(Qwen3 嵌入模型)topics: BERTopic 主题建模import: Endnote / Zotero 导入Graphviz dot: diagram --format svg 的 DOT→SVG 渲染后端;Linux 可用 sudo apt-get install graphvizInkscape: Beamer \includesvg 插入 SVG 时需要;Linux 可用 sudo apt-get install inkscapefull: 全部功能pip install -e ".[full]" 或按需安装scholaraio setup 交互式向导自动创建MinerU 还是 DoclingMinerU 服务、mineru-open-api、MinerU token 状态,以及 https://huggingface.co 可达性MinerU;这里的“MinerU 路径可走”包括:本地服务可达,或 mineru-open-api 已安装且可继续走免费 token 的云端路径mineru-open-api / 免费 token 云路径也走不通时,才改为优先建议 Doclingmineru-open-api 是否存在MinerU 云端方案时提示。要明确说明:MinerU token 是免费的,只需要注册并申请;优先使用 MINERU_TOKEN,MINERU_API_KEY 只保留兼容config.local.yaml(不进 git)完成核心配置后,再问一次:
Semantic Scholar API key
ingest.s2_api_key 或环境变量 S2_API_KEYZotero API key
import-zotero 的 Web API 路径,而不是本地 zotero.sqlitezotero.sqlite 导入zotero.api_key 或环境变量 ZOTERO_API_KEYconfig.yaml 的 llm.backend / model / base_urlmineru_model_version_cloud
pipeline 或 vlmMinerU-HTML;那是 HTML 解析专用,不是 PDF 默认路径mineru_parse_method
parse_method 请求字段ocr 时映射为官方 file.is_ocr=trueauto / txt 默认都按“不强制 OCR”处理,不要过度解释成不同云端模式mineru_enable_formula / mineru_enable_table / mineru_lang
pipeline / vlm 有效mineru_batch_size
mineru_backend_local
MinerU,默认先说明:云端路径可直接继续,且免费 token 可注册申请pip install docling,以及 Linux CPU-only 场景的官方安装示例scholaraio setup 里应尽量直接实现:
MinerU cloud、Hugging Face、以及 localhost:8000 这类连通性测试:
localhost 不一定等于用户宿主机的 localhostcurl -I --max-time 10 http://localhost:8000curl -I --max-time 10 https://mineru.net/apiManage/tokencurl -I --max-time 10 https://huggingface.cosetup 过程中,agent 不要只说“需要 key”,必须同时说明:
LLM API key:通常单独计费;不配则降级MinerU token:免费申请;不配仍可走本地 MinerU / Docling / PyMuPDFcontact_email:免费简短答法模板:
LLM API key:用于元数据提取和内容富化;不配会降级为纯正则,通常单独计费MinerU token:用于 MinerU 云端解析;免费,不配仍可走本地 MinerU / Docling / PyMuPDFcontact_email:用于 Crossref polite pool;免费,不配通常只是请求识别度和服务礼貌性更弱Semantic Scholar API key:用于认证访问;不配多数端点仍可用,但部分端点需要 keyZotero API key:用于当前的 Zotero Web API 导入路径;不配就改走本地 zotero.sqlitescholaraio setup check 后如果目录缺失,运行任意 scholaraio 命令会自动创建(ensure_dirs())配置完成后再次运行 scholaraio setup check 确认所有项目 [OK]。
scholaraio setup 进入交互式向导(bilingual EN/ZH)config.local.yaml 存放敏感信息(API key),不进 git