| name | paper-expert-generator |
| description | Generate a specialized domain-expert research agent modeled on PaperClaw architecture. Use this skill when a user wants to create an AI agent that can automatically search, filter, summarize, and evaluate academic papers in a specific research field. Trigger phrases include help me create a paper tracking agent for my field, I want an agent to monitor latest papers in bioinformatics, build me a paper review agent for computer vision, create a PaperClaw-style agent for my domain, generate a domain-specific paper expert agent. The generated agent is a complete OpenClaw agent with all required skills (arxiv-search, semantic-scholar, paper-review, daily-search, weekly-report) fully adapted for the target domain. |
Paper Expert Generator
Generate a complete, ready-to-use domain-specific paper expert agent by adapting the PaperClaw architecture for any research field.
Workflow
Step 1: Domain Interview
Collect these details from the user before generating anything. Ask conversationally ā do not dump all questions at once. Start with the most critical ones:
Critical (ask first):
- Research domain ā Primary field (e.g., "bioinformatics", "quantum computing", "computer vision")
- Core topics ā Specific sub-areas or problems (e.g., "protein folding, drug discovery, single-cell sequencing")
- Key methods/techniques ā Central methodologies (e.g., "transformers, GNN, diffusion models, RL")
Important (ask second):
4. Evaluation priorities ā What dimensions matter most for paper quality in this domain?
5. Exclusion topics ā What should be filtered out? (e.g., "finance, social media, NLP")
6. Output location ā Where to create the agent? (default: ~/agents/<domain-slug>/)
Optional (ask only if needed):
7. Notification channel ā Feishu/Lark webhook URL for push notifications
8. LLM config ā API base URL, model name, API key (default: same as PaperClaw models.json)
9. Schedule timezone ā Default is Asia/Singapore
Infer reasonable defaults for anything not provided and confirm before proceeding.
Step 2: Build Keyword Library
Construct a structured keyword library from the domain interview. Aim for:
- Core queries (3ā5): Direct topic+method combinations for arXiv
ti: searches
- Method queries (3ā5): Method+application combinations
- Application queries (2ā3): Use-case-specific terms
- Exclusion keywords (3ā6): Out-of-scope terms to filter
See references/domain-adaptation-guide.md Section 1 for keyword examples across 8 common domains.
Step 3: Design Evaluation Rubric
Design 4 domain-specific scoring dimensions (each scored 1ā10) that replace PaperClaw's SciML dimensions (engineering_value, architecture_innovation, theoretical_contribution, result_reliability).
The scoring formula is unchanged:
final_score = base_score Ć 0.9 + impact_score Ć 0.1
base_score = (dim1 + dim2 + dim3 + dim4) / 4
impact_score = date_citation_adjustment(citations, age_months)
See references/domain-adaptation-guide.md Section 2 for rubric examples by domain.
Step 4: Generate Agent Files
Run the scaffolding script to create the directory structure:
python ~/.comate/skills/paper-expert-generator/scripts/init_domain_agent.py \
--domain "<domain_slug>" \
--output "<output_dir>" \
--paperclaw-skills "<paperclaw_skills_path>"
Example:
python ~/.comate/skills/paper-expert-generator/scripts/init_domain_agent.py \
--domain "bioinfo-ml" \
--output ~/agents/bioinfo-ml \
--paperclaw-skills /work/work/PaperClaw/skills
Generated structure:
<output_dir>/
āāā agent/
ā āāā AGENT.md ā write domain content here
ā āāā models.json ā pre-filled from template
ā āāā schedules.json ā pre-filled from template
āāā skills/
ā āāā arxiv-search/ ā copy from PaperClaw (needs keyword update)
ā āāā semantic-scholar/ ā copy from PaperClaw (no changes needed)
ā āāā paper-review/ ā copy from PaperClaw (needs rubric update)
ā āāā daily-search/ ā copy from PaperClaw (minor text update)
ā āāā weekly-report/ ā copy from PaperClaw (minor text update)
āāā workspace/
āāā evaluated_papers.json ā initialized empty
Step 5: Write AGENT.md
Use assets/templates/AGENT.md.template as the base. The AGENT.md must include:
-
Role Definition ā Domain expert persona with specific depth. Replace SciML expertise with domain-specific expertise (key algorithms, theoretical foundations, benchmark datasets, top venues/conferences).
-
Keyword Library ā Paste structured keywords from Step 2.
-
Four Core Tasks (preserve exact structure from PaperClaw):
- Task 1 (Paper Research): Download PDF ā write
summary.md answering 10 domain-adapted questions
- Task 2 (Paper Evaluation): 4-dimension scoring ā write
scores.md ā update metadata.json ā update registry
- Task 3 (Daily Search): Cron trigger ā
daily_paper_search.py --top 3 ā dedup ā trigger Task 1+2
- Task 4 (Weekly Report): Cron trigger ā
generate_weekly_report_v2.py ā push notification
-
Mandatory <think> Reasoning ā Required in Task 2 evaluation.
-
Dedup Gate ā Always check evaluated_papers.json before starting paper review.
See references/agent-template-guide.md for the full AGENT.md authoring guide.
Step 6: Adapt Skill SKILL.md Files
Minimal adaptation needed ā Python scripts are domain-agnostic:
| Skill | Required changes to SKILL.md |
|---|
arxiv-search | Replace the keyword list with domain keywords from Step 2 |
paper-review | Replace 4 scoring dimensions + update the 10 summary questions |
daily-search | Update domain name in task description text |
weekly-report | Update domain name in report title |
semantic-scholar | No changes needed |
Step 7: Configure models.json and schedules.json
models.json: Edit agent/models.json, fill in:
baseUrl: LLM API endpoint
apiKey: API key placeholder
id and name: Model identifier
schedules.json: Default schedule is pre-filled. Adjust tz field if not in Singapore timezone.
Step 8: Validate and Deliver
Checklist before presenting results:
Then present the output summary (see next section).
Output Summary Format
Always deliver this summary after generation:
## Generated Agent: <Domain Name> Paper Expert
**Domain**: <domain>
**Location**: `<output_dir>`
**Model**: <model_name>
### Keyword Library (<N> total queries)
**Core**: <query1>, <query2>, <query3>
**Methods**: <query1>, <query2>
**Exclusions**: <term1>, <term2>, ...
### Evaluation Rubric
| Dimension | Score Weight | Measures |
|-----------|-------------|---------|
| <dim1> | 25% | ... |
| <dim2> | 25% | ... |
| <dim3> | 25% | ... |
| <dim4> | 25% | ... |
### Schedule
- Daily search: `0 20 * * *` (<timezone>)
- Weekly report: `0 10 * * 0` (<timezone>)
### Quick Start
1. Open OpenClaw ā select agent from `<output_dir>/agent/`
2. Set API key in `agent/models.json`
3. Test: "Search for recent papers on <core_topic>"
4. Or wait for first daily trigger at 20:00
References
references/domain-adaptation-guide.md ā Keyword and rubric examples for 8 common domains
references/agent-template-guide.md ā Full AGENT.md authoring guide with annotated sections
assets/templates/AGENT.md.template ā Base template for the generated AGENT.md
assets/templates/models.json ā Base models config template
assets/templates/schedules.json ā Base schedules config template