Exécutez n'importe quel Skill dans Manus
en un clic

Exécutez n'importe quel Skill dans Manus en un clic

$pwd:

paper-research

Name: Paper Research
Author: LiYu0524

// End-to-end paper research support for arXiv/literature surveys, reproducibility-focused paper shortlisting, and experiment design. Use when you need to (1) search arXiv with complex queries, (2) download PDFs, extract text/sections, and fetch BibTeX, (3) dedupe/cluster results into a structured report, and (4) turn findings into a lit-review plan, benchmark/evaluation suite, and representation/probing experiment checklist (e.g., implicit reasoning, hidden-CoT, multilingual reasoning, cross-lingual alignment).

Exécuter dans Manus

$ git log --oneline --stat

stars:11

forks:0

updated:4 mars 2026 à 11:44

Explorateur de fichiers

14 fichiers

SKILL.md

readonly

name

paper-research

description

End-to-end paper research support for arXiv/literature surveys, reproducibility-focused paper shortlisting, and experiment design. Use when you need to (1) search arXiv with complex queries, (2) download PDFs, extract text/sections, and fetch BibTeX, (3) dedupe/cluster results into a structured report, and (4) turn findings into a lit-review plan, benchmark/evaluation suite, and representation/probing experiment checklist (e.g., implicit reasoning, hidden-CoT, multilingual reasoning, cross-lingual alignment).

Paper Research

Overview

Run a fast, reproducible “survey → shortlist → synthesize” loop for research topics, backed by small scripts that fetch arXiv metadata/PDFs/BibTeX, extract text, and generate structured Markdown briefs.

Quick start (recommended workflow)

Create a topic workspace directory (keep everything together):
- Example: notes/implicit-reasoning-survey/
Search arXiv and (optionally) download PDFs:
- Run: python3 scripts/arxiv_survey.py --terms "implicit reasoning" "hidden chain-of-thought" "multilingual reasoning" --max-results 100 --download-pdfs --pdf-dir ./pdfs --out ./arxiv.jsonl
Extract text (+ rough sections) from PDFs:
- Run: python3 scripts/pdf_extract.py --pdf-dir ./pdfs --out-dir ./texts --sections
Fetch BibTeX for the found arXiv IDs:
- Run: python3 scripts/arxiv_bibtex.py --from-jsonl ./arxiv.jsonl --out ./refs.bib
Generate a structured research brief (table + clusters + TODO slots for notes):
- Run: python3 scripts/generate_report.py --jsonl ./arxiv.jsonl --out ./REPORT.md

Then ask Codex to synthesize (taxonomy/benchmarks/experiments) using REPORT.md + your notes.

Workflow decision tree

A) “I need a lit review plan + paper outline”

Do this:

Use the scripts to produce REPORT.md (table + clusters) and refs.bib.
Build a survey plan as a set of falsifiable questions + “what evidence would change my mind”.
Output deliverables (in this order):
- Lit review plan (subtopics → why → representative papers to read first)
- Benchmarks/metrics (existing + proposed) aligned to the hypothesis
- Validation experiments (including representation/probing/interventions)
- Paper outline + expected contributions

When relevant, include “fastest path to reproduce” (datasets, eval harnesses, probing code).

B) “I need a reproducibility-first shortlist”

Prioritize:

Open-source repos (training recipe, evaluation harness, probing code)
Clear protocol (hyperparams, seeds, compute, preprocessing)
Reusable artifacts (scripts, configs, checkpoints, datasets)

Do this:

Run arxiv_survey.py with stricter terms and fewer results (e.g., 30–80).
Ask Codex to rank papers in REPORT.md by reproducibility criteria:
- Code availability, license clarity, dataset accessibility, protocol completeness
Produce:
- Ranked shortlist with repo links (if available)
- “Reusable parts” per paper (eval harness / probing / training recipe)
- Minimal reproduction plan (timeboxed: 2h / 1d / 1w)

C) “I need an evaluation suite + detection experiments (multilingual latent reasoning)”

Use this structure:

Hypothesis → operational definition (what counts as “English latent reasoning”)
Tasks:
- Multi-step reasoning across languages (same semantics, different surface forms)
- Translation-free reasoning (language-neutral, symbol-heavy, or synthetic)
- Controlled prompts enforcing target-language output
Metrics that separate reasoning vs fluency:
- Task accuracy, step-consistency proxies, calibration, controllability, latency
Representation-level detection:
- Layer-wise language ID / probing on activations
- Activation patching/interventions (swap “language subspace” signals)
- Forced-language and mixed-language ablations
Expected signatures + failure modes (confounds: translation, tokenization, data mixture)

Use assets/experiment_checklist.md as the backbone checklist.

Templates (assets/)

Copy and fill these as working docs:

assets/research_brief.md → one-topic brief (taxonomy + top papers + open questions)
assets/paper_comparison_table.md → consistent per-paper extraction fields
assets/experiment_checklist.md → step-by-step experimental checklist

Scripts

All scripts are pure-Python (stdlib) where possible. pdf_extract.py supports optional extractors; if none are available, it prints a clear install hint.

`scripts/arxiv_survey.py`

Search arXiv via the official Atom API, write results to JSONL, and optionally download PDFs.

`scripts/arxiv_bibtex.py`

Fetch BibTeX from arxiv.org for a list of arXiv IDs or a JSONL produced by arxiv_survey.py.

`scripts/pdf_extract.py`

Extract text from PDFs into .txt and optionally produce rough section splits (heuristics).

`scripts/dedupe_jsonl.py`

Dedupe a JSONL file by arxiv_id and near-duplicate titles (useful when iterating queries).

`scripts/generate_report.py`

Generate a structured Markdown report (table + clusters + TODO note slots) from arxiv.jsonl.

References

Read when you need query patterns or a report schema:

references/arxiv_query_guide.md
references/report_fields.md

Output quality bar (what “good” looks like)

Prefer explicit assumptions + failure modes over broad claims.
Prefer checklists and protocols over vague “future work”.
Always separate: (1) claim, (2) evidence, (3) test that could falsify it.

related-skills.json

même dépôt

auto-research.md

from "LiYu0524/Auto-Reasearch-Skills"

一站式学术研究工作流：论文检索与阅读(arXiv + Zotero)、文献综述写作(Google Docs)、论文精读与审稿(paper-reviewer)、学术写作Prompt工具箱(academic-writing)、学术插图生成(PaperBanana)、架构图绘制(draw.io)、演示文稿制作(python-pptx / Pencil)。整合 paper-research、paper-reviewer、academic-writing、google-docs、paper-banana、drawio、zotero-mcp、pptx 八大子技能。

2026-03-0911

academic-writing.md

from "LiYu0524/Auto-Reasearch-Skills"

学术论文写作 Prompt 工具箱：中英翻译、润色、缩写/扩写、逻辑检查、去 AI 味、实验分析、图表标题生成、架构图描述、实验绘图推荐、Reviewer 视角审稿。来源：awesome-ai-research-writing（MSRA / Seed / SH AI Lab 等顶尖研究机构实战 Prompt）。

2026-03-0911

paper-reviewer.md

from "LiYu0524/Auto-Reasearch-Skills"

Review research papers (especially PDFs). Use when the user asks to read/通读/讲解/总结/审稿 a paper and wants a Chinese-first explanation of what it does, what is novel (创新点), plus reviewer-style strengths/weaknesses, major/minor concerns, and questions to authors.

2026-03-0811

paper-banana.md

from "LiYu0524/Auto-Reasearch-Skills"

学术插图生成 - 使用 PaperBanana 多智能体框架从方法文本自动生成框架图和统计图

2026-03-0611

google-docs.md

from "LiYu0524/Auto-Reasearch-Skills"

Manage Google Docs and Google Drive with full document operations and file management. Includes Markdown support for creating formatted documents with headings, bold, italic, lists, tables, and checkboxes. Also supports Drive operations (upload, download, share, search).

2026-03-0411

package.json

"author": "LiYu0524"

"repository": "LiYu0524/Auto-Reasearch-Skills"

Ouvrir le dépôt GitHub Voir les dépôts du créateur

$ install --global

$ download --local

Exécuter dans Manus

$ useful --forSOC

Développeurs de logicielsProfessions informatiques et mathématiques15-1252L4

name

paper-research

description

Paper Research

Overview

Quick start (recommended workflow)

Create a topic workspace directory (keep everything together):
- Example: notes/implicit-reasoning-survey/
Search arXiv and (optionally) download PDFs:
- Run: python3 scripts/arxiv_survey.py --terms "implicit reasoning" "hidden chain-of-thought" "multilingual reasoning" --max-results 100 --download-pdfs --pdf-dir ./pdfs --out ./arxiv.jsonl
Extract text (+ rough sections) from PDFs:
- Run: python3 scripts/pdf_extract.py --pdf-dir ./pdfs --out-dir ./texts --sections
Fetch BibTeX for the found arXiv IDs:
- Run: python3 scripts/arxiv_bibtex.py --from-jsonl ./arxiv.jsonl --out ./refs.bib
Generate a structured research brief (table + clusters + TODO slots for notes):
- Run: python3 scripts/generate_report.py --jsonl ./arxiv.jsonl --out ./REPORT.md

Then ask Codex to synthesize (taxonomy/benchmarks/experiments) using REPORT.md + your notes.

Workflow decision tree

A) “I need a lit review plan + paper outline”

Do this:

Use the scripts to produce REPORT.md (table + clusters) and refs.bib.
Build a survey plan as a set of falsifiable questions + “what evidence would change my mind”.
Output deliverables (in this order):
- Lit review plan (subtopics → why → representative papers to read first)
- Benchmarks/metrics (existing + proposed) aligned to the hypothesis
- Validation experiments (including representation/probing/interventions)
- Paper outline + expected contributions

When relevant, include “fastest path to reproduce” (datasets, eval harnesses, probing code).

B) “I need a reproducibility-first shortlist”

Prioritize:

Open-source repos (training recipe, evaluation harness, probing code)
Clear protocol (hyperparams, seeds, compute, preprocessing)
Reusable artifacts (scripts, configs, checkpoints, datasets)

Do this:

Run arxiv_survey.py with stricter terms and fewer results (e.g., 30–80).
Ask Codex to rank papers in REPORT.md by reproducibility criteria:
- Code availability, license clarity, dataset accessibility, protocol completeness
Produce:
- Ranked shortlist with repo links (if available)
- “Reusable parts” per paper (eval harness / probing / training recipe)
- Minimal reproduction plan (timeboxed: 2h / 1d / 1w)

C) “I need an evaluation suite + detection experiments (multilingual latent reasoning)”

Use this structure:

Hypothesis → operational definition (what counts as “English latent reasoning”)
Tasks:
- Multi-step reasoning across languages (same semantics, different surface forms)
- Translation-free reasoning (language-neutral, symbol-heavy, or synthetic)
- Controlled prompts enforcing target-language output
Metrics that separate reasoning vs fluency:
- Task accuracy, step-consistency proxies, calibration, controllability, latency
Representation-level detection:
- Layer-wise language ID / probing on activations
- Activation patching/interventions (swap “language subspace” signals)
- Forced-language and mixed-language ablations
Expected signatures + failure modes (confounds: translation, tokenization, data mixture)

Use assets/experiment_checklist.md as the backbone checklist.

Templates (assets/)

Copy and fill these as working docs:

assets/research_brief.md → one-topic brief (taxonomy + top papers + open questions)
assets/paper_comparison_table.md → consistent per-paper extraction fields
assets/experiment_checklist.md → step-by-step experimental checklist

Scripts

All scripts are pure-Python (stdlib) where possible. pdf_extract.py supports optional extractors; if none are available, it prints a clear install hint.

`scripts/arxiv_survey.py`

Search arXiv via the official Atom API, write results to JSONL, and optionally download PDFs.

`scripts/arxiv_bibtex.py`

Fetch BibTeX from arxiv.org for a list of arXiv IDs or a JSONL produced by arxiv_survey.py.

`scripts/pdf_extract.py`

Extract text from PDFs into .txt and optionally produce rough section splits (heuristics).

`scripts/dedupe_jsonl.py`

Dedupe a JSONL file by arxiv_id and near-duplicate titles (useful when iterating queries).

`scripts/generate_report.py`

Generate a structured Markdown report (table + clusters + TODO note slots) from arxiv.jsonl.

References

Read when you need query patterns or a report schema:

references/arxiv_query_guide.md
references/report_fields.md

Output quality bar (what “good” looks like)

Prefer explicit assumptions + failure modes over broad claims.
Prefer checklists and protocols over vague “future work”.
Always separate: (1) claim, (2) evidence, (3) test that could falsify it.

paper-research

Paper Research

Overview

Quick start (recommended workflow)

Workflow decision tree

A) “I need a lit review plan + paper outline”

B) “I need a reproducibility-first shortlist”

C) “I need an evaluation suite + detection experiments (multilingual latent reasoning)”

Templates (assets/)

Scripts

scripts/arxiv_survey.py

scripts/arxiv_bibtex.py

scripts/pdf_extract.py

scripts/dedupe_jsonl.py

scripts/generate_report.py

References

Output quality bar (what “good” looks like)

Plus depuis ce dépôt

Plus depuis ce dépôt

Paper Research

Overview

Quick start (recommended workflow)

Workflow decision tree

A) “I need a lit review plan + paper outline”

B) “I need a reproducibility-first shortlist”

C) “I need an evaluation suite + detection experiments (multilingual latent reasoning)”

Templates (assets/)

Scripts

scripts/arxiv_survey.py

scripts/arxiv_bibtex.py

scripts/pdf_extract.py

scripts/dedupe_jsonl.py

scripts/generate_report.py

References

Output quality bar (what “good” looks like)

`scripts/arxiv_survey.py`

`scripts/arxiv_bibtex.py`

`scripts/pdf_extract.py`

`scripts/dedupe_jsonl.py`

`scripts/generate_report.py`

`scripts/arxiv_survey.py`

`scripts/arxiv_bibtex.py`

`scripts/pdf_extract.py`

`scripts/dedupe_jsonl.py`

`scripts/generate_report.py`