com um clique
empirical-paper-writer
// Draft IEEE-style empirical ML/AI papers from a structured research contract. Builds experiment plans, section skeletons, placeholder-safe results, and a near-submission draft without fabricating evidence.
// Draft IEEE-style empirical ML/AI papers from a structured research contract. Builds experiment plans, section skeletons, placeholder-safe results, and a near-submission draft without fabricating evidence.
Writes ML/AI review and survey papers for arXiv using the IEEEtran LaTeX template with verified BibTeX citations. Triggers when the deliverable is a review paper, literature survey, or when an existing LaTeX project needs citation validation or repair.
Verify that Gemini CLI and Claude Code CLI are installed, authenticated, and API-reachable before starting collaboration workflows.
Use the Claude Code CLI as a depth-analysis co-pilot for paper-from-zero. Primary role: claim tree compression, logical hole detection, evidence sufficiency audit, and review/empirical routing judgment. Also supports generic code collaboration as fallback.
Use the Gemini CLI as a breadth-exploration co-pilot for paper-from-zero. Primary role: candidate direction expansion, adjacent work scanning, keyword cluster generation, and alternative framing proposals. Also supports generic code collaboration as fallback.
Post-process LaTeX project prose to improve readability through varied sentence and paragraph lengths. Removes filler phrases and unnecessary transitions while preserving all citations and semantic meaning.
Route a fixed research topic into a rigorous paper-generation workflow. Handles active literature search, innovation framing, contribution/evidence planning, and routes to either the review writer or empirical writer skill.
| name | empirical-paper-writer |
| description | Draft IEEE-style empirical ML/AI papers from a structured research contract. Builds experiment plans, section skeletons, placeholder-safe results, and a near-submission draft without fabricating evidence. |
| metadata | {"short-description":"Experimental paper executor with evidence-first safeguards"} |
Use this skill for novel experimental research papers after the topic and contribution have already been framed.
This skill is the empirical counterpart to ../arxiv-paper-writer.
It reuses the same high-value paper-engine pieces—citation discipline, LaTeX
compilation, source policy, and QA—but changes the paper logic from review to
experiment-driven writing.
../paper-from-zero when available:
brief/topic-brief.mdbrief/contribution-map.yamlbrief/evidence-matrix.csvplan/outline-contract.mdmain.tex (placeholder-safe draft)ref.bibpaper.config.yaml (includes runtime.* for experiment execution)plan/<timestamp>-<slug>.mdissues/<timestamp>-<slug>.csvnotes/literature-notes.mdnotes/innovation/ (candidates + decision log + evidence links)notes/design/ CSV artifacts (baselines.csv, method-components.csv, experiment-matrix.csv)experiments/ code scaffold (PyTorch skeleton; not executed by this skill)main.pdf after compile/QA when LaTeX is availablemain.tex until plan approved and issues CSV exists.planned, placeholder, or verified.python3 scripts/bootstrap_ieee_empirical_paper.py --stage kickoff --topic "<topic>" --layout project
--layout project creates <project>/paper/ (LaTeX + issues) and <project>/experiments/ (code scaffold).
For a lighter entrypoint with an outline-only plan:
python3 scripts/bootstrap_ieee_empirical_paper.py --stage outline --topic "<topic>" --layout project
main.tex with headings, bullet placeholders, experiment slots, and seed citations.After the user approves the initial plan, design the method and experiments before creating the issues CSV. This phase produces structured CSV artifacts.
Step 1: Systematic Baseline Identification
notes/design/baselines.csv (see assets/baselines-template.csv).selected=yes with reason.references/experiment-design.md Section 1 for selection criteria.Step 2: Innovation Module Design
notes/design/method-components.csv (see assets/method-components-template.csv).is_novel=yes for novel components; define replaceable_by for ablation.Step 3: Pipeline Architecture Design
Step 4: Comparison Experiment Matrix
notes/design/experiment-matrix.csv with type=main_comparison (see assets/experiment-matrix-template.csv).Step 5: Ablation Experiment Design
method-components.csv, identify factors by ablation_priority:
experiment-matrix.csv with type=ablation.Step 6: Robustness & Efficiency Analysis Plan
experiment-matrix.csv with type=robustness and type=efficiency.Phase 0.5 outputs (saved in notes/design/):
baselines.csv — baseline & SOTA competitive landscapeexperiment-matrix.csv — full experiment design matrixmethod-components.csv — pipeline component inventorySTOP until the user confirms the design artifacts before proceeding to Gate 1.
python3 scripts/bootstrap_ieee_empirical_paper.py --stage issues --topic "<topic>" --with-literature-notes --layout project
python3 scripts/validate_empirical_paper_issues.py <paper_dir>/issues/<timestamp>-<slug>.csv
For --layout project, <paper_dir> is <project>/paper.Before starting the writing loop, ensure citation coverage is adequate.
ref.bib. If < 25, trigger enrichment.baselines.csv must have at
least one corresponding entry in ref.bib.method-components.csv,
find 1-2 papers that motivate the design choice (prior art or the gap it fills).ref.bib has ≥ 25 verified entries. Target 30-40
for the finished paper (empirical papers need fewer than reviews, but 14 is
universally too low).For each issue:
verified: backed by real evidenceplaceholder: reserved for future real evidenceplanned: the experiment is designed but not yet filled in(hypothesis) / [Pending: ...] tags).Depends_On column are already DONE or SKIP. If any dependency is still TODO or DOING, the current issue MUST NOT be marked DONE. This rule is non-negotiable.python3 ../arxiv-paper-writer/scripts/issue_workflow.py --project-dir <paper_dir> render-skeleton --issues <issues.csv> --issue-id <Wx> to render a LaTeX section skeleton for a Writing issue.--apply-if-missing only when the full section path is entirely absent from main.tex; nested insertion under an existing parent stays manual.python3 ../arxiv-paper-writer/scripts/issue_workflow.py --project-dir <paper_dir> audit --issues <issues.csv> to check section-path consistency, citation counts, placeholders, and lightweight figure/page signals.After all experiment design issues (E0-E4) and experiment code issues (E5-E7) are DONE:
conda activate <runtime.conda_env>
cd <project_dir>/experiments
python run_all.py --config configs/<config>.yaml
results-backfill SKILL after experiments complete.After all writing issues (W1-W7) reach DONE, resolve structural diagrams. These are non-result figures—architecture, pipeline, formulation diagrams that depend on method design, not on experiment outcomes.
Step 1: Identify required figures
Scan main.tex for \fbox{...placeholder...}. Classify each:
Step 2: Generate structural TikZ figures For each structural placeholder:
notes/design/method-components.csv to extract component names,
is_novel flags, and data-flow edges.references/figure-generation-guide.md..tikz file under paper/figures/ and replace the \fbox
with \input{figures/<name>.tikz}.fig:teaser): problem setting +
where the method fits. Place in Introduction.fig:method): pipeline with components,
novel parts highlighted. Place in Method.fig:formulation): MDP / state
machine / optimization flow. Place in Problem Formulation or Method.Step 3: Visual issues tracking Use V-prefixed issues (V1, V2, ...) in the issues CSV for each figure. Mark DONE only when the TikZ compiles and is referenced in text.
See references/figure-generation-guide.md for TikZ patterns and style rules.
Gate: All structural \fbox placeholders must be resolved before
Phase 2.5. Result-dependent \fbox pass through to Phase 2.5.
After experiment issues (E*) reach verified status, perform a systematic
upgrade pass. This phase has four mandatory steps.
Step 1: Claim Analysis (mandatory)
For each contribution claim (C0, C1, C2, ...):
experiment-matrix.csv.result_status for ALL supporting experiment rows.| Evidence state | Action |
|---|---|
| ALL experiments verified | Upgrade (hypothesis) → bounded factual claim with specific numbers |
| SOME verified, SOME planned | Upgrade the verified part; note remaining gaps explicitly |
| NONE verified | Keep as (hypothesis) |
See references/abstract-conclusion-guide.md for the claim-upgrade
decision tree and safe-language patterns.
Step 2: Result-dependent Figure Resolution (mandatory)
For each remaining \fbox{...placeholder...} in main.tex:
paper/results/.\fbox with an explicit text marker
[Figure pending: <experiment_id> not yet verified].Step 3: Section Back-fill (mandatory)
For each experiment-matrix.csv row with result_status=verified:
main.tex contains actual
results or is still a skeleton.[Results pending: <experiment_id>].Step 4: Abstract & Conclusion Completion (mandatory)
Abstract (see references/abstract-conclusion-guide.md):
Conclusion (see references/abstract-conclusion-guide.md):
(hypothesis), state it as future work,
not as a finding.Gate: Do not proceed to Rhythm Refinement (Phase 2.7) until:
\fbox placeholders are resolved or explicitly marked pending.(hypothesis) tags remain for claims with verified evidence.After all writing issues are DONE, refine prose section-by-section using the latex-rhythm-refiner skill. This step varies sentence/paragraph lengths and removes filler phrases while preserving all citations.
../arxiv-paper-writer/references/quality-report.md).python3 ../arxiv-paper-writer/scripts/issue_workflow.py --project-dir <paper_dir> audit --issues <issues.csv> --fail-on-issuespython3 ../arxiv-paper-writer/scripts/source_ranker.py --project-dir <paper_dir> rankpython3 ../arxiv-paper-writer/scripts/citation_policy.py --project-dir <paper_dir> audit-bibpython3 ../arxiv-paper-writer/scripts/citation_policy.py --project-dir <paper_dir> audit-tex --issues <issues.csv>python3 ../arxiv-paper-writer/scripts/style_profile.py --project-dir <paper_dir> check-draft (if using style_mode=target_venue)python3 ../arxiv-paper-writer/scripts/compile_paper.py --project-dir <paper_dir> --check-warnings --fail-on-warningspython3 ../arxiv-paper-writer/scripts/citation_policy.py --project-dir <paper_dir> lint-bib --fail-on-lintOverfull \hbox warnings in main.log.main.tex, ref.bib, figures, and main.pdf.Before running any experiment or utility script:
paper.config.yaml → runtime.conda_env or runtime.python.conda_env is set, activate it: conda activate <env_name>.python is set, use that interpreter directly.Compilation: python3 ../arxiv-paper-writer/scripts/compile_paper.py --project-dir <paper_dir> --check-warnings --fail-on-warnings (exit 0).
Quality Metrics:
\fbox placeholders in main.texDONE or SKIPverified or explicitly placeholder(hypothesis) tags for claims with verified evidenceverified for a result that is actually placeholder.ref.bib.DONE only when criteria met.Fix Overfull \hbox warnings before marking issues DONE:
figure + \columnwidth; switch to figure* + \textwidth if neededp{...} column widths / \tabcolsep over \resizeboxsplit, multline, aligned, or IEEEeqnarray for line-breakingThe empirical issues CSV uses an 18-column schema with experiment-specific fields.
| Column | Purpose |
|---|---|
| ID | Issue identifier with phase prefix (R/E/W/RF/Q + number) |
| Phase | One of: Research, Experiment, Writing, Refinement, QA |
| Title | Short description of the deliverable |
| Section_Path | Target section in main.tex (e.g., Introduction > Contributions) |
| Claim_ID | Links to evidence-matrix claim (e.g., C1, C2) |
| Evidence_Type | n/a, citation, experiment, figure, table, mixed |
| Experiment_ID | Links to experiment matrix (e.g., EXP-1) |
| Result_Status | n/a, planned, placeholder, verified |
| Description | Detailed scope of the issue |
| Source_Policy | core, standard, frontier (for citation sourcing) |
| Target_Citations | Minimum citations expected for this issue |
| Visualization | Required figure/table description |
| Acceptance | Criteria for marking DONE |
| Status | TODO, DOING, DONE, SKIP |
| Verified_Citations | Actual verified citation count |
| Depends_On | Semicolon-separated issue IDs that must complete first |
| Must_Verify | yes/no: whether this issue requires evidence verification |
| Notes | Free-form notes |
Phase prefixes: R (Research), E (Experiment), W (Writing), RF (Refinement), Q (QA).
Schema validated by scripts/validate_empirical_paper_issues.py.
../arxiv-paper-writer/scripts/arxiv_registry.py../arxiv-paper-writer/scripts/compile_paper.py../arxiv-paper-writer/scripts/citation_policy.py../arxiv-paper-writer/scripts/source_ranker.py../arxiv-paper-writer/scripts/style_profile.pyreferences/experiment-design.md (baseline selection, experiment matrix patterns, ablation design, statistical rigor)references/figure-generation-guide.md (TikZ patterns for structural diagrams)references/abstract-conclusion-guide.md (abstract template, conclusion template, claim upgrade decision tree)references/research-workflow.mdreferences/experiment-evidence.mdreferences/results-writing.mdreferences/reviewer-loop.mdreferences/reproducibility-checklist.mdreferences/fork-extend-workflow.md../arxiv-paper-writer/references/:
bibtex-guide.mdcitation-workflow.mdquality-report.mdtemplate-usage.mdvisual-templates.mdwriting-style.md