Run any Skill in Manus with one click

$pwd:

jmh-benchmark-compare

Name: Jmh Benchmark Compare
Author: eclipse-rdf4j

// Parse JMH result text by finding the first header line that starts with Benchmark and contains Mode and Score, build a structured table for all columns/rows, compare overlapping benchmarks across 2+ files, compute Diff Score and Diff %, filter by deviation or regression thresholds, analyze regressions over time from filename/mtime timestamps, and export sortable reports to txt/md/csv/xlsx/html. Use for benchmark run comparisons, regression triage, and directory-wide historical analysis.

Run Skill in Manus

$ git log --oneline --stat

stars:404

forks:183

updated:April 22, 2026 at 15:59

File Explorer

7 files

SKILL.md

readonly

name

jmh-benchmark-compare

description

Parse JMH result text by finding the first header line that starts with Benchmark and contains Mode and Score, build a structured table for all columns/rows, compare overlapping benchmarks across 2+ files, compute Diff Score and Diff %, filter by deviation or regression thresholds, analyze regressions over time from filename/mtime timestamps, and export sortable reports to txt/md/csv/xlsx/html. Use for benchmark run comparisons, regression triage, and directory-wide historical analysis.

jmh-benchmark-compare

Use this skill when benchmark output comparison must be reproducible, sortable, and exportable.

Quick start

Run two-file comparison:

python3 .codex/skills/jmh-benchmark-compare/scripts/jmh_benchmark_compare.py \
  /path/run-a.txt /path/run-b.txt \
  --export-formats txt,md,csv,xlsx,html \
  --output-dir /tmp \
  --output-base jmh-compare

Sort by diff percent (descending):

python3 .codex/skills/jmh-benchmark-compare/scripts/jmh_benchmark_compare.py \
  run-a.txt run-b.txt \
  --sort-column "Diff % [run-b - run-a]" \
  --sort-desc \
  --export-formats md \
  --output /tmp/jmh-diff.md

Core behavior

Detect first JMH table header line: line.startswith("Benchmark") and "Mode" in line and "Score" in line.
Derive column boundaries from that header.
Parse all following benchmark rows into an internal table.
Match overlapping benchmark keys across files.
Add derived columns: Diff Score [target - baseline], Diff % [target - baseline], Status [...].

Default key columns are all columns except Cnt, Score, Error. Override via --id-columns.

Inputs and overlap

Pass any mix of files and directories.
Directory entries are scanned for files that contain a JMH header.
--overlap-mode all keeps only rows present in all files.
--overlap-mode any keeps rows present in at least two files.
Baseline selection: --baseline <index-or-label>.

Filters and regression shortcuts

Hide tiny deltas: --min-deviation-pct 1.0
Show only regressions above threshold: --regressions-over-pct 3.0
Control direction interpretation: --score-direction auto|higher|lower

Historical analysis

Analyze trends across many runs:

python3 .codex/skills/jmh-benchmark-compare/scripts/jmh_benchmark_compare.py \
  /path/bench-history \
  --recursive \
  --glob "*.txt" \
  --timestamp-source auto \
  --analyze-over-time \
  --regressions-over-pct 2.5 \
  --export-formats html,csv \
  --output-dir /tmp \
  --output-base jmh-history

Timeline report files are emitted with -timeline suffix.

Exports

txt: aligned plain-text table.
md: valid markdown table.
csv: spreadsheet-friendly CSV.
xlsx: native Excel workbook (single sheet). (xslx alias accepted)
html: sortable table (click header), built-in CSS + JS, color theme selector.

If one format and explicit destination needed, use --output /path/file.ext. If multiple formats, use --output-dir + --output-base.

Script

scripts/jmh_benchmark_compare.py

For timestamp parsing behavior and filename examples, see: references/timestamps-and-discovery.md

related-skills.json

same repository

docker-jfr-benchmark-loop.md

from "eclipse-rdf4j/rdf4j"

Run a repeatable RDF4J performance loop against one JMH benchmark in Docker with Linux Java 26 and JFR CPU-time profiling. Use when working in this repo on benchmark-guided performance changes, hotspot triage, JFR reading, CPU bottleneck analysis, or repeated baseline, fix, and rerun loops. Trigger on requests mentioning benchmark, profiling, JFR, hotspot, perf loop, CPU bottleneck, or Docker benchmark runs in RDF4J.

2026-04-22404

high-performance-java.md

from "eclipse-rdf4j/rdf4j"

Use when writing, reviewing, or reshaping HotSpot Java where algorithmic complexity, data-structure choice, throughput, latency, allocation rate, zero-copy, lazy evaluation, non-materialization, runtime specialization, query-engine code generation, Janino, primitive collections, performance libraries, intrinsics, SuperWord auto-vectorization, or C2 assembly matter. Also use for advanced algorithmic problem solving in Java, including dynamic programming, graph/range techniques, cache-aware code shape, and choosing between interpreted, vectorized, and compiled execution paths. Bias toward asymptotic wins first, then the right execution model, then specialized hot-path code, then benchmark and JIT evidence.

2026-04-11404

query-plan-snapshot-cli.md

from "eclipse-rdf4j/rdf4j"

Use QueryPlanSnapshotCli to capture and compare RDF4J query plans, then assess likely performance improvements/regressions from execution verification and semantic plan diffs. Trigger when users ask about optimizer impact, query-plan drift, join algorithm changes, or query performance regressions in testsuites/benchmark.

2026-02-17404

gh-read-inspector.md

from "eclipse-rdf4j/rdf4j"

Retrieve GitHub issues, pull requests, and milestones with read-only, whitelisted `gh` commands only. Use when you need complete issue or PR context, need to resolve a PR from commit ID/PR ID/issue ID, fetch milestone metadata, or list all issues in a milestone (labels, status, assignees, and related fields).

2026-02-12404

mvnf.md

from "eclipse-rdf4j/rdf4j"

Run Maven tests in this repo with a consistent workflow (module clean, root -Pquick clean install to refresh .m2_repo, then module verify or a single test class/method). Use when asked to run tests/verify in the rdf4j multi-module build or when the user says mvnf.

2026-02-11404

debug-surefire.md

from "eclipse-rdf4j/rdf4j"

Debug Maven Surefire unit tests by running them in JDWP "wait for debugger" mode (`-Dmaven.surefire.debug`) and attaching to the forked test JVM using **jdb** (preferred for CLI/agent debugging), IntelliJ, or VS Code. Use when asked to debug/step through a failing JUnit test, attach a debugger to a Maven test run, or run `mvn test -Dtest=Class[#method]` suspended on a port (including multi-module `-pl` runs). The JVM will block at startup until a debugger attaches; the agent should attach with `jdb -attach <host>:<port>` and drive the session from the terminal.

2025-12-25404

package.json

"author": "eclipse-rdf4j"

"repository": "eclipse-rdf4j/rdf4j"

View GitHub Repository View Creator Repositories

$ install --global

$ download --local

Run Skill in Manus

$ useful --forSOC

Software Quality Assurance Analysts and TestersComputer and Mathematical Occupations15-1253L4

name

jmh-benchmark-compare

description

jmh-benchmark-compare

Use this skill when benchmark output comparison must be reproducible, sortable, and exportable.

Quick start

Run two-file comparison:

python3 .codex/skills/jmh-benchmark-compare/scripts/jmh_benchmark_compare.py \
  /path/run-a.txt /path/run-b.txt \
  --export-formats txt,md,csv,xlsx,html \
  --output-dir /tmp \
  --output-base jmh-compare

Sort by diff percent (descending):

python3 .codex/skills/jmh-benchmark-compare/scripts/jmh_benchmark_compare.py \
  run-a.txt run-b.txt \
  --sort-column "Diff % [run-b - run-a]" \
  --sort-desc \
  --export-formats md \
  --output /tmp/jmh-diff.md

Core behavior

Detect first JMH table header line: line.startswith("Benchmark") and "Mode" in line and "Score" in line.
Derive column boundaries from that header.
Parse all following benchmark rows into an internal table.
Match overlapping benchmark keys across files.
Add derived columns: Diff Score [target - baseline], Diff % [target - baseline], Status [...].

Default key columns are all columns except Cnt, Score, Error. Override via --id-columns.

Inputs and overlap

Pass any mix of files and directories.
Directory entries are scanned for files that contain a JMH header.
--overlap-mode all keeps only rows present in all files.
--overlap-mode any keeps rows present in at least two files.
Baseline selection: --baseline <index-or-label>.

Filters and regression shortcuts

Hide tiny deltas: --min-deviation-pct 1.0
Show only regressions above threshold: --regressions-over-pct 3.0
Control direction interpretation: --score-direction auto|higher|lower

Historical analysis

Analyze trends across many runs:

python3 .codex/skills/jmh-benchmark-compare/scripts/jmh_benchmark_compare.py \
  /path/bench-history \
  --recursive \
  --glob "*.txt" \
  --timestamp-source auto \
  --analyze-over-time \
  --regressions-over-pct 2.5 \
  --export-formats html,csv \
  --output-dir /tmp \
  --output-base jmh-history

Timeline report files are emitted with -timeline suffix.

Exports

txt: aligned plain-text table.
md: valid markdown table.
csv: spreadsheet-friendly CSV.
xlsx: native Excel workbook (single sheet). (xslx alias accepted)
html: sortable table (click header), built-in CSS + JS, color theme selector.

If one format and explicit destination needed, use --output /path/file.ext. If multiple formats, use --output-dir + --output-base.

Script

scripts/jmh_benchmark_compare.py

For timestamp parsing behavior and filename examples, see: references/timestamps-and-discovery.md

jmh-benchmark-compare

jmh-benchmark-compare

Quick start

Core behavior

Inputs and overlap

Filters and regression shortcuts

Historical analysis

Exports

Script

More from this repository

jmh-benchmark-compare

Quick start

Core behavior

Inputs and overlap

Filters and regression shortcuts

Historical analysis

Exports

Script

More from this repository