一键在 Manus 中运行任何 Skill

run-benchmark-suite

Run full benchmark suite with ablation sweep, connection sweep, JFR profiling, and doc updates. Use this skill whenever the user wants to run benchmarks, performance tests, load tests, measure throughput or latency, profile with JFR, do a connection sweep, or update benchmark analysis documents — even if they don't use the exact term "benchmark suite".

在 Manus 中运行

概览

安装命令

npx skills add https://github.com/cuioss/OAuthSheriff --skill run-benchmark-suite

复制此命令并粘贴到 Claude Code 中以安装该技能

来源

cuioss/OAuthSheriff

星标0

分支1

更新时间2026年4月1日 12:27

文件资源管理器

2 个文件

SKILL.md

readonly

name	run-benchmark-suite
description	Run full benchmark suite with ablation sweep, connection sweep, JFR profiling, and doc updates. Use this skill whenever the user wants to run benchmarks, performance tests, load tests, measure throughput or latency, profile with JFR, do a connection sweep, or update benchmark analysis documents — even if they don't use the exact term "benchmark suite".
user-invocable	true

Benchmark Suite Runner

Repeatable benchmark execution workflow for all 6 endpoints: health, JWT, mock-jwt, direct-validation, ablation-baseline, ablation-header-only.

When to activate: When the user wants to run benchmarks, collect clean data, or update analysis docs.

Parameters

phase: Which workflow to run. One of sweep, connections, jfr, docs, all (default: all)
For connections: optional comma-separated list of connection counts (default: 50,100,150,200,250,300)

Working Directory

All benchmark commands run from benchmarking/benchmark-integration-wrk/ relative to the project root.

Workflow 1: Full Ablation Sweep (`phase=sweep`)

Navigate to benchmarking/benchmark-integration-wrk/
Run the full benchmark suite:
```
../../mvnw clean verify -Pbenchmark
```
This runs all 6 endpoints at default connections (50) with fresh tokens per run.

Extract results:

grep "Requests/sec" target/benchmark-results/wrk/*.txt

Analyze server logs — load references/log-analysis-checklist.md and follow the checklist against:
- target/quarkus.log
- target/benchmark-results/keycloak-logs-*.txt
- target/benchmark-results/wrk/*.txt (for non-2xx errors)
Report results summary to the user.

Error Handling

If a benchmark run fails mid-sweep (container crash, OOM, non-2xx spike), stop the current workflow, report which iteration failed, and offer to re-run from that point.
If containers are left running after an interrupted sweep, always clean up with the stop script before re-running.
If target/benchmark-results/ contains stale data from a previous run, warn the user before overwriting.

Workflow 2: Connection Sweep (`phase=connections`)

Navigate to benchmarking/benchmark-integration-wrk/

Start containers:

bash ../../oauth-sheriff-quarkus-parent/oauth-sheriff-quarkus-integration-tests/scripts/start-integration-container.sh

Loop over connection counts (default: 50, 100, 150, 200, 250, 300):

../../mvnw verify -Pbenchmark -Dwrk.connections=$CONNS -Dskip.container.lifecycle=true
cp -r target/benchmark-results/wrk target/benchmark-results/wrk-${CONNS}c

Stop containers:

bash ../../oauth-sheriff-quarkus-parent/oauth-sheriff-quarkus-integration-tests/scripts/stop-integration-container.sh

Analyze logs for each connection count using references/log-analysis-checklist.md.
Build comparison table from results across all connection counts.
Report the comparison table to the user.

Workflow 3: JFR Profiling (`phase=jfr`)

Navigate to benchmarking/benchmark-integration-wrk/

Run JFR profiling for JWT (default):

../../mvnw clean verify -Pbenchmark-jfr

Run JFR for other benchmarks (reuse containers). Supported values for -Djfr.benchmark=: jwt (default), health, direct-validation, mock-jwt, ablation-baseline, ablation-header-only:

../../mvnw verify -Pbenchmark-jfr -Djfr.benchmark=direct-validation -Dskip.container.lifecycle=true
../../mvnw verify -Pbenchmark-jfr -Djfr.benchmark=mock-jwt -Dskip.container.lifecycle=true

JFR output is at target/jfr-output/. Analyze with:

jfr print --events ExecutionSample,ObjectAllocationSample,GarbageCollection target/jfr-output/*.jfr

Summarize hot methods, allocation sites, and GC activity.

Workflow 4: Update Docs (`phase=docs`)

Read current benchmark results from target/benchmark-results/.
Update these analysis documents with fresh data:
- Find the most recent benchmarking/doc/Analysis-*-Integration.adoc — connection sweep tables (throughput, avg latency, P99)
- Find the most recent benchmarking/doc/Analysis-*-Latency-Decomposition.adoc — ablation decomposition table + six-layer decomposition
- Find the most recent benchmarking/doc/Analysis-*-JFR-Profiling.adoc — JFR analysis (only if JFR data available)
Preserve existing document structure; only update data tables and numbers.

Workflow 5: All (`phase=all`, default)

Execute workflows 1 through 4 sequentially. Stop and report if any workflow fails.

Key Project Files

File	Purpose
`benchmarking/benchmark-integration-wrk/pom.xml`	Maven profiles: benchmark, benchmark-jfr, quick, stress, max, autoscale
`benchmarking/doc/Analysis-*-Integration.adoc`	Connection sweep data tables (find the most recent)
`benchmarking/doc/Analysis-*-Latency-Decomposition.adoc`	Ablation decomposition (find the most recent)
`benchmarking/doc/Analysis-*-JFR-Profiling.adoc`	JFR analysis (find the most recent)
`oauth-sheriff-quarkus-parent/oauth-sheriff-quarkus-integration-tests/scripts/start-integration-container.sh`	Container lifecycle start (supports COMPOSE_OVERRIDE)
`oauth-sheriff-quarkus-parent/oauth-sheriff-quarkus-integration-tests/scripts/stop-integration-container.sh`	Container lifecycle stop (supports COMPOSE_OVERRIDE)

同仓库更多 Skills

同仓库

semantic-code-analysis

cuioss/OAuthSheriff

Semantic analysis of production code to find security issues, API design problems, dead code, unnecessary complexity, and semantic duplication. Use this skill whenever the user asks for a pre-release audit, API surface review, code cleanup, dead code detection, duplication analysis, security audit, or wants to reduce unnecessary complexity — even if they phrase it differently. Also use for code quality reviews, pre-release checklists, or "what can we clean up" requests.

2026-04-050

来源

cuioss

cuioss/OAuthSheriff

打开 GitHub 仓库查看创作者相关仓库

安装命令

下载

在 Manus 中运行

适用职业SOC

软件质量保证分析师与测试员计算机与数学类职业15-1253L4

name	run-benchmark-suite
description	Run full benchmark suite with ablation sweep, connection sweep, JFR profiling, and doc updates. Use this skill whenever the user wants to run benchmarks, performance tests, load tests, measure throughput or latency, profile with JFR, do a connection sweep, or update benchmark analysis documents — even if they don't use the exact term "benchmark suite".
user-invocable	true

Benchmark Suite Runner

Repeatable benchmark execution workflow for all 6 endpoints: health, JWT, mock-jwt, direct-validation, ablation-baseline, ablation-header-only.

When to activate: When the user wants to run benchmarks, collect clean data, or update analysis docs.

Parameters

phase: Which workflow to run. One of sweep, connections, jfr, docs, all (default: all)
For connections: optional comma-separated list of connection counts (default: 50,100,150,200,250,300)

Working Directory

All benchmark commands run from benchmarking/benchmark-integration-wrk/ relative to the project root.

Workflow 1: Full Ablation Sweep (`phase=sweep`)

Navigate to benchmarking/benchmark-integration-wrk/
Run the full benchmark suite:
```
../../mvnw clean verify -Pbenchmark
```
This runs all 6 endpoints at default connections (50) with fresh tokens per run.

Extract results:

grep "Requests/sec" target/benchmark-results/wrk/*.txt

Analyze server logs — load references/log-analysis-checklist.md and follow the checklist against:
- target/quarkus.log
- target/benchmark-results/keycloak-logs-*.txt
- target/benchmark-results/wrk/*.txt (for non-2xx errors)
Report results summary to the user.

Error Handling

If a benchmark run fails mid-sweep (container crash, OOM, non-2xx spike), stop the current workflow, report which iteration failed, and offer to re-run from that point.
If containers are left running after an interrupted sweep, always clean up with the stop script before re-running.
If target/benchmark-results/ contains stale data from a previous run, warn the user before overwriting.

Workflow 2: Connection Sweep (`phase=connections`)

Navigate to benchmarking/benchmark-integration-wrk/

Start containers:

bash ../../oauth-sheriff-quarkus-parent/oauth-sheriff-quarkus-integration-tests/scripts/start-integration-container.sh

Loop over connection counts (default: 50, 100, 150, 200, 250, 300):

../../mvnw verify -Pbenchmark -Dwrk.connections=$CONNS -Dskip.container.lifecycle=true
cp -r target/benchmark-results/wrk target/benchmark-results/wrk-${CONNS}c

Stop containers:

bash ../../oauth-sheriff-quarkus-parent/oauth-sheriff-quarkus-integration-tests/scripts/stop-integration-container.sh

Analyze logs for each connection count using references/log-analysis-checklist.md.
Build comparison table from results across all connection counts.
Report the comparison table to the user.

Workflow 3: JFR Profiling (`phase=jfr`)

Navigate to benchmarking/benchmark-integration-wrk/

Run JFR profiling for JWT (default):

../../mvnw clean verify -Pbenchmark-jfr

Run JFR for other benchmarks (reuse containers). Supported values for -Djfr.benchmark=: jwt (default), health, direct-validation, mock-jwt, ablation-baseline, ablation-header-only:

../../mvnw verify -Pbenchmark-jfr -Djfr.benchmark=direct-validation -Dskip.container.lifecycle=true
../../mvnw verify -Pbenchmark-jfr -Djfr.benchmark=mock-jwt -Dskip.container.lifecycle=true

JFR output is at target/jfr-output/. Analyze with:

jfr print --events ExecutionSample,ObjectAllocationSample,GarbageCollection target/jfr-output/*.jfr

Summarize hot methods, allocation sites, and GC activity.

Workflow 4: Update Docs (`phase=docs`)

Read current benchmark results from target/benchmark-results/.
Update these analysis documents with fresh data:
- Find the most recent benchmarking/doc/Analysis-*-Integration.adoc — connection sweep tables (throughput, avg latency, P99)
- Find the most recent benchmarking/doc/Analysis-*-Latency-Decomposition.adoc — ablation decomposition table + six-layer decomposition
- Find the most recent benchmarking/doc/Analysis-*-JFR-Profiling.adoc — JFR analysis (only if JFR data available)
Preserve existing document structure; only update data tables and numbers.

Workflow 5: All (`phase=all`, default)

Execute workflows 1 through 4 sequentially. Stop and report if any workflow fails.

Key Project Files

File	Purpose
`benchmarking/benchmark-integration-wrk/pom.xml`	Maven profiles: benchmark, benchmark-jfr, quick, stress, max, autoscale
`benchmarking/doc/Analysis-*-Integration.adoc`	Connection sweep data tables (find the most recent)
`benchmarking/doc/Analysis-*-Latency-Decomposition.adoc`	Ablation decomposition (find the most recent)
`benchmarking/doc/Analysis-*-JFR-Profiling.adoc`	JFR analysis (find the most recent)
`oauth-sheriff-quarkus-parent/oauth-sheriff-quarkus-integration-tests/scripts/start-integration-container.sh`	Container lifecycle start (supports COMPOSE_OVERRIDE)
`oauth-sheriff-quarkus-parent/oauth-sheriff-quarkus-integration-tests/scripts/stop-integration-container.sh`	Container lifecycle stop (supports COMPOSE_OVERRIDE)

run-benchmark-suite

Benchmark Suite Runner

Parameters

Working Directory

Workflow 1: Full Ablation Sweep (phase=sweep)

Error Handling

Workflow 2: Connection Sweep (phase=connections)

Workflow 3: JFR Profiling (phase=jfr)

Workflow 4: Update Docs (phase=docs)

Workflow 5: All (phase=all, default)

Key Project Files

同仓库更多 Skills

Benchmark Suite Runner

Parameters

Working Directory

Workflow 1: Full Ablation Sweep (phase=sweep)

Error Handling

Workflow 2: Connection Sweep (phase=connections)

Workflow 3: JFR Profiling (phase=jfr)

Workflow 4: Update Docs (phase=docs)

Workflow 5: All (phase=all, default)

Key Project Files

同仓库更多 Skills

Workflow 1: Full Ablation Sweep (`phase=sweep`)

Workflow 2: Connection Sweep (`phase=connections`)

Workflow 3: JFR Profiling (`phase=jfr`)

Workflow 4: Update Docs (`phase=docs`)

Workflow 5: All (`phase=all`, default)

Workflow 1: Full Ablation Sweep (`phase=sweep`)

Workflow 2: Connection Sweep (`phase=connections`)

Workflow 3: JFR Profiling (`phase=jfr`)

Workflow 4: Update Docs (`phase=docs`)

Workflow 5: All (`phase=all`, default)