Jeden Skill in Manus ausführen
mit einem Klick

Jeden Skill in Manus mit einem Klick ausführen

Loslegen

sim-compare

Sterne17.720

Forks1.694

Aktualisiert22. März 2026 um 22:58

Run cache policy hit rate comparison across multiple cache sizes with charts

Installation

Mit Codex oder Claude installieren Kopieren Sie diesen Prompt, fügen Sie ihn in Codex, Claude oder einen anderen Assistant ein und lassen Sie die Skill-Seite prüfen und installieren.

In Manus ausführen

Quelle

ben-manes

ben-manes/caffeine

GitHub-Repository öffnen Creator-Repositorys ansehen

Download

In Manus ausführen

Verwandte BerufeSOC

Basierend auf der SOC-Berufsklassifikation

DatenwissenschaftlerInformatik- und Mathematikberufe·SOC 15-2051

SoftwareentwicklerSOC 15-1252

SKILL.md

readonly

name	sim-compare
description	Run cache policy hit rate comparison across multiple cache sizes with charts
argument-hint	<trace-file> [policies...] [sizes...]
context	fork
disable-model-invocation	true
allowed-tools	Read, Grep, Glob, Bash

Run a comprehensive cache policy comparison for the given trace.

Input

Trace file: $ARGUMENTS

If no policies or sizes are specified, use sensible defaults based on the trace.

Workflow

Identify the trace format. Check the file extension and contents:
- .gz files: try common formats (lirs, arc, etc.)
- Look at simulator/src/main/resources/reference.conf for format options
- Use format:path syntax (e.g., lirs:trace.gz)
Select policies to compare. Policy names use category.PolicyName format. Note: config categories use hyphens (two-queue, greedy-dual), not underscores. Each policy is paired with configured admission filters (default: Always, TinyLfu, Clairvoyant), creating multiple instances per policy name.

Include at minimum:
- product.Caffeine (the production implementation)
- opt.Clairvoyant (theoretical optimal, upper bound)
- opt.Unbounded (infinite cache, ceiling)
- linked.Lru (baseline)
- sketch.WindowTinyLfu (research W-TinyLFU)
- sketch.HillClimberWindowTinyLfu (adaptive variant)
- Add relevant competitors based on trace characteristics:
  - For recency-heavy: linked.S4Lru, adaptive.Arc
  - For frequency-heavy: linked.Lfu, irr.Lirs
  - For scan-resistant: two-queue.TwoQueue, two-queue.S3Fifo
  - For size-aware traces: greedy-dual.Gdsf, greedy-dual.Camp
Choose cache sizes. Use a geometric progression covering the working set:
- Start small (e.g., 100), end near working set size
- 5-8 sizes: e.g., 100,500,1_000,2_500,5_000,10_000,25_000
- If the trace has few distinct keys, reduce the range

Run the simulation. Use the Gradle task:

./gradlew simulator:simulate -q \
  --maximumSize=100,500,1000,2500,5000,10000 \
  --metric="Hit Rate" \
  --title="Description" \
  --theme=light \
  --outputDir=build/reports/sim

Override the trace and policies via system properties appended to the command:

-Dcaffeine.simulator.files.paths.0="format:path/to/trace"
-Dcaffeine.simulator.policies.0=product.Caffeine
-Dcaffeine.simulator.policies.1=opt.Clairvoyant
# ... etc

Note: for single-size runs, use ./gradlew simulator:run -q with -Dcaffeine.simulator.maximum-size=N instead of simulator:simulate.

Read and interpret results. The simulate task produces:
- Individual CSV per cache size
- Combined CSV (policies as rows, sizes as columns)
- PNG chart (line graph of metric vs cache size) Read the CSV output files in the output directory:
- Compare hit rates across policies at each cache size
- Identify the crossover points where one policy overtakes another
- Note the gap between Caffeine and Clairvoyant (theoretical ceiling)
Explain findings. For each notable result:
- WHY does policy X beat policy Y on this trace?
- What trace characteristic drives the difference? (frequency bias, recency bias, scan patterns, temporal shifts)
- How close is Caffeine to optimal? Where does it lose?
- Reference the relevant research paper if applicable:
  - TinyLFU paper for admission filter behavior
  - Adaptive paper for hill climber effectiveness
  - See .claude/docs/research-foundations.md for paper-to-code mapping
Report. Present:
- Summary table of hit rates at each cache size
- Key takeaways (2-3 sentences)
- Notable policy behaviors
- Path to generated chart PNG

Mehr aus diesem Repository

gleiches Repository

audit-adaptivity

ben-manes/caffeine

Audit the adaptive window hill-climber and region-resize logic for implementation defects (not algorithm quality)

2026-06-1717.7k

audit-jcache-conformance

ben-manes/caffeine

JSR-107 (JCache) spec-conformance audit

2026-06-1717.7k

audit-state-machine

ben-manes/caffeine

Audit explicit state machines (drain status, node lifecycle, async-value lifecycle) for illegal or missed transitions

2026-06-1717.7k

audit-temporal-walk

ben-manes/caffeine

Heavyweight history-mining bug audit. Walks the caffeine module's git history chronologically (oldest to HEAD), maintains a forward-tracked issue database, and surfaces concerns introduced by past commits that were never resolved. Catches bugs that snapshot mining cannot — half-fixes invisible from current state, latent+trigger pairs across multi-commit interactions, and partial refactors. Slow (model/effort-dependent; ~24h on Opus + max effort) and rare-run (every several months or before a major release).

2026-06-1717.7k

audit-sibling-divergence

ben-manes/caffeine

Differential audit comparing matched code paths that should behave identically. Spawns one auditor per sibling pair (sync/async, bounded/unbounded, view consistency, bulk vs single, generated node variants, read fast vs slow, adapter conformance) and requires a concrete witness scenario where the two paths diverge observably.

2026-06-0217.7k

audit-contract-drift

ben-manes/caffeine

Find places where documented API contracts and the implementation diverge

2026-04-2717.7k

name	sim-compare
description	Run cache policy hit rate comparison across multiple cache sizes with charts
argument-hint	<trace-file> [policies...] [sizes...]
context	fork
disable-model-invocation	true
allowed-tools	Read, Grep, Glob, Bash

Run a comprehensive cache policy comparison for the given trace.

Input

Trace file: $ARGUMENTS

If no policies or sizes are specified, use sensible defaults based on the trace.

Workflow

Identify the trace format. Check the file extension and contents:
- .gz files: try common formats (lirs, arc, etc.)
- Look at simulator/src/main/resources/reference.conf for format options
- Use format:path syntax (e.g., lirs:trace.gz)
Select policies to compare. Policy names use category.PolicyName format. Note: config categories use hyphens (two-queue, greedy-dual), not underscores. Each policy is paired with configured admission filters (default: Always, TinyLfu, Clairvoyant), creating multiple instances per policy name.

Include at minimum:
- product.Caffeine (the production implementation)
- opt.Clairvoyant (theoretical optimal, upper bound)
- opt.Unbounded (infinite cache, ceiling)
- linked.Lru (baseline)
- sketch.WindowTinyLfu (research W-TinyLFU)
- sketch.HillClimberWindowTinyLfu (adaptive variant)
- Add relevant competitors based on trace characteristics:
  - For recency-heavy: linked.S4Lru, adaptive.Arc
  - For frequency-heavy: linked.Lfu, irr.Lirs
  - For scan-resistant: two-queue.TwoQueue, two-queue.S3Fifo
  - For size-aware traces: greedy-dual.Gdsf, greedy-dual.Camp
Choose cache sizes. Use a geometric progression covering the working set:
- Start small (e.g., 100), end near working set size
- 5-8 sizes: e.g., 100,500,1_000,2_500,5_000,10_000,25_000
- If the trace has few distinct keys, reduce the range

Run the simulation. Use the Gradle task:

./gradlew simulator:simulate -q \
  --maximumSize=100,500,1000,2500,5000,10000 \
  --metric="Hit Rate" \
  --title="Description" \
  --theme=light \
  --outputDir=build/reports/sim

Override the trace and policies via system properties appended to the command:

-Dcaffeine.simulator.files.paths.0="format:path/to/trace"
-Dcaffeine.simulator.policies.0=product.Caffeine
-Dcaffeine.simulator.policies.1=opt.Clairvoyant
# ... etc

Note: for single-size runs, use ./gradlew simulator:run -q with -Dcaffeine.simulator.maximum-size=N instead of simulator:simulate.

Read and interpret results. The simulate task produces:
- Individual CSV per cache size
- Combined CSV (policies as rows, sizes as columns)
- PNG chart (line graph of metric vs cache size) Read the CSV output files in the output directory:
- Compare hit rates across policies at each cache size
- Identify the crossover points where one policy overtakes another
- Note the gap between Caffeine and Clairvoyant (theoretical ceiling)
Explain findings. For each notable result:
- WHY does policy X beat policy Y on this trace?
- What trace characteristic drives the difference? (frequency bias, recency bias, scan patterns, temporal shifts)
- How close is Caffeine to optimal? Where does it lose?
- Reference the relevant research paper if applicable:
  - TinyLFU paper for admission filter behavior
  - Adaptive paper for hill climber effectiveness
  - See .claude/docs/research-foundations.md for paper-to-code mapping
Report. Present:
- Summary table of hit rates at each cache size
- Key takeaways (2-3 sentences)
- Notable policy behaviors
- Path to generated chart PNG