원클릭으로 Manus에서 모든 스킬 실행

시작하기

audit-adaptivity

스타17,720

포크1,694

업데이트2026년 6월 17일 09:34

Audit the adaptive window hill-climber and region-resize logic for implementation defects (not algorithm quality)

설치

Codex 또는 Claude로 설치 이 Prompt를 복사해 Codex, Claude 또는 다른 어시스턴트에 붙여 넣으면 Skill 페이지를 검토하고 설치를 진행할 수 있습니다.

Manus에서 실행

출처

ben-manes

ben-manes/caffeine

GitHub 저장소 열기 Creator 저장소 보기

다운로드

Manus에서 실행

Scope boundary — implementation correctness, NOT algorithm quality

The adaptation policy itself is at its tuned frontier. Out of scope: convergence rate, hit-rate, oscillation-as-a-design-tradeoff, and the choice of the tuning constants. Do NOT report "the climber could converge faster / oscillates / constant X should be Y." Report only defects: arithmetic that yields a wrong value, a sign error, a state that violates a structural invariant, a race, a NaN, or an overflow.

Methods in scope

climb, determineAdjustment — the feedback step (hit-rate delta → step → adjustment)
increaseWindow, decreaseWindow, demoteFromMainProtected — region transfer
setMaximumSize — initial window/main split and the SMALL_CACHE_THRESHOLD step-sign flip
evictFromWindow / evictFromMain — they consume the region maxima the climber sets

Fields/constants: windowMaximum, mainProtectedMaximum, windowWeightedSize, mainProtectedWeightedSize, stepSize, adjustment, hitsInSample, missesInSample, previousSampleHitRate; HILL_CLIMBER_STEP_PERCENT, HILL_CLIMBER_STEP_DECAY_RATE, HILL_CLIMBER_RESTART_THRESHOLD, HILL_CLIMBER_MIN_INITIAL_STEP, SMALL_CACHE_THRESHOLD, SMALL_CACHE_SAMPLE_RATIO_CAP, QUEUE_TRANSFER_THRESHOLD.

Structural invariants to attack (violations are real bugs)

Region partition sum: windowMaximum + mainMaximum (probation + protected) must equal maximum() after every climb and every resize. Can any single transfer, or a sequence capped by QUEUE_TRANSFER_THRESHOLD, drift the sum?
Non-negative maxima: can windowMaximum or mainProtectedMaximum go negative — a quota larger than the donor region, or repeated decreaseWindow at the floor?
Quota accounting: in increaseWindow/decreaseWindow the quota is decremented per transferred node by policyWeight. With weighted entries, can quota underflow, skip/over-run the loop, or transfer the wrong count? Does the QUEUE_TRANSFER_THRESHOLD cap leave the regions half-adjusted such that the next climb mis-reads them?
determineAdjustment math:
- requestCount = hits + misses; the early return guards requestCount < effectiveSampleSize. Is the hitRate division ever reachable with requestCount == 0?
- small-cache branch: effectiveSampleSize = (long)(sampleSize * ratio), where ratio = clamp(initialStep / magnitude). Can initialStep be 0 (maximum 0 or tiny) making magnitude 0 → division by zero? Can the (long) cast truncate ratio so it defeats the intended sample-period growth?
- nextStepSize uses Math.copySign(max(...), amount). For amount == 0.0 / -0.0, does copySign choose the intended direction? Can stepSize become NaN or 0 and permanently stall adaptation (a stuck-window bug, distinct from slow convergence)?
setMaximumSize at boundaries: the step-sign flip at max <= SMALL_CACHE_THRESHOLD plus a runtime maximum change via Policy.eviction().setMaximum — when maximum crosses SMALL_CACHE_THRESHOLD in either direction, do the window/main split, the stepSize sign, and the sample state stay mutually consistent?
Stale adjustment consumption: climb calls determineAdjustment then increaseWindow/decreaseWindow off adjustment(). When determineAdjustment early-returns (uninitialized sketch, sub-sample request count), can a stale adjustment from a prior cycle be re-applied?

Output

For each defect: give concrete maximum/weight/access values, trace the arithmetic step by step, show the resulting invariant violation or wrong region size, and a Verification (a BoundedLocalCacheTest white-box method plus the required -P flags).

Everything here runs under evictionLock (single-writer), so most findings will be arithmetic / state-corruption, not races — but explicitly check whether any climber-written field (adjustment, stepSize, the region maxima) is also read off-lock by a concurrent reader before concluding "single-writer, cannot race."

이 저장소의 다른 Skills

같은 저장소

audit-jcache-conformance

ben-manes/caffeine

JSR-107 (JCache) spec-conformance audit

2026-06-1717.7k

audit-state-machine

ben-manes/caffeine

Audit explicit state machines (drain status, node lifecycle, async-value lifecycle) for illegal or missed transitions

2026-06-1717.7k

audit-temporal-walk

ben-manes/caffeine

Heavyweight history-mining bug audit. Walks the caffeine module's git history chronologically (oldest to HEAD), maintains a forward-tracked issue database, and surfaces concerns introduced by past commits that were never resolved. Catches bugs that snapshot mining cannot — half-fixes invisible from current state, latent+trigger pairs across multi-commit interactions, and partial refactors. Slow (model/effort-dependent; ~24h on Opus + max effort) and rare-run (every several months or before a major release).

2026-06-1717.7k

audit-sibling-divergence

ben-manes/caffeine

Differential audit comparing matched code paths that should behave identically. Spawns one auditor per sibling pair (sync/async, bounded/unbounded, view consistency, bulk vs single, generated node variants, read fast vs slow, adapter conformance) and requires a concrete witness scenario where the two paths diverge observably.

2026-06-0217.7k

audit-contract-drift

ben-manes/caffeine

Find places where documented API contracts and the implementation diverge

2026-04-2717.7k

audit-exception-safety

ben-manes/caffeine

Audit exception safety and failure atomicity across all throw sites

2026-04-1317.7k

name	audit-adaptivity
description	Audit the adaptive window hill-climber and region-resize logic for implementation defects (not algorithm quality)
context	fork
agent	auditor
disable-model-invocation	true

Scope boundary — implementation correctness, NOT algorithm quality

Methods in scope

climb, determineAdjustment — the feedback step (hit-rate delta → step → adjustment)
increaseWindow, decreaseWindow, demoteFromMainProtected — region transfer
setMaximumSize — initial window/main split and the SMALL_CACHE_THRESHOLD step-sign flip
evictFromWindow / evictFromMain — they consume the region maxima the climber sets

Structural invariants to attack (violations are real bugs)

Region partition sum: windowMaximum + mainMaximum (probation + protected) must equal maximum() after every climb and every resize. Can any single transfer, or a sequence capped by QUEUE_TRANSFER_THRESHOLD, drift the sum?
Non-negative maxima: can windowMaximum or mainProtectedMaximum go negative — a quota larger than the donor region, or repeated decreaseWindow at the floor?
Quota accounting: in increaseWindow/decreaseWindow the quota is decremented per transferred node by policyWeight. With weighted entries, can quota underflow, skip/over-run the loop, or transfer the wrong count? Does the QUEUE_TRANSFER_THRESHOLD cap leave the regions half-adjusted such that the next climb mis-reads them?
determineAdjustment math:
- requestCount = hits + misses; the early return guards requestCount < effectiveSampleSize. Is the hitRate division ever reachable with requestCount == 0?
- small-cache branch: effectiveSampleSize = (long)(sampleSize * ratio), where ratio = clamp(initialStep / magnitude). Can initialStep be 0 (maximum 0 or tiny) making magnitude 0 → division by zero? Can the (long) cast truncate ratio so it defeats the intended sample-period growth?
- nextStepSize uses Math.copySign(max(...), amount). For amount == 0.0 / -0.0, does copySign choose the intended direction? Can stepSize become NaN or 0 and permanently stall adaptation (a stuck-window bug, distinct from slow convergence)?
setMaximumSize at boundaries: the step-sign flip at max <= SMALL_CACHE_THRESHOLD plus a runtime maximum change via Policy.eviction().setMaximum — when maximum crosses SMALL_CACHE_THRESHOLD in either direction, do the window/main split, the stepSize sign, and the sample state stay mutually consistent?
Stale adjustment consumption: climb calls determineAdjustment then increaseWindow/decreaseWindow off adjustment(). When determineAdjustment early-returns (uninitialized sketch, sub-sample request count), can a stale adjustment from a prior cycle be re-applied?