debug-pnl

Updated February 26, 2026 at 19:53

Step-by-step workflow for diagnosing why the market maker is losing money

Source

trudumb

trudumb/hyper_make

SKILL.md

name	debug-pnl
description	Step-by-step workflow for diagnosing why the market maker is losing money
disable-model-invocation	true
context	fork
agent	general-purpose
argument-hint	[asset] [timerange]

Debug PnL Workflow

Structured diagnostic for market maker PnL issues. Follow these steps in order.

Step 1: Identify the Drag

Check PnL attribution to find which component is causing losses. Read the relevant analytics:

src/market_maker/analytics/attribution.rs   — PnL decomposition
src/market_maker/analytics/edge_metrics.rs  — Edge tracking
src/market_maker/analytics/sharpe.rs        — Sharpe ratio

PnL decomposes into:

Spread capture (positive) — edge from bid-ask spread
Adverse selection (negative) — fills on wrong side of price move
Inventory cost (negative) — holding risk
Fees (negative) — exchange maker fees (1.5 bps on Hyperliquid)

The largest negative component is your priority target.
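
The selection rule above can be sketched in a few lines. This is a minimal illustration, not the code in attribution.rs: the `PnlAttribution` struct, its field names, and the basis-point units are assumptions made for the example.

```rust
/// Hypothetical attribution record in basis points. Field names are
/// illustrative and are not taken from attribution.rs.
#[derive(Debug, Clone, Copy)]
struct PnlAttribution {
    spread_capture_bps: f64,    // expected to be positive
    adverse_selection_bps: f64, // expected to be negative
    inventory_cost_bps: f64,    // expected to be negative
    fees_bps: f64,              // expected to be negative
}

impl PnlAttribution {
    /// Net PnL is the sum of the four components.
    fn net_bps(&self) -> f64 {
        self.spread_capture_bps
            + self.adverse_selection_bps
            + self.inventory_cost_bps
            + self.fees_bps
    }

    /// Name and value of the most negative cost component,
    /// or None when no cost component is negative.
    fn dominant_drag(&self) -> Option<(&'static str, f64)> {
        let costs = [
            ("adverse_selection", self.adverse_selection_bps),
            ("inventory_cost", self.inventory_cost_bps),
            ("fees", self.fees_bps),
        ];
        costs
            .iter()
            .copied()
            .filter(|(_, v)| *v < 0.0)
            .min_by(|a, b| a.1.total_cmp(&b.1))
    }
}

fn main() {
    let attr = PnlAttribution {
        spread_capture_bps: 4.0,
        adverse_selection_bps: -3.5,
        inventory_cost_bps: -0.8,
        fees_bps: -1.5,
    };
    println!("net = {:.2} bps", attr.net_bps());
    println!("dominant drag = {:?}", attr.dominant_drag());
}
```

In this made-up sample, spread capture does not cover the three costs, and adverse selection is the component to investigate first.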

Step 2: Route to Component

| Dominant Drag | Check These | Read Skill |
|---|---|---|
| Adverse selection | `adverse_selection/`, `calibration/` | `adverse-selection-classifier` |
| Spread too tight | `strategy/signal_integration.rs` | `quote-engine` |
| Inventory cost | `strategy/position_manager.rs`, `control/` | `stochastic-controller` |
| Fill rate collapsed | `estimator/kappa.rs`, `simulation/fill_sim.rs` | `fill-intensity-hawkes` |
| Regime misclassification | `estimator/regime_hmm.rs` | `regime-detection-hmm` |
| Cascade losses | `risk/circuit_breaker.rs`, `risk/monitors/cascade.rs` | `risk-management` |
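
If the routing is scripted rather than read by hand, the table maps naturally onto a `match`. The sketch below copies the paths and skill names from the table; the `Drag` enum, the `Route` struct, and the `route` function are hypothetical names introduced for illustration only.

```rust
/// Hypothetical enum covering the six rows of the routing table.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum Drag {
    AdverseSelection,
    SpreadTooTight,
    InventoryCost,
    FillRateCollapsed,
    RegimeMisclassification,
    CascadeLosses,
}

/// Where to look and which skill to read for a given drag.
struct Route {
    check: &'static [&'static str],
    skill: &'static str,
}

fn route(drag: Drag) -> Route {
    match drag {
        Drag::AdverseSelection => Route {
            check: &["adverse_selection/", "calibration/"],
            skill: "adverse-selection-classifier",
        },
        Drag::SpreadTooTight => Route {
            check: &["strategy/signal_integration.rs"],
            skill: "quote-engine",
        },
        Drag::InventoryCost => Route {
            check: &["strategy/position_manager.rs", "control/"],
            skill: "stochastic-controller",
        },
        Drag::FillRateCollapsed => Route {
            check: &["estimator/kappa.rs", "simulation/fill_sim.rs"],
            skill: "fill-intensity-hawkes",
        },
        Drag::RegimeMisclassification => Route {
            check: &["estimator/regime_hmm.rs"],
            skill: "regime-detection-hmm",
        },
        Drag::CascadeLosses => Route {
            check: &["risk/circuit_breaker.rs", "risk/monitors/cascade.rs"],
            skill: "risk-management",
        },
    }
}

fn main() {
    let r = route(Drag::FillRateCollapsed);
    println!("check {:?}, then read skill {}", r.check, r.skill);
}
```

Because the `match` is exhaustive, adding a new drag category forces a routing decision at compile time.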

Step 3: Check Calibration

For the identified component, check its calibration metrics:

src/market_maker/calibration/brier_score.rs       — Brier Score
src/market_maker/calibration/information_ratio.rs  — Information Ratio
src/market_maker/calibration/conditional_metrics.rs — Conditional calibration
src/market_maker/calibration/model_gating.rs       — Model weight/gating

Key thresholds:

IR < 1.0: model adding noise, not signal
IR < 0.8: model actively harmful, consider disabling
Brier > 0.25: severe miscalibration (on a binary outcome, worse than always predicting 0.5)
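
As a rough sketch of how these thresholds might be applied, the code below computes a Brier score with the standard definition (mean squared difference between predicted probability and binary outcome) and buckets an already-computed IR value. The function and enum names are hypothetical, and the definition of IR used in information_ratio.rs is not assumed here: the value is taken as an input.

```rust
/// Hypothetical health labels matching the IR thresholds above.
#[derive(Debug, PartialEq)]
enum ModelHealth {
    Healthy,
    Noise,   // IR < 1.0
    Harmful, // IR < 0.8
}

/// Bucket a precomputed IR value using the thresholds above.
fn classify_ir(ir: f64) -> ModelHealth {
    if ir < 0.8 {
        ModelHealth::Harmful
    } else if ir < 1.0 {
        ModelHealth::Noise
    } else {
        ModelHealth::Healthy
    }
}

/// Standard Brier score: mean of (p - y)^2 with y in {0, 1}.
/// Returns None for empty or mismatched inputs.
fn brier_score(predictions: &[f64], outcomes: &[bool]) -> Option<f64> {
    if predictions.is_empty() || predictions.len() != outcomes.len() {
        return None;
    }
    let sum: f64 = predictions
        .iter()
        .zip(outcomes)
        .map(|(&p, &o)| {
            let y: f64 = if o { 1.0 } else { 0.0 };
            (p - y).powi(2)
        })
        .sum();
    Some(sum / predictions.len() as f64)
}

fn main() {
    // Made-up predictions and outcomes, for illustration only.
    let predicted = [0.9, 0.2, 0.7, 0.4];
    let outcomes = [true, false, false, true];
    if let Some(bs) = brier_score(&predicted, &outcomes) {
        println!("Brier = {:.3}, severe = {}", bs, bs > 0.25);
    }
    println!("IR 0.9 -> {:?}", classify_ir(0.9));
}
```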

Step 4: Conditional Analysis

Check if the issue is regime-specific:

Does the model fail only in high-vol? Check conditional_metrics.rs by volatility bucket
Does it fail near funding settlement? Check by time_to_funding_settlement_s
Does it fail at specific times of day? Check by hour
Does it fail at specific position sizes? Check by inventory level

Regime-specific failures suggest the model needs regime-dependent parameters.
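
One way to run this kind of conditional check is to group predictions by a bucket label and compute a calibration metric per group. The sketch below does this with a per-bucket Brier score; the `Sample` struct, the bucket labels, and the `brier_by_bucket` function are assumptions for illustration and do not reflect the interface of conditional_metrics.rs.

```rust
use std::collections::BTreeMap;

/// Hypothetical labelled prediction. `bucket` could be a volatility
/// bucket, an hour of day, or an inventory band.
struct Sample {
    bucket: &'static str,
    predicted: f64,
    outcome: bool,
}

/// Brier score per bucket: mean of (p - y)^2 within each group.
fn brier_by_bucket(samples: &[Sample]) -> BTreeMap<&'static str, f64> {
    let mut acc: BTreeMap<&'static str, (f64, usize)> = BTreeMap::new();
    for s in samples {
        let y: f64 = if s.outcome { 1.0 } else { 0.0 };
        let entry = acc.entry(s.bucket).or_insert((0.0, 0));
        entry.0 += (s.predicted - y).powi(2);
        entry.1 += 1;
    }
    acc.into_iter()
        .map(|(bucket, (sum, n))| (bucket, sum / n as f64))
        .collect()
}

fn main() {
    // Made-up samples, for illustration only.
    let samples = [
        Sample { bucket: "low_vol", predicted: 0.8, outcome: true },
        Sample { bucket: "low_vol", predicted: 0.1, outcome: false },
        Sample { bucket: "high_vol", predicted: 0.9, outcome: false },
        Sample { bucket: "high_vol", predicted: 0.3, outcome: true },
    ];
    for (bucket, score) in brier_by_bucket(&samples) {
        println!("{bucket}: Brier = {score:.3}");
    }
}
```

A metric that looks acceptable in aggregate but degrades sharply in one bucket is the signature of a regime-specific failure.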

Step 5: Validate Fix

After identifying and fixing the issue:

Check calibration metrics before/after
Run the paper trader to validate in simulation
Monitor live metrics for at least 1 hour after deployment
Watch for regression in other components — fixes often shift PnL between components
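
The last check can be made mechanical by comparing per-component attribution before and after the change. The following is a minimal sketch under stated assumptions: component PnL is given as `(name, bps)` pairs, and the `regressions` function and its tolerance parameter are hypothetical, not part of the repository.

```rust
/// Names of components whose PnL (in bps) got worse by more than
/// `tolerance_bps` between the two snapshots.
fn regressions<'a>(
    before: &[(&'a str, f64)],
    after: &[(&'a str, f64)],
    tolerance_bps: f64,
) -> Vec<&'a str> {
    let mut worse = Vec::new();
    for &(name, b) in before {
        for &(other, a) in after {
            if other == name && a - b < -tolerance_bps {
                worse.push(name);
            }
        }
    }
    worse
}

fn main() {
    // Made-up numbers: the fix helps adverse selection but
    // pushes cost into inventory.
    let before = [("adverse_selection", -3.5), ("inventory_cost", -0.8)];
    let after = [("adverse_selection", -1.0), ("inventory_cost", -2.0)];
    println!("regressed: {:?}", regressions(&before, &after, 0.5));
}
```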

Common Patterns

"Making money in quiet, losing in volatile"

Gamma too low for high-vol regime
Cascade detection too slow
Spread floor not wide enough

"Fills are profitable but too few"

Kappa estimate too low (spreads too wide)
Check fill_rate_model.rs for stale estimates
Calibration gamma too conservative
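
For intuition on why a low kappa widens quotes, consider the textbook Avellaneda-Stoikov approximation, in which the total spread is gamma * sigma^2 * tau + (2 / gamma) * ln(1 + gamma / kappa). Whether the quote engine in this repository uses exactly this form is an assumption; the function name and the parameter values below are illustrative only.

```rust
/// Avellaneda-Stoikov approximate total spread:
///   gamma * sigma^2 * tau + (2 / gamma) * ln(1 + gamma / kappa)
/// Assumed here for illustration; the repository's formula may differ.
fn as_total_spread(gamma: f64, sigma: f64, tau: f64, kappa: f64) -> f64 {
    let risk_term = gamma * sigma * sigma * tau;
    let liquidity_term = (2.0 / gamma) * (1.0 + gamma / kappa).ln();
    risk_term + liquidity_term
}

fn main() {
    // Made-up parameters; only kappa varies between the two calls.
    let (gamma, sigma, tau) = (0.1, 0.02, 1.0);
    let wide = as_total_spread(gamma, sigma, tau, 0.5);
    let tight = as_total_spread(gamma, sigma, tau, 2.0);
    println!("kappa = 0.5 -> spread {wide:.4}");
    println!("kappa = 2.0 -> spread {tight:.4}");
}
```

With gamma, sigma, and tau held fixed, the liquidity term grows as kappa shrinks, so an underestimated kappa produces wider quotes and fewer fills.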

"Fills are plentiful but adverse"

Pre-fill toxicity classifier miscalibrated
Check pre_fill_classifier.rs z-score computation
Lead-lag signal may have decayed

"Position builds up, can't unwind"

Inventory skew not aggressive enough
Position guard soft threshold too high
Check for asymmetric fill rates (buying but not selling)
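
A quick way to test for asymmetric fill rates is a signed imbalance over recent fills. This sketch assumes fills are available as a list of sides; the `Side` enum and `fill_imbalance` function are hypothetical names, not the repository's types.

```rust
#[derive(Debug, Clone, Copy, PartialEq)]
enum Side {
    Buy,
    Sell,
}

/// Signed fill imbalance in [-1, 1]: (buys - sells) / total.
/// Positive values mean the maker is mostly being filled on its bids.
/// Returns None when there are no fills.
fn fill_imbalance(fills: &[Side]) -> Option<f64> {
    if fills.is_empty() {
        return None;
    }
    let buys = fills.iter().filter(|&&s| s == Side::Buy).count() as f64;
    let total = fills.len() as f64;
    let sells = total - buys;
    Some((buys - sells) / total)
}

fn main() {
    let fills = [Side::Buy, Side::Buy, Side::Buy, Side::Sell];
    println!("imbalance = {:?}", fill_imbalance(&fills));
}
```

A value that stays far from zero over a window means fills are accumulating on one side, which is consistent with a position that builds and does not unwind.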
