Run any Skill in Manus with one click

Get Started

skill-iterative-loop

Run tasks in a loop until goals are met — use for iterative refinement, polling, or convergence

Run Skill in Manus

Overview

Run tasks in a loop until goals are met — use for iterative refinement, polling, or convergence

Install command

npx skills add https://github.com/nyldn/claude-octopus --skill skill-iterative-loop

Copy and paste this command into Claude Code to install the skill

Source

nyldn/claude-octopus

Stars3,463

Forks310

UpdatedMay 19, 2026 at 11:47

SKILL.md

readonly

Iterative Loop Execution

Overview

Systematic iterative execution with clear goals, exit conditions, and progress tracking.

Core principle: Define goal → Set max iterations → Execute → Evaluate → Loop or complete.

When to Use

Use this skill when user wants to:

Execute a task multiple times with refinements
Loop until a condition is met
Iteratively improve something (code, tests, performance)
Retry operations with modifications
Progressive enhancement in rounds

Do NOT use for:

Single execution ("run tests once")
Manual step-by-step work
Infinite loops without bounds
Simple retry logic (use skill-debug)

The Process

Phase 1: Loop Setup

Step 1: Understand the Intent

**Loop Intent:**

Goal: [what should be achieved]
Success criteria: [how do we know we're done]
Max iterations: [safety limit]
Per-iteration tasks: [what to do each loop]

Step 2: Clarify Parameters

Use AskUserQuestion if unclear:

Max iterations: How many times maximum?
Success condition: What indicates we can stop early?
Per-iteration actions: What exactly to do each round?
Failure handling: What if it never succeeds?

Step 3: Safety Checks

**Safety Validation:**

- [ ] Max iterations defined (no infinite loops)
- [ ] Success condition is measurable
- [ ] Each iteration makes progress
- [ ] Failure exit strategy exists
- [ ] User aware of potential duration

Never proceed without max iterations defined.

Phase 2: Loop Execution

Step 1: Initialize Loop

**Starting Iterative Loop**

Goal: [description]
Max iterations: [N]
Success criteria: [condition]

---

### Iteration 1 / [N]

Step 2: Execute Iteration

For each iteration:

**Iteration [current] / [max]**

**Actions:**
1. [Action 1]
   → [result/output]
2. [Action 2]
   → [result/output]
3. [Action 3]
   → [result/output]

**Evaluation:**
- Success criteria met? [Yes/No]
- Progress made? [Yes/No]
- Issues found: [list any issues]

**Status:** [Continue/Success/Need intervention]

---

Step 3: Progress Tracking

Use TodoWrite to track iterations:

Iteration Progress:
✓ Iteration 1 - [what was done]
✓ Iteration 2 - [what was done]
⚙️ Iteration 3 - [in progress]
- Iteration 4 - [pending]
- Iteration 5 - [pending]

Phase 3: Exit Conditions

Exit Condition 1: Success

🎉 **Success! Loop complete.**

**Goal achieved:** [description]
**Iterations used:** [N] / [max]

**Final state:**
[description of what was achieved]

**Summary of iterations:**
1. Iteration 1: [what happened]
2. Iteration 2: [what happened]
...
N. Iteration N: [what happened] ✓ Success

Exit Condition 2: Max Iterations Reached

⚠️ **Max iterations reached without full success**

**Iterations completed:** [max]
**Goal:** [description]
**Current state:** [how close we got]

**Progress made:**
- [Improvement 1]
- [Improvement 2]
- [Improvement 3]

**Remaining issues:**
- [Issue 1]
- [Issue 2]

**Options:**
1. Accept current state (substantial progress made)
2. Continue with [N] more iterations
3. Change approach (current method may not work)

What would you like to do?

Exit Condition 3: No Progress Detected

🛑 **Stopping early: No progress detected**

**Iteration:** [N] / [max]
**Reason:** Last [M] iterations showed no improvement

**Analysis:**
This suggests the current approach may be fundamentally flawed.

**Recommendation:**
Rather than continue looping, let's:
1. Analyze why no progress is being made
2. Consider alternative approaches
3. Re-evaluate the goal or success criteria

Shall we pause and reassess?

Common Patterns

Pattern 1: Loop with Testing

User: "Loop around 5 times auditing, enhancing, testing, until it's done"

Implementation:

**Loop Goal:** Code passes all quality gates
**Max Iterations:** 5
**Per-iteration:**
1. Audit code for issues
2. Enhance/fix identified issues
3. Run tests
4. Check if all pass

**Success:** All tests pass + no issues found

Execute:
Iteration 1:
- Audit → Found 8 issues
- Fix → Fixed 8 issues
- Test → 2 tests still failing
- Continue

Iteration 2:
- Audit → Found 2 new issues from fixes
- Fix → Fixed 2 issues
- Test → All tests pass ✓
- Success! Stopping early (2/5 iterations used)

Pattern 2: Performance Optimization Loop

User: "Keep trying optimizations until we hit < 100ms response time"

Implementation:

**Loop Goal:** Response time < 100ms
**Max Iterations:** 10
**Per-iteration:**
1. Measure current performance
2. Identify bottleneck
3. Apply optimization
4. Re-measure

**Success:** Response time < 100ms

Execute:
Iteration 1: 450ms → Cache database queries → 280ms (Continue)
Iteration 2: 280ms → Add index to frequent query → 150ms (Continue)
Iteration 3: 150ms → Implement response compression → 85ms (Success!)

Pattern 3: Retry with Backoff

User: "Try deploying, retry up to 3 times if it fails"

Implementation:

**Loop Goal:** Successful deployment
**Max Iterations:** 3
**Per-iteration:**
1. Attempt deployment
2. Check status
3. If failed, wait before retry

**Success:** Deployment succeeds

Execute:
Iteration 1: Deploy → Failed (API timeout) → Wait 10s
Iteration 2: Deploy → Failed (API timeout) → Wait 20s
Iteration 3: Deploy → Success ✓

Pattern 4: Incremental Refinement

User: "Iterate 4 times improving the error messages based on user feedback"

Implementation:

**Loop Goal:** Error messages meet clarity standard
**Max Iterations:** 4
**Per-iteration:**
1. Review current error messages
2. Identify confusing ones
3. Rewrite for clarity
4. Evaluate against criteria

**Success:** All messages rated 8+/10 for clarity

Execute each iteration with progressive improvement

Integration with Other Skills

With skill-debug

Loop for debugging:
"Keep debugging until all tests pass, max 5 tries"

Each iteration:
- Use skill-debug to investigate failure
- Apply fix
- Re-run tests
- Evaluate

With skill-audit

Loop for comprehensive checking:
"Loop 3 times auditing different aspects"

Iteration 1: Audit security
Iteration 2: Audit performance
Iteration 3: Audit accessibility

With skill-tdd

Loop for TDD cycles:
"Do 5 red-green-refactor cycles"

Each iteration:
- Write failing test (red)
- Make it pass (green)
- Refactor (refactor)
- Evaluate and continue

Best Practices

1. Always Define Max Iterations

Good:

Loop max 5 times trying to fix the issue

Dangerous:

Keep trying until it works
(What if it never works? Infinite loop!)

2. Measurable Success Criteria

Good:

Success: All 15 tests pass AND code coverage > 80%

Poor:

Success: Code looks better
(Too subjective)

3. Make Progress Visible

**Progress Tracking:**

Iteration 1: 5/15 tests passing
Iteration 2: 10/15 tests passing
Iteration 3: 13/15 tests passing
Iteration 4: 15/15 tests passing ✓

4. Early Exit on Success

Don't continue looping if goal is achieved:

**Iteration 2/5:** All tests pass!

Stopping early - goal achieved.
No need to continue to iteration 3.

5. Detect Stalls

Iteration 4: 10/15 tests passing
Iteration 5: 10/15 tests passing
Iteration 6: 10/15 tests passing

⚠️ No progress in 3 iterations - stopping to reassess approach

Red Flags - Don't Do This

Action	Why It's Dangerous
No max iterations	Could loop forever
Vague success criteria	Don't know when to stop
No progress tracking	Can't tell if making progress
Ignoring stalls	Waste time on ineffective approach
Same action each loop	If not working, need different approach

Strategy Rotation

If the strategy-rotation hook fires, immediately change approach. Do not retry the same approach. Explain what you'll do differently before the next attempt. The hook fires after consecutive failures of the same tool — this is a strong signal that the current approach is fundamentally wrong, not just slightly off.

Self-Regulation (MANDATORY)

Every iterative loop MUST track a Self-Regulation Score that accumulates danger signals. This prevents runaway loops where the agent keeps "fixing" things without real progress.

Sliding-Window Stuck Detection

Maintain a mental window of the last 10 iterations (or fewer if less than 10 have run). After each iteration, check for repeated patterns:

Single-state repetition: Did the same outcome/error occur 3+ times consecutively?

Same test failure, same error message, same files modified → STUCK

Multi-step cycle detection: Is there an A→B→A→B oscillation?

Iteration N touches file X, N+1 touches file Y, N+2 touches file X again, N+3 touches Y again → CYCLE DETECTED
Compare the files modified and error messages across iterations, not just success/failure

On first detection: Announce the pattern to the user. Attempt ONE diagnostic retry with explicit acknowledgment: "This pattern has repeated — here's what I'll do differently: [specific change]."

On second detection: HALT immediately. Display the detected cycle and ask the user whether to continue with a completely different approach or stop.

WTF-Likelihood Score

Track a cumulative score starting at 0%. Each event adds to the score.

Default weights (override via ~/.claude-octopus/loop-config.conf):

Event	Score Impact
Revert (git revert, undo, roll back)	+15%
Touching files unrelated to the stated goal	+20%
A fix that requires changing >3 files	+5%
After the 15th fix attempt	+1% per additional fix
All remaining issues are Low severity	+10%

If WTF score exceeds 20%: STOP immediately. Show:

The current WTF score and what contributed to it
Work completed so far
Ask the user: "Continue with a different approach, or stop here?"

Hard cap: 50 iterations regardless of score or progress. No exceptions.

Configurable Weights

At loop start, check for ~/.claude-octopus/loop-config.conf. If it exists, read the key=value pairs and use them instead of defaults. Format:

# Loop Self-Regulation Configuration
WINDOW_SIZE=10
REVERT_PENALTY=15
UNRELATED_FILES_PENALTY=20
LARGE_FIX_PENALTY=5
AFTER_FIX_15_PENALTY=1
ALL_LOW_SEVERITY_PENALTY=10
WTF_THRESHOLD=20
HARD_CAP=50
STUCK_THRESHOLD=3

If the file does not exist, use the defaults shown above. Users can create this file to tune sensitivity for their workflow.

How to Track

You do NOT need external tools for this. Track mentally during the loop:

After each iteration, mentally note: files touched, outcome, whether a revert happened
Compare against the sliding window of recent iterations
Accumulate the WTF score
Report both the iteration count AND the self-regulation score in each iteration summary:

Iteration 5/20 | Self-regulation: 10% (1 revert, 0 unrelated files)

Interaction with Strategy Rotation

The strategy-rotation hook and self-regulation are complementary:

Strategy rotation fires on consecutive tool failures (same tool, same error)
Self-regulation fires on outcome patterns (cycles, reverts, scope creep)
Both can fire independently. If both fire, HALT — the loop is definitely stuck.

Safety Mechanisms

1. Iteration Limit

MAX_ITERATIONS = user_specified or 10  # Always have a limit
HARD_CAP = 50  # Absolute maximum regardless of user setting

2. Self-Regulation Score

Track WTF score across iterations.
If score > 20%: STOP and ask user.

3. Sliding-Window Detection

Track last 10 iterations.
If repeated pattern detected twice: STOP and ask user.

4. Progress Detection

If last 3 iterations show same result:
  → Stop and ask user

5. Time Limit (for long operations)

If total time > 30 minutes:
  → Checkpoint progress
  → Ask user if should continue

6. User Checkpoints

Every N iterations:
  → Show progress
  → Ask if should continue or adjust approach

Quick Reference

Pattern	Max Iterations	Success Criteria	Early Exit
Test until pass	5-10	All tests pass	Yes
Performance optimization	10-20	Metric < target	Yes
Retry with backoff	3-5	Operation succeeds	Yes
Incremental refinement	3-7	Quality threshold met	Maybe
Comprehensive audit	3-5	All areas covered	No

Metric Verification Mode

When the user specifies a Metric command, switch to mechanical metric verification mode. This replaces subjective evaluation with automated measurement, git-backed experiments, and automatic rollback on regression.

Falls back to standard loop behavior (above) when no metric is specified.

Key Principles

One change per iteration (atomic) — never combine multiple unrelated changes
Mechanical verification only — no subjective "looks good"; the metric command decides
Automatic rollback on regression — git revert HEAD --no-edit if metric worsens
Simplicity wins — equal metric results + less code = KEEP the simpler version
Git is memory — every experiment is committed with experiment: prefix before verification
Guard commands must also pass — even if metric improves, a failing guard reverts the change

Parameters

Parameter	Format	Required	Description
Metric	`Metric: <shell command>`	Yes (for this mode)	Command whose stdout is a number (the metric value)
Direction	`Direction: higher\|lower`	Yes	Whether higher or lower metric values are better
Guard	`Guard: <shell command>`	No	Must exit 0 for a change to be kept; run after metric
Iterations	`Iterations: N`	No	Max iterations (default: unbounded, runs until interrupted)

Experiment Log

All results are logged as JSONL to .claude-octopus/experiments/<YYYY-MM-DD>.jsonl.

Each line is a JSON object:

{"iteration": 1, "timestamp": "2026-03-21T14:30:00Z", "metric": 72.5, "best": 72.5, "status": "kept", "description": "Add index to users table", "commit": "abc1234"}

Fields:

iteration — iteration number (starting from 1; iteration 0 is baseline)
timestamp — ISO 8601 timestamp
metric — measured value from the metric command
best — best metric value seen so far
status — "kept" (improvement), "reverted" (regression), or "error" (metric/guard crashed)
description — one-line summary of what was changed
commit — short git SHA of the experiment commit (before potential revert)

Execution Contract

You MUST follow this exact sequence for each iteration. No steps may be skipped or reordered.

Iteration 0: Establish Baseline

Create the experiment log directory if it does not exist:
```
mkdir -p .claude-octopus/experiments
```
Check for existing experiment log — if .claude-octopus/experiments/<today>.jsonl exists, read it to determine the current best metric value and iteration count. Resume from the next iteration number.
Run the metric command and capture the output number. This is the baseline.

Log the baseline:

{"iteration": 0, "timestamp": "...", "metric": <baseline>, "best": <baseline>, "status": "baseline", "description": "Baseline measurement", "commit": "<current HEAD short SHA>"}

Report the baseline value to the user.

Each Subsequent Iteration (1..N)

Step 1: Review state. Read the experiment log (.claude-octopus/experiments/<today>.jsonl), review git history (git log --oneline -10), and identify what has been tried, what worked, and what failed.

Step 2: Pick the next change. Based on what worked/failed/is untried, decide on ONE focused change. Do NOT combine multiple unrelated changes.

Step 3: Make the change. Implement exactly one atomic change.

Step 4: Git commit BEFORE verification. Commit with the experiment: prefix:

git add -A && git commit -m "experiment: <one-line description of the change>"

This ensures every experiment is recorded in git history regardless of outcome.

Step 5: Run mechanical verification. Execute the metric command and capture the numeric result.

Step 6: Evaluate and act.

If metric improved (higher when Direction=higher, lower when Direction=lower):
- If a Guard command is specified, run it now.
  - If guard passes (exit 0) → KEEP the commit. Update best metric.
  - If guard fails (exit non-zero) → REVERT: git revert HEAD --no-edit. Log status as "reverted".
- If no guard → KEEP the commit. Update best metric.
If metric stayed the same:
- Check if the change reduces code complexity or size. If simpler → KEEP (simplicity wins).
- Otherwise → REVERT: git revert HEAD --no-edit. Log status as "reverted".
If metric worsened:
- REVERT: git revert HEAD --no-edit. Log status as "reverted".
If metric command crashed (non-zero exit, no numeric output):
- Attempt a quick fix (one try only). If fix works, re-measure.
- If still broken → REVERT: git revert HEAD --no-edit. Log status as "error".

Step 7: Log the result. Append a JSONL entry to .claude-octopus/experiments/<today>.jsonl.

Step 8: Report iteration summary. Display:

Iteration N: <description>
  Metric: <value> (best: <best>) — <kept|reverted|error>

Step 9: Repeat — go to Step 1 of the next iteration, unless:

Iterations limit reached → stop and report final summary
User interrupts → stop and report final summary

Resume Behavior

If an experiment log already exists for today:

Read the log to determine the last iteration number and current best metric value
Resume from the next iteration number
Use the recorded best value as the comparison baseline
This allows stopping and resuming experiments across sessions

Final Summary

When the loop completes (iterations exhausted or user stops), report:

Experiment Complete
  Iterations: N
  Baseline: <initial metric>
  Final best: <best metric>
  Improvement: <delta> (<percentage>%)
  Kept: K changes, Reverted: R changes, Errors: E

Example

/octo:loop Metric: npm test -- --coverage | grep 'All files' | awk '{print $10}' Direction: higher Guard: npm test Iterations: 20

This will:

Measure baseline code coverage percentage
Each iteration: make one change, commit as experiment: ..., measure coverage
If coverage improves AND npm test passes → keep
If coverage drops OR tests fail → git revert HEAD --no-edit
After 20 iterations, report total improvement

The Bottom Line

Iterative loop → Clear goal + Max iterations + Progress tracking + Exit strategy
Otherwise → Infinite loops + Wasted effort + Unclear when done

Define the goal. Set the limit. Track progress. Know when to stop.

name	skill-iterative-loop
aliases	["iterative-loop","loop-execution","repeat-until"]
description	Run tasks in a loop until goals are met — use for iterative refinement, polling, or convergence
trigger	AUTOMATICALLY ACTIVATE when user requests iterative execution: - "loop X times" or "loop around N times" - "loop around 5 times auditing, enhancing, testing" - "keep trying until" or "iterate until" - "run until X passes" or "loop until Y works" - "repeat N times" or "try N times" DO NOT activate for: - Single execution requests - Manual retry requests - Infinite loops (require max iterations)

Iterative Loop Execution

Overview

Systematic iterative execution with clear goals, exit conditions, and progress tracking.

Core principle: Define goal → Set max iterations → Execute → Evaluate → Loop or complete.

When to Use

Use this skill when user wants to:

Execute a task multiple times with refinements
Loop until a condition is met
Iteratively improve something (code, tests, performance)
Retry operations with modifications
Progressive enhancement in rounds

Do NOT use for:

Single execution ("run tests once")
Manual step-by-step work
Infinite loops without bounds
Simple retry logic (use skill-debug)

The Process

Phase 1: Loop Setup

Step 1: Understand the Intent

**Loop Intent:**

Goal: [what should be achieved]
Success criteria: [how do we know we're done]
Max iterations: [safety limit]
Per-iteration tasks: [what to do each loop]

Step 2: Clarify Parameters

Use AskUserQuestion if unclear:

Max iterations: How many times maximum?
Success condition: What indicates we can stop early?
Per-iteration actions: What exactly to do each round?
Failure handling: What if it never succeeds?

Step 3: Safety Checks

**Safety Validation:**

- [ ] Max iterations defined (no infinite loops)
- [ ] Success condition is measurable
- [ ] Each iteration makes progress
- [ ] Failure exit strategy exists
- [ ] User aware of potential duration

Never proceed without max iterations defined.

Phase 2: Loop Execution

Step 1: Initialize Loop

**Starting Iterative Loop**

Goal: [description]
Max iterations: [N]
Success criteria: [condition]

---

### Iteration 1 / [N]

Step 2: Execute Iteration

For each iteration:

**Iteration [current] / [max]**

**Actions:**
1. [Action 1]
   → [result/output]
2. [Action 2]
   → [result/output]
3. [Action 3]
   → [result/output]

**Evaluation:**
- Success criteria met? [Yes/No]
- Progress made? [Yes/No]
- Issues found: [list any issues]

**Status:** [Continue/Success/Need intervention]

---

Step 3: Progress Tracking

Use TodoWrite to track iterations:

Iteration Progress:
✓ Iteration 1 - [what was done]
✓ Iteration 2 - [what was done]
⚙️ Iteration 3 - [in progress]
- Iteration 4 - [pending]
- Iteration 5 - [pending]

Phase 3: Exit Conditions

Exit Condition 1: Success

🎉 **Success! Loop complete.**

**Goal achieved:** [description]
**Iterations used:** [N] / [max]

**Final state:**
[description of what was achieved]

**Summary of iterations:**
1. Iteration 1: [what happened]
2. Iteration 2: [what happened]
...
N. Iteration N: [what happened] ✓ Success

Exit Condition 2: Max Iterations Reached

⚠️ **Max iterations reached without full success**

**Iterations completed:** [max]
**Goal:** [description]
**Current state:** [how close we got]

**Progress made:**
- [Improvement 1]
- [Improvement 2]
- [Improvement 3]

**Remaining issues:**
- [Issue 1]
- [Issue 2]

**Options:**
1. Accept current state (substantial progress made)
2. Continue with [N] more iterations
3. Change approach (current method may not work)

What would you like to do?

Exit Condition 3: No Progress Detected

🛑 **Stopping early: No progress detected**

**Iteration:** [N] / [max]
**Reason:** Last [M] iterations showed no improvement

**Analysis:**
This suggests the current approach may be fundamentally flawed.

**Recommendation:**
Rather than continue looping, let's:
1. Analyze why no progress is being made
2. Consider alternative approaches
3. Re-evaluate the goal or success criteria

Shall we pause and reassess?

Common Patterns

Pattern 1: Loop with Testing

User: "Loop around 5 times auditing, enhancing, testing, until it's done"

Implementation:

**Loop Goal:** Code passes all quality gates
**Max Iterations:** 5
**Per-iteration:**
1. Audit code for issues
2. Enhance/fix identified issues
3. Run tests
4. Check if all pass

**Success:** All tests pass + no issues found

Execute:
Iteration 1:
- Audit → Found 8 issues
- Fix → Fixed 8 issues
- Test → 2 tests still failing
- Continue

Iteration 2:
- Audit → Found 2 new issues from fixes
- Fix → Fixed 2 issues
- Test → All tests pass ✓
- Success! Stopping early (2/5 iterations used)

Pattern 2: Performance Optimization Loop

User: "Keep trying optimizations until we hit < 100ms response time"

Implementation:

**Loop Goal:** Response time < 100ms
**Max Iterations:** 10
**Per-iteration:**
1. Measure current performance
2. Identify bottleneck
3. Apply optimization
4. Re-measure

**Success:** Response time < 100ms

Execute:
Iteration 1: 450ms → Cache database queries → 280ms (Continue)
Iteration 2: 280ms → Add index to frequent query → 150ms (Continue)
Iteration 3: 150ms → Implement response compression → 85ms (Success!)

Pattern 3: Retry with Backoff

User: "Try deploying, retry up to 3 times if it fails"

Implementation:

**Loop Goal:** Successful deployment
**Max Iterations:** 3
**Per-iteration:**
1. Attempt deployment
2. Check status
3. If failed, wait before retry

**Success:** Deployment succeeds

Execute:
Iteration 1: Deploy → Failed (API timeout) → Wait 10s
Iteration 2: Deploy → Failed (API timeout) → Wait 20s
Iteration 3: Deploy → Success ✓

Pattern 4: Incremental Refinement

User: "Iterate 4 times improving the error messages based on user feedback"

Implementation:

**Loop Goal:** Error messages meet clarity standard
**Max Iterations:** 4
**Per-iteration:**
1. Review current error messages
2. Identify confusing ones
3. Rewrite for clarity
4. Evaluate against criteria

**Success:** All messages rated 8+/10 for clarity

Execute each iteration with progressive improvement

Integration with Other Skills

With skill-debug

Loop for debugging:
"Keep debugging until all tests pass, max 5 tries"

Each iteration:
- Use skill-debug to investigate failure
- Apply fix
- Re-run tests
- Evaluate

With skill-audit

Loop for comprehensive checking:
"Loop 3 times auditing different aspects"

Iteration 1: Audit security
Iteration 2: Audit performance
Iteration 3: Audit accessibility

With skill-tdd

Loop for TDD cycles:
"Do 5 red-green-refactor cycles"

Each iteration:
- Write failing test (red)
- Make it pass (green)
- Refactor (refactor)
- Evaluate and continue

Best Practices

1. Always Define Max Iterations

Good:

Loop max 5 times trying to fix the issue

Dangerous:

Keep trying until it works
(What if it never works? Infinite loop!)

2. Measurable Success Criteria

Good:

Success: All 15 tests pass AND code coverage > 80%

Poor:

Success: Code looks better
(Too subjective)

3. Make Progress Visible

**Progress Tracking:**

Iteration 1: 5/15 tests passing
Iteration 2: 10/15 tests passing
Iteration 3: 13/15 tests passing
Iteration 4: 15/15 tests passing ✓

4. Early Exit on Success

Don't continue looping if goal is achieved:

**Iteration 2/5:** All tests pass!

Stopping early - goal achieved.
No need to continue to iteration 3.

5. Detect Stalls

Iteration 4: 10/15 tests passing
Iteration 5: 10/15 tests passing
Iteration 6: 10/15 tests passing

⚠️ No progress in 3 iterations - stopping to reassess approach

Red Flags - Don't Do This

Action	Why It's Dangerous
No max iterations	Could loop forever
Vague success criteria	Don't know when to stop
No progress tracking	Can't tell if making progress
Ignoring stalls	Waste time on ineffective approach
Same action each loop	If not working, need different approach

Strategy Rotation

Self-Regulation (MANDATORY)

Every iterative loop MUST track a Self-Regulation Score that accumulates danger signals. This prevents runaway loops where the agent keeps "fixing" things without real progress.

Sliding-Window Stuck Detection

Maintain a mental window of the last 10 iterations (or fewer if less than 10 have run). After each iteration, check for repeated patterns:

Single-state repetition: Did the same outcome/error occur 3+ times consecutively?

Same test failure, same error message, same files modified → STUCK

Multi-step cycle detection: Is there an A→B→A→B oscillation?

Iteration N touches file X, N+1 touches file Y, N+2 touches file X again, N+3 touches Y again → CYCLE DETECTED
Compare the files modified and error messages across iterations, not just success/failure

On second detection: HALT immediately. Display the detected cycle and ask the user whether to continue with a completely different approach or stop.

WTF-Likelihood Score

Track a cumulative score starting at 0%. Each event adds to the score.

Default weights (override via ~/.claude-octopus/loop-config.conf):

Event	Score Impact
Revert (git revert, undo, roll back)	+15%
Touching files unrelated to the stated goal	+20%
A fix that requires changing >3 files	+5%
After the 15th fix attempt	+1% per additional fix
All remaining issues are Low severity	+10%

If WTF score exceeds 20%: STOP immediately. Show:

The current WTF score and what contributed to it
Work completed so far
Ask the user: "Continue with a different approach, or stop here?"

Hard cap: 50 iterations regardless of score or progress. No exceptions.

Configurable Weights

At loop start, check for ~/.claude-octopus/loop-config.conf. If it exists, read the key=value pairs and use them instead of defaults. Format:

# Loop Self-Regulation Configuration
WINDOW_SIZE=10
REVERT_PENALTY=15
UNRELATED_FILES_PENALTY=20
LARGE_FIX_PENALTY=5
AFTER_FIX_15_PENALTY=1
ALL_LOW_SEVERITY_PENALTY=10
WTF_THRESHOLD=20
HARD_CAP=50
STUCK_THRESHOLD=3

If the file does not exist, use the defaults shown above. Users can create this file to tune sensitivity for their workflow.

How to Track

You do NOT need external tools for this. Track mentally during the loop:

After each iteration, mentally note: files touched, outcome, whether a revert happened
Compare against the sliding window of recent iterations
Accumulate the WTF score
Report both the iteration count AND the self-regulation score in each iteration summary:

Iteration 5/20 | Self-regulation: 10% (1 revert, 0 unrelated files)

Interaction with Strategy Rotation

The strategy-rotation hook and self-regulation are complementary:

Strategy rotation fires on consecutive tool failures (same tool, same error)
Self-regulation fires on outcome patterns (cycles, reverts, scope creep)
Both can fire independently. If both fire, HALT — the loop is definitely stuck.

Safety Mechanisms

1. Iteration Limit

MAX_ITERATIONS = user_specified or 10  # Always have a limit
HARD_CAP = 50  # Absolute maximum regardless of user setting

2. Self-Regulation Score

Track WTF score across iterations.
If score > 20%: STOP and ask user.

3. Sliding-Window Detection

Track last 10 iterations.
If repeated pattern detected twice: STOP and ask user.

4. Progress Detection

If last 3 iterations show same result:
  → Stop and ask user

5. Time Limit (for long operations)

If total time > 30 minutes:
  → Checkpoint progress
  → Ask user if should continue

6. User Checkpoints

Every N iterations:
  → Show progress
  → Ask if should continue or adjust approach

Quick Reference

Pattern	Max Iterations	Success Criteria	Early Exit
Test until pass	5-10	All tests pass	Yes
Performance optimization	10-20	Metric < target	Yes
Retry with backoff	3-5	Operation succeeds	Yes
Incremental refinement	3-7	Quality threshold met	Maybe
Comprehensive audit	3-5	All areas covered	No

Metric Verification Mode

Falls back to standard loop behavior (above) when no metric is specified.

Key Principles

One change per iteration (atomic) — never combine multiple unrelated changes
Mechanical verification only — no subjective "looks good"; the metric command decides
Automatic rollback on regression — git revert HEAD --no-edit if metric worsens
Simplicity wins — equal metric results + less code = KEEP the simpler version
Git is memory — every experiment is committed with experiment: prefix before verification
Guard commands must also pass — even if metric improves, a failing guard reverts the change

Parameters

Parameter	Format	Required	Description
Metric	`Metric: <shell command>`	Yes (for this mode)	Command whose stdout is a number (the metric value)
Direction	`Direction: higher\|lower`	Yes	Whether higher or lower metric values are better
Guard	`Guard: <shell command>`	No	Must exit 0 for a change to be kept; run after metric
Iterations	`Iterations: N`	No	Max iterations (default: unbounded, runs until interrupted)

Experiment Log

All results are logged as JSONL to .claude-octopus/experiments/<YYYY-MM-DD>.jsonl.

Each line is a JSON object:

{"iteration": 1, "timestamp": "2026-03-21T14:30:00Z", "metric": 72.5, "best": 72.5, "status": "kept", "description": "Add index to users table", "commit": "abc1234"}

Fields:

iteration — iteration number (starting from 1; iteration 0 is baseline)
timestamp — ISO 8601 timestamp
metric — measured value from the metric command
best — best metric value seen so far
status — "kept" (improvement), "reverted" (regression), or "error" (metric/guard crashed)
description — one-line summary of what was changed
commit — short git SHA of the experiment commit (before potential revert)

Execution Contract

You MUST follow this exact sequence for each iteration. No steps may be skipped or reordered.

Iteration 0: Establish Baseline

Create the experiment log directory if it does not exist:
```
mkdir -p .claude-octopus/experiments
```
Check for existing experiment log — if .claude-octopus/experiments/<today>.jsonl exists, read it to determine the current best metric value and iteration count. Resume from the next iteration number.
Run the metric command and capture the output number. This is the baseline.

Log the baseline:

{"iteration": 0, "timestamp": "...", "metric": <baseline>, "best": <baseline>, "status": "baseline", "description": "Baseline measurement", "commit": "<current HEAD short SHA>"}

Report the baseline value to the user.

Each Subsequent Iteration (1..N)

Step 2: Pick the next change. Based on what worked/failed/is untried, decide on ONE focused change. Do NOT combine multiple unrelated changes.

Step 3: Make the change. Implement exactly one atomic change.

Step 4: Git commit BEFORE verification. Commit with the experiment: prefix:

git add -A && git commit -m "experiment: <one-line description of the change>"

This ensures every experiment is recorded in git history regardless of outcome.

Step 5: Run mechanical verification. Execute the metric command and capture the numeric result.

Step 6: Evaluate and act.

If metric improved (higher when Direction=higher, lower when Direction=lower):
- If a Guard command is specified, run it now.
  - If guard passes (exit 0) → KEEP the commit. Update best metric.
  - If guard fails (exit non-zero) → REVERT: git revert HEAD --no-edit. Log status as "reverted".
- If no guard → KEEP the commit. Update best metric.
If metric stayed the same:
- Check if the change reduces code complexity or size. If simpler → KEEP (simplicity wins).
- Otherwise → REVERT: git revert HEAD --no-edit. Log status as "reverted".
If metric worsened:
- REVERT: git revert HEAD --no-edit. Log status as "reverted".
If metric command crashed (non-zero exit, no numeric output):
- Attempt a quick fix (one try only). If fix works, re-measure.
- If still broken → REVERT: git revert HEAD --no-edit. Log status as "error".

Step 7: Log the result. Append a JSONL entry to .claude-octopus/experiments/<today>.jsonl.

Step 8: Report iteration summary. Display:

Iteration N: <description>
  Metric: <value> (best: <best>) — <kept|reverted|error>

Step 9: Repeat — go to Step 1 of the next iteration, unless:

Iterations limit reached → stop and report final summary
User interrupts → stop and report final summary

Resume Behavior

If an experiment log already exists for today:

Read the log to determine the last iteration number and current best metric value
Resume from the next iteration number
Use the recorded best value as the comparison baseline
This allows stopping and resuming experiments across sessions

Final Summary

When the loop completes (iterations exhausted or user stops), report:

Experiment Complete
  Iterations: N
  Baseline: <initial metric>
  Final best: <best metric>
  Improvement: <delta> (<percentage>%)
  Kept: K changes, Reverted: R changes, Errors: E

Example

/octo:loop Metric: npm test -- --coverage | grep 'All files' | awk '{print $10}' Direction: higher Guard: npm test Iterations: 20

This will:

Measure baseline code coverage percentage
Each iteration: make one change, commit as experiment: ..., measure coverage
If coverage improves AND npm test passes → keep
If coverage drops OR tests fail → git revert HEAD --no-edit
After 20 iterations, report total improvement

The Bottom Line

Iterative loop → Clear goal + Max iterations + Progress tracking + Exit strategy
Otherwise → Infinite loops + Wasted effort + Unclear when done

Define the goal. Set the limit. Track progress. Know when to stop.

skill-iterative-loop

More from this repository

More from this repository

Iterative Loop Execution

Overview

When to Use

The Process

Phase 1: Loop Setup

Step 1: Understand the Intent

Step 2: Clarify Parameters

Step 3: Safety Checks

Phase 2: Loop Execution

Step 1: Initialize Loop

Step 2: Execute Iteration

Step 3: Progress Tracking

Phase 3: Exit Conditions

Exit Condition 1: Success

Exit Condition 2: Max Iterations Reached

Exit Condition 3: No Progress Detected

Common Patterns

Pattern 1: Loop with Testing

Pattern 2: Performance Optimization Loop

Pattern 3: Retry with Backoff

Pattern 4: Incremental Refinement

Integration with Other Skills

With skill-debug

With skill-audit

With skill-tdd

Best Practices

1. Always Define Max Iterations

2. Measurable Success Criteria

3. Make Progress Visible

4. Early Exit on Success

5. Detect Stalls

Red Flags - Don't Do This

Strategy Rotation

Self-Regulation (MANDATORY)

Sliding-Window Stuck Detection

WTF-Likelihood Score

Configurable Weights

How to Track

Interaction with Strategy Rotation

Safety Mechanisms

1. Iteration Limit

2. Self-Regulation Score

3. Sliding-Window Detection

4. Progress Detection

5. Time Limit (for long operations)

6. User Checkpoints

Quick Reference

Metric Verification Mode

Key Principles

Parameters

Experiment Log

Execution Contract

Iteration 0: Establish Baseline

Each Subsequent Iteration (1..N)

Resume Behavior

Final Summary

Example

The Bottom Line

Iterative Loop Execution

Overview

When to Use

The Process

Phase 1: Loop Setup

Step 1: Understand the Intent

Step 2: Clarify Parameters

Step 3: Safety Checks

Phase 2: Loop Execution

Step 1: Initialize Loop

Step 2: Execute Iteration

Step 3: Progress Tracking

Phase 3: Exit Conditions

Exit Condition 1: Success

Exit Condition 2: Max Iterations Reached

Exit Condition 3: No Progress Detected

Common Patterns

Pattern 1: Loop with Testing

Pattern 2: Performance Optimization Loop