Run any Skill in Manus with one click

triple-loop-learning

(Industry standard: Meta-Learning System / Automated Autoresearch) Primary Use Case: Continuous, self-improving orchestration of an agentic system over multiple sessions. Use when: building a continuous improvement layer that autonomously identifies workflow friction, postulates hypotheses, and tests improved instructions/coding skills against an objective headless benchmark before merging and persisting.

Run Skill in Manus

Stars3

Forks2

UpdatedJune 8, 2026 at 06:06

Source

richfrem

richfrem/agent-plugins-skills

View GitHub Repository View Creator Repositories

Install command

Download

Run Skill in Manus

Useful forSOC

Software DevelopersComputer and Mathematical Occupations15-1252L4

File Explorer

2 files

SKILL.md

readonly

name	triple-loop-learning
plugin	agent-loops
description	(Industry standard: Meta-Learning System / Automated Autoresearch) Primary Use Case: Continuous, self-improving orchestration of an agentic system over multiple sessions. Use when: building a continuous improvement layer that autonomously identifies workflow friction, postulates hypotheses, and tests improved instructions/coding skills against an objective headless benchmark before merging and persisting.
allowed-tools	Bash, Read, Write

Dependencies

This skill requires Python 3.8+ and standard library only.

Evaluation gate: NOT included in this primitive. The calling system (e.g., agent-agentic-os os-improvement-loop) is responsible for wrapping this skill with an eval gate and experiment log.

Triple-Loop Learning (Meta-Learning System)

This skill defines the orchestration pattern for the Triple-Loop Architecture. Pattern 5 is a robust, autonomous feedback loop where an independent Meta-Learning Orchestrator governs a long-horizon pipeline of execution, planning, and tactical problem-solving.

This architecture is entirely framework-agnostic. While originally developed for agent-agentic-os, it models the core loop defined by Meta-Harness research where autonomous systems evolve their own operating instructions based strictly on headless evaluators.

Architecture Overview

flowchart TD
    subgraph Outer["Outer Loop (Meta-Learning & Orchestration)"]
        Hypothesize[Hypothesis Generation] --> StrategyBridge[Strategy Packet]
        Report --> EvalBridge[Score Analysis]
        EvalBridge --> Conclude[Accept / Reject Hypothesis]
    end

    subgraph Mid["Strategic Planner (Dual-Loop Integration)"]
        Plan[Define Sub-tasks] --> TacticalBridge[Handoff Packet]
        Result[Aggregate Results] --> Report[Generate Report]
    end

    subgraph Inner["Tactical Executor (Single-Loop Integration)"]
        Execute[Code Mutation] --> Test[Headless Evaluation]
        Test --> ResultBridge[Pass/Fail Signal]
    end

    StrategyBridge --> Plan
    TacticalBridge --> Execute
    ResultBridge --> Result

The Workflow Protocol

Step 1: Friction Aggregation (Outer Loop)

The Orchestrator constantly ingests execution logs from existing operations. Look for repeated uncertainties, API errors, test failures, or syntax flaws.
Group the friction into clustered tasks.

Step 2: Hypothesis Generation (Outer Loop)

Define a singular thesis: "If we change instruction X, the accuracy score on benchmark Y will improve by N."
Write a rigid Strategy Packet for the Mid-level Planner.

Step 3: Distribution (Strategic Planner)

Interactively Determine CLI and Model (ask once during bootstrap): Interactively prompt the user to select the CLI backend (agy, claude, copilot, etc.) and the specific model to run mutations and evaluation.
The Planner assigns disjoint code fixes to one or multiple Tactical Executors using the selected CLI and model.
Ensure test boundaries and standard input redirection (appending < /dev/null to commands) are defined to prevent SIGTTIN process freezes.

Step 4: Mutation & Headless Scoring (Tactical Executor)

Constraint: Subjective LLM analysis is expressly prohibited.

Apply the instruction set or code adjustment.
Run pure, headless deterministic tests. Return an objective integer/float score, not opinions.

Step 5: Verification & Promotion (Outer Loop - Trust But Verify)

Read the objective score differentials. No blind trust is allowed.
TDD / Test Check: The promotion logic MUST be backed by headless evaluation. Run the full regression test suite on mutated code.
Delta Inspection: Check the source diffs for any stub placeholders ("TODO", "TBD", "[NEEDS INPUT]") and verify syntax cleanliness.
KEEP only if Accuracy AND F1 score pass the current baseline. Reject otherwise.
Postulate a retrospective mapping for continuous system-wide instructions improvement.

More from this repository

same repository

agent-swarm

richfrem/agent-plugins-skills

(Industry standard: Parallel Agent) Primary Use Case: Work that can be partitioned into independent sub-tasks running concurrently across multiple agents. Parallel multi-agent execution pattern. Use when: work can be partitioned into independent tasks that N agents can execute simultaneously across worktrees. Includes routing (sequential vs parallel), merge verification, and correction loops.

2026-06-083

dual-loop

richfrem/agent-plugins-skills

(Industry standard: Sequential Agent / Agent as a Tool) Primary Use Case: Delegating a well-defined task to a worker agent, verifying its execution, and repeating if necessary. Inner/outer agent delegation pattern. Use when: work needs to be delegated from a strategic controller (Outer Loop) to a tactical executor (Inner Loop) via strategy packets, with verification and correction loops.

2026-06-083

learning-loop

richfrem/agent-plugins-skills

(Industry standard: Loop Agent / Single Agent) Primary Use Case: Self-contained research, content generation, and exploration where no inner delegation is required. Self-directed research and knowledge capture loop. Use when: starting a session (Orientation), performing research (Synthesis), or closing a session (Seal, Persist, Retrospective). Ensures knowledge survives across isolated agent sessions.

2026-06-083

orchestrator

richfrem/agent-plugins-skills

(Industry standard: Routing Agent / Orchestrator Pattern) Primary Use Case: Analyzing an ambiguous trigger and routing it to one of the specific specialized implementations. Routes triggers to the appropriate agent-loop pattern. Use when: assessing a task, research need, or work assignment and deciding whether to run a simple learning loop, red team review, dual-loop delegation, or parallel swarm. Manages shared closure (seal, persist, retrospective, self-improvement).

2026-06-083

red-team-review

richfrem/agent-plugins-skills

(Industry standard: Review and Critique Pattern) Primary Use Case: Iterative generation paired with adversarial review, continuing until an 'Approved' verdict is reached. Orchestrated adversarial review loop. Use when: research, designs, architectures, or decisions need to be reviewed by red team agents (human, browser, or CLI). Iterates in rounds of research → bundle → review → feedback until approved.

2026-06-083

agy-cli-agent

richfrem/agent-plugins-skills

Antigravity (`agy`) CLI sub-agent system for frontier Google Gemini models. Use when dispatching tasks to Gemini 3.5 Flash and above via the `agy` binary. For cheaper/older Gemini models (gemini-3-flash-preview, gemini-3.1-pro-preview), use gemini-cli-agent instead. Trigger with "use agy", "dispatch to antigravity", "run with agy", "use frontier gemini model", or "agy sub-agent".

2026-06-083

name	triple-loop-learning
plugin	agent-loops
description	(Industry standard: Meta-Learning System / Automated Autoresearch) Primary Use Case: Continuous, self-improving orchestration of an agentic system over multiple sessions. Use when: building a continuous improvement layer that autonomously identifies workflow friction, postulates hypotheses, and tests improved instructions/coding skills against an objective headless benchmark before merging and persisting.
allowed-tools	Bash, Read, Write

Dependencies

This skill requires Python 3.8+ and standard library only.

Evaluation gate: NOT included in this primitive. The calling system (e.g., agent-agentic-os os-improvement-loop) is responsible for wrapping this skill with an eval gate and experiment log.

Triple-Loop Learning (Meta-Learning System)

Architecture Overview

flowchart TD
    subgraph Outer["Outer Loop (Meta-Learning & Orchestration)"]
        Hypothesize[Hypothesis Generation] --> StrategyBridge[Strategy Packet]
        Report --> EvalBridge[Score Analysis]
        EvalBridge --> Conclude[Accept / Reject Hypothesis]
    end

    subgraph Mid["Strategic Planner (Dual-Loop Integration)"]
        Plan[Define Sub-tasks] --> TacticalBridge[Handoff Packet]
        Result[Aggregate Results] --> Report[Generate Report]
    end

    subgraph Inner["Tactical Executor (Single-Loop Integration)"]
        Execute[Code Mutation] --> Test[Headless Evaluation]
        Test --> ResultBridge[Pass/Fail Signal]
    end

    StrategyBridge --> Plan
    TacticalBridge --> Execute
    ResultBridge --> Result

The Workflow Protocol

Step 1: Friction Aggregation (Outer Loop)

The Orchestrator constantly ingests execution logs from existing operations. Look for repeated uncertainties, API errors, test failures, or syntax flaws.
Group the friction into clustered tasks.

Step 2: Hypothesis Generation (Outer Loop)

Define a singular thesis: "If we change instruction X, the accuracy score on benchmark Y will improve by N."
Write a rigid Strategy Packet for the Mid-level Planner.

Step 3: Distribution (Strategic Planner)

Interactively Determine CLI and Model (ask once during bootstrap): Interactively prompt the user to select the CLI backend (agy, claude, copilot, etc.) and the specific model to run mutations and evaluation.
The Planner assigns disjoint code fixes to one or multiple Tactical Executors using the selected CLI and model.
Ensure test boundaries and standard input redirection (appending < /dev/null to commands) are defined to prevent SIGTTIN process freezes.

Step 4: Mutation & Headless Scoring (Tactical Executor)

Constraint: Subjective LLM analysis is expressly prohibited.

Apply the instruction set or code adjustment.
Run pure, headless deterministic tests. Return an objective integer/float score, not opinions.

Step 5: Verification & Promotion (Outer Loop - Trust But Verify)

Read the objective score differentials. No blind trust is allowed.
TDD / Test Check: The promotion logic MUST be backed by headless evaluation. Run the full regression test suite on mutated code.
Delta Inspection: Check the source diffs for any stub placeholders ("TODO", "TBD", "[NEEDS INPUT]") and verify syntax cleanliness.
KEEP only if Accuracy AND F1 score pass the current baseline. Reject otherwise.
Postulate a retrospective mapping for continuous system-wide instructions improvement.