Run any Skill in Manus with one click

$pwd:

kv-tool-loop-stability

Name: Kv Tool Loop Stability
Author: Mesh-LLM

// Use this skill when certifying mesh-llm KV/cache stability under repeated OpenAI tool-call loops, same-prefix cache reuse, suffix-prefill limits, or native Skippy slot/decode/eviction failures.

Run Skill in Manus

$ git log --oneline --stat

stars:1,083

forks:134

updated:May 26, 2026 at 23:38

SKILL.md

readonly

name	kv-tool-loop-stability
description	Use this skill when certifying mesh-llm KV/cache stability under repeated OpenAI tool-call loops, same-prefix cache reuse, suffix-prefill limits, or native Skippy slot/decode/eviction failures.
metadata	{"short-description":"Certify KV/tool-loop stability"}

KV Tool-Loop Stability

Use this skill when changing Skippy KV slot cleanup, prefix-cache lookup, OpenAI tool-loop behavior, agent harnesses, or any runtime path related to llama_decode failed, failed to find a memory slot, low same-prefix cache reuse, or proactive eviction failures.

Workflow

Attach to an existing OpenAI-compatible /v1 endpoint. This harness does not start nodes, load models, join meshes, or change routing policy.
Prefer a direct model when reproducing Skippy KV/cache issues. Use auto only when intentionally validating routed behavior.
Run --print-plan first and confirm the models, attempts, pressure_turns, timeout, cache thresholds, output directory, and native logs.
Pass the active Skippy native log when available. The harness checkpoints native logs at run start and scans only appended bytes.
Preserve the evidence directory: manifest.json, results.jsonl, summary.json, summary.md, and transcripts/*.jsonl.

Commands

Preview the run without touching the endpoint:

scripts/qa-kv-tool-loop-stability.py \
  --base-url http://127.0.0.1:9337/v1 \
  --models Qwen/Qwen2.5-3B-Instruct-GGUF:q4_k_m \
  --attempts 5 \
  --pressure-turns 8 \
  --timeout 180 \
  --min-cached-tokens 2048 \
  --suffix-prefill-limit 256 \
  --native-log ~/.mesh-llm/runtime/<pid>/logs/skippy-native.log \
  --output-dir target/kv-tool-loop-stability/local \
  --print-plan

Run the certification:

scripts/qa-kv-tool-loop-stability.py \
  --base-url http://127.0.0.1:9337/v1 \
  --models Qwen/Qwen2.5-3B-Instruct-GGUF:q4_k_m \
  --attempts 5 \
  --pressure-turns 8 \
  --timeout 180 \
  --min-cached-tokens 2048 \
  --suffix-prefill-limit 256 \
  --native-log ~/.mesh-llm/runtime/<pid>/logs/skippy-native.log \
  --output-dir target/kv-tool-loop-stability/local

Reporting Rules

Report the model list, attempts, pressure turns, timeout, cache thresholds, success rate, native log paths, and output directory.
Include the summary verdict and failing phase details from summary.md or summary.json.
Do not paste full prompts, auth headers, huge stable prefixes, or private endpoint data.
If no native log is available, say that native-log scanning was not run.

Validation

When changing this harness, run:

python3 -m unittest scripts.tests.test_qa_kv_tool_loop_stability
python3 -m py_compile scripts/qa-kv-tool-loop-stability.py scripts/tests/test_qa_kv_tool_loop_stability.py

related-skills.json

same repository

skippy-bench.md

from "Mesh-LLM/mesh-llm"

Use this skill when running benchmark orchestration, local single-stage or split benchmarks, benchmark report flow, or performance-oriented skippy runtime checks.

2026-05-271.1k

skippy-correctness.md

from "Mesh-LLM/mesh-llm"

Use this skill when validating skippy staged execution against full-model execution, adding model families, changing split boundaries, testing activation wire dtypes, or diagnosing mismatch behavior.

2026-05-271.1k

skippy-model-package.md

from "Mesh-LLM/mesh-llm"

Use this skill when inspecting GGUF models, planning layer ranges, generating or validating skippy package artifacts, fake packages for direct GGUFs, materialized stage cache behavior, or GGUF writer integration.

2026-05-271.1k

skippy-server.md

from "Mesh-LLM/mesh-llm"

Use this skill when running, configuring, debugging, or embedding skippy-server, binary stage transport, OpenAI frontend integration, activation wire dtype settings, stage configs, lifecycle status, or nonblocking telemetry.

2026-05-271.1k

hf-layer-package-jobs.md

from "Mesh-LLM/mesh-llm"

Use when changing mesh-llm automation or CLI flows that discover Hugging Face GGUF models, plan CPU Hugging Face Jobs for layer-package splitting, estimate max cost, or publish skippy layer packages/catalog entries.

2026-05-081.1k

telemetry-privacy-review.md

from "Mesh-LLM/mesh-llm"

Use this skill when adding, renaming, removing, or reviewing mesh-llm OTLP metrics, telemetry attributes, metrics exporter settings, or telemetry documentation.

2026-05-071.1k

package.json

"author": "Mesh-LLM"

"repository": "Mesh-LLM/mesh-llm"

View GitHub Repository View Creator Repositories

$ install --global

$ download --local

Run Skill in Manus

$ useful --forSOC

Software Quality Assurance Analysts and TestersComputer and Mathematical Occupations15-1253L4

name	kv-tool-loop-stability
description	Use this skill when certifying mesh-llm KV/cache stability under repeated OpenAI tool-call loops, same-prefix cache reuse, suffix-prefill limits, or native Skippy slot/decode/eviction failures.
metadata	{"short-description":"Certify KV/tool-loop stability"}

KV Tool-Loop Stability

Workflow

Attach to an existing OpenAI-compatible /v1 endpoint. This harness does not start nodes, load models, join meshes, or change routing policy.
Prefer a direct model when reproducing Skippy KV/cache issues. Use auto only when intentionally validating routed behavior.
Run --print-plan first and confirm the models, attempts, pressure_turns, timeout, cache thresholds, output directory, and native logs.
Pass the active Skippy native log when available. The harness checkpoints native logs at run start and scans only appended bytes.
Preserve the evidence directory: manifest.json, results.jsonl, summary.json, summary.md, and transcripts/*.jsonl.

Commands

Preview the run without touching the endpoint:

scripts/qa-kv-tool-loop-stability.py \
  --base-url http://127.0.0.1:9337/v1 \
  --models Qwen/Qwen2.5-3B-Instruct-GGUF:q4_k_m \
  --attempts 5 \
  --pressure-turns 8 \
  --timeout 180 \
  --min-cached-tokens 2048 \
  --suffix-prefill-limit 256 \
  --native-log ~/.mesh-llm/runtime/<pid>/logs/skippy-native.log \
  --output-dir target/kv-tool-loop-stability/local \
  --print-plan

Run the certification:

scripts/qa-kv-tool-loop-stability.py \
  --base-url http://127.0.0.1:9337/v1 \
  --models Qwen/Qwen2.5-3B-Instruct-GGUF:q4_k_m \
  --attempts 5 \
  --pressure-turns 8 \
  --timeout 180 \
  --min-cached-tokens 2048 \
  --suffix-prefill-limit 256 \
  --native-log ~/.mesh-llm/runtime/<pid>/logs/skippy-native.log \
  --output-dir target/kv-tool-loop-stability/local

Reporting Rules

Report the model list, attempts, pressure turns, timeout, cache thresholds, success rate, native log paths, and output directory.
Include the summary verdict and failing phase details from summary.md or summary.json.
Do not paste full prompts, auth headers, huge stable prefixes, or private endpoint data.
If no native log is available, say that native-log scanning was not run.

Validation

When changing this harness, run:

python3 -m unittest scripts.tests.test_qa_kv_tool_loop_stability
python3 -m py_compile scripts/qa-kv-tool-loop-stability.py scripts/tests/test_qa_kv_tool_loop_stability.py

kv-tool-loop-stability

KV Tool-Loop Stability

Workflow

Commands

Reporting Rules

Validation

More from this repository

More from this repository

KV Tool-Loop Stability

Workflow

Commands

Reporting Rules

Validation