
Name: Sweep
Author: JoaquinCampo

// Prepare and run a KV-cache compression sweep. Loads sweep configuration, validates prerequisites, and provides the exact commands needed. Use before starting any GPU experiment.


stars:0

forks:0

updated: February 23, 2026 at 05:30


related-skills.json

same repository

advise.md

from "JoaquinCampo/kvguard"

Search past decisions, failures, and experiment logs for relevant context before starting a task. Use this before any significant implementation or experiment to avoid repeating mistakes.

2026-02-23

analyze-results.md

from "JoaquinCampo/kvguard"

Analyze sweep results after completion. Computes accuracy, CFR, signal statistics, and generates comparison tables. Use after a sweep completes to understand the data.

2026-02-23

compare-models.md

from "JoaquinCampo/kvguard"

Compare results across models (Qwen vs Llama) at matching compression configurations. Generates side-by-side tables and identifies cross-model patterns.

2026-02-23

retrospective.md

from "JoaquinCampo/kvguard"

After completing a significant task or experiment, extract lessons learned and update the project knowledge base. Captures what worked, what failed, and what to remember for next time.

2026-02-23

package.json

"author": "JoaquinCampo"

"repository": "JoaquinCampo/kvguard"


SOC: Data Scientists, Computer and Mathematical Occupations, 15-2051, L4

name	sweep
description	Prepare and run a KV-cache compression sweep. Loads sweep configuration, validates prerequisites, and provides the exact commands needed. Use before starting any GPU experiment.

Sweep Preparation and Execution

Before running a sweep, verify prerequisites and provide the correct commands.

Steps

Check environment:
- Run uv run python -c "import torch; print(f'CUDA: {torch.cuda.is_available()}, MPS: {torch.backends.mps.is_available()}')" to verify GPU access
- Run make check to ensure code is clean
Check existing results:
- Count files: find results -name "*_$1p.json" 2>/dev/null | wc -l
- List what's done: find results -name "*_$1p.json" -exec basename {} \;
- Check checkpoints: find results -name "*.ckpt.jsonl" -exec wc -l {} \;
Validate model access:
- Check if model is cached: ls ~/.cache/huggingface/hub/models--$(echo "$0" | sed 's|/|--|g')/ 2>/dev/null
- If not cached, suggest: bash scripts/download_models.sh
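
The result and cache checks above can also be sketched in Python, which avoids shell quoting pitfalls. This is a minimal sketch, not part of kvguard: the function names are hypothetical, and it assumes the default Hugging Face hub location (an HF_HOME override is not handled).

```python
from pathlib import Path
from typing import List, Optional


def hf_cache_dir(repo_id: str, hub_root: Optional[Path] = None) -> Path:
    """Return the hub cache directory for a model repo id.

    The hub cache stores models under "models--<org>--<name>", so every "/"
    in the repo id becomes "--". The default root is an assumption.
    """
    root = hub_root or Path.home() / ".cache" / "huggingface" / "hub"
    return root / ("models--" + repo_id.replace("/", "--"))


def finished_results(results_dir: Path, num_prompts: int) -> List[str]:
    """List result file names for a prompt count, mirroring the find command."""
    pattern = f"*_{num_prompts}p.json"
    return sorted(p.name for p in results_dir.rglob(pattern) if p.is_file())


if __name__ == "__main__":
    cache = hf_cache_dir("Qwen/Qwen2.5-7B-Instruct")
    print(f"{cache} cached: {cache.is_dir()}")
    print(f"finished result files: {len(finished_results(Path('results'), 500))}")
```

If the cache directory is missing, the download script from the step above is still the way to fetch the model.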

Provide the sweep command:

uv run kvguard sweep \
  --num-prompts ${1:-500} \
  --model "${0:-Qwen/Qwen2.5-7B-Instruct}" \
  --output-dir results \
  --max-new-tokens 512
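
The ${1:-500} and ${0:-...} placeholders fall back to a default when the argument is missing or empty. A minimal Python sketch of the same defaulting, assuming only the flags shown in the command above (the helper names are hypothetical and not part of kvguard):

```python
import shlex

DEFAULT_MODEL = "Qwen/Qwen2.5-7B-Instruct"
DEFAULT_NUM_PROMPTS = 500


def _default(value, fallback):
    """Mimic the shell's ${var:-fallback}: use fallback if unset or empty."""
    return fallback if value in (None, "") else value


def build_sweep_command(model=None, num_prompts=None):
    """Build the argv list for the sweep command shown above."""
    return [
        "uv", "run", "kvguard", "sweep",
        "--num-prompts", str(_default(num_prompts, DEFAULT_NUM_PROMPTS)),
        "--model", _default(model, DEFAULT_MODEL),
        "--output-dir", "results",
        "--max-new-tokens", "512",
    ]


if __name__ == "__main__":
    # Print a copy-pasteable command line with both defaults applied.
    print(shlex.join(build_sweep_command()))
```

Returning a list rather than a string keeps the command safe to pass to subprocess.run without a shell.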

For the full Phase 2 pipeline (both models + train + eval):

nohup bash scripts/run_phase2.sh &
# Monitor with: bash scripts/check_status.sh

Sweep configuration

16 configs per model:

Baseline: none @ 0.0
StreamingLLM: 0.25, 0.5, 0.625, 0.75, 0.875
SnapKV: 0.25, 0.5, 0.625, 0.75, 0.875
ObservedAttention: 0.25, 0.5, 0.625, 0.75, 0.875
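
The count of 16 follows from one baseline plus three compression methods at five ratios each. A short sketch of that enumeration, assuming the method names as displayed above (the identifiers kvguard actually accepts may differ):

```python
from itertools import product

# Compression ratios and method names as listed above.
RATIOS = [0.25, 0.5, 0.625, 0.75, 0.875]
PRESSES = ["StreamingLLM", "SnapKV", "ObservedAttention"]


def sweep_configs():
    """Return (press, ratio) pairs: the baseline plus every press/ratio combination."""
    return [("none", 0.0)] + list(product(PRESSES, RATIOS))


if __name__ == "__main__":
    configs = sweep_configs()
    print(len(configs))  # 1 baseline + 3 presses x 5 ratios
    for press, ratio in configs:
        print(f"{press} @ {ratio}")
```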

Expected output: results/{press}/{ModelShort}_{ratio}_{num_prompts}p.json

Checkpoint files: results/{press}/{ModelShort}_{ratio}.ckpt.jsonl (auto-resume on restart)
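
The two path templates above can be filled in as follows. This is an illustrative sketch only: the function names are hypothetical, and how kvguard spells the press directory, the short model name, and the ratio is not stated here, so all three are passed in as plain strings.

```python
from pathlib import Path


def result_path(press, model_short, ratio, num_prompts, root="results"):
    """Fill in results/{press}/{ModelShort}_{ratio}_{num_prompts}p.json."""
    return Path(root) / press / f"{model_short}_{ratio}_{num_prompts}p.json"


def checkpoint_path(press, model_short, ratio, root="results"):
    """Fill in results/{press}/{ModelShort}_{ratio}.ckpt.jsonl."""
    return Path(root) / press / f"{model_short}_{ratio}.ckpt.jsonl"


if __name__ == "__main__":
    # Example values are placeholders, not confirmed kvguard names.
    print(result_path("snapkv", "Qwen", "0.875", 500))
    print(checkpoint_path("snapkv", "Qwen", "0.875"))
```

Both names match the find patterns used in the existing-results check (*_{num_prompts}p.json and *.ckpt.jsonl).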

Time estimates (5090 GPU)

500 prompts x 16 configs ≈ 12-18h per model
Full Phase 2 (2 models) ≈ 48-60h total including train/eval
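
As a sanity check on the per-model estimate, the figures above imply a per-generation budget. A small sketch of the arithmetic, assuming the time is spread evenly over all 500 x 16 = 8000 generations (in practice it will vary with method and ratio):

```python
def seconds_per_generation(hours, num_prompts=500, num_configs=16):
    """Average wall-clock seconds per generation for one model's sweep."""
    return hours * 3600 / (num_prompts * num_configs)


if __name__ == "__main__":
    low = seconds_per_generation(12)
    high = seconds_per_generation(18)
    print(f"{low:.1f}-{high:.1f} s per generation")
```

If early checkpoints show a much slower rate than this range, the full sweep will likely overrun the estimate.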
