تشغيل أي مهارة في Manus بنقرة واحدة

ابدأ الآن

scan-logs

النجوم١٬١٢٩

التفرعات١٣٣

آخر تحديث١٩ مايو ٢٠٢٦ في ٢٣:٣٠

Scan logs too large to read directly, using Gemini (scripts/logscan.py).

التثبيت

التثبيت باستخدام Codex أو Claude انسخ هذا Prompt والصقه في Codex أو Claude أو مساعد آخر ليراجع صفحة Skill ويثبّتها لك.

تشغيل في Manus

المصدر

marin-community

marin-community/marin

فتح مستودع GitHub عرض مستودعات المنشئ

تنزيل

تشغيل في Manus

المهن ذات الصلةSOC

استنادا إلى تصنيف SOC المهني

مديرو الشبكات وأنظمة الحاسوبمهن الحاسوب والرياضيات·SOC 15-1244

SKILL.md

readonly

name	scan-logs
description	Scan logs too large to read directly, using Gemini (scripts/logscan.py).

Skill: Scan Logs

Use scripts/logscan.py to analyze large log files. Two composable modes — grep (find matching lines) and summarize (produce a markdown report) — used independently or piped together.

When to Use

Log files too large to read in context (>1000 lines)
Searching for errors, anomalies, or patterns in job/worker/controller logs
Triaging failures from Iris, Zephyr, or training jobs

Prerequisites

GEMINI_API_KEY must be set.

Modes

grep — find matching lines

Returns original log lines (with line numbers) matching a natural-language query. Uses small chunks (~5k tokens) for precision.

uv run scripts/logscan.py grep <logfile> "<query>"

Output goes to stdout as <line_number>: <line>, one per match.

summarize — produce a markdown report

Summarizes the log into a coherent narrative focused on the query. Uses larger chunks (~50k tokens) and hierarchically reduces per-chunk summaries into a final report.

uv run scripts/logscan.py summarize <logfile> "<query>"

Output is a markdown report on stdout.

Piping modes together

grep's stdout feeds directly into summarize via --stdin — narrow to relevant lines first, then summarize:

uv run scripts/logscan.py grep log.txt "errors" \
  | uv run scripts/logscan.py summarize --stdin "summarize these errors"

Arguments

Argument	Description
`mode`	`grep` or `summarize`
`logfile`	Path to the log file (optional if `--stdin`)
`query`	Natural language description of what to look for
`--chunk-tokens N`	Tokens per chunk (default: 5000 for grep, 50000 for summarize)
`--concurrency N`	Max parallel requests (default: 16)
`--model NAME`	Gemini model (default: `gemini-2.5-flash-lite`)
`-v, --verbose`	Print per-chunk results to stderr
`--stdin`	Read input from stdin instead of a file

Examples

# Find OOM errors in a training log
uv run scripts/logscan.py grep /tmp/train.log "out of memory errors or OOM kills"

# Summarize TPU failures
uv run scripts/logscan.py summarize /tmp/worker.log "TPU errors, device failures, or FAILED_PRECONDITION"

# grep then summarize for focused analysis
uv run scripts/logscan.py grep /tmp/controller.log "timeout" \
  | uv run scripts/logscan.py summarize --stdin "what caused the timeouts?"

# Use a more capable model for complex analysis
uv run scripts/logscan.py summarize /tmp/big.log "race conditions or deadlocks" --model gemini-2.5-flash

Output

grep: Line-numbered matching lines to stdout. Progress to stderr.
summarize: Markdown report to stdout. Progress and token usage to stderr.

Both modes print token usage stats to stderr when complete.

Integration with Other Skills

babysit-*: analyze logs from failed jobs before deciding on recovery
debug: use grep to find the failure region, then Read specific line ranges
triage-canary: use summarize to scan canary ferry logs for the root cause

Tips

grep first to narrow down, then summarize the filtered output via --stdin
For very large files (>100k lines), summarize handles hierarchical reduction automatically
Add -v to see per-chunk results as they complete

المزيد من هذا المستودع

نفس المستودع

commit

marin-community/marin

Lint, run the pre-PR checks, commit, push, and author or update the branch's pull request in the required plain-text format. Use when committing, pushing, or creating/updating a PR.

2026-06-201.1k

change-grug

marin-community/marin

Modify or upstream a Grug/Grugformer experiment variant.

2026-06-191.1k

evaluate-zephyr-perf

marin-community/marin

Run a perf gate on a PR that touches lib/zephyr internals.

2026-06-191.1k

organize-experiments

marin-community/marin

Curate the experiment report index at docs/reports/index.md.

2026-06-191.1k

triage-canary

marin-community/marin

Triage a failed canary ferry run (CI-invoked).

2026-06-191.1k

refresh-tpu-vllm-forks

marin-community/marin

Refresh Marin TPU-vLLM forks from a tpu-inference release/LKG pair, update exact SHA pins, run TPU smokes, and open the Marin PR.

2026-06-171.1k

name	scan-logs
description	Scan logs too large to read directly, using Gemini (scripts/logscan.py).

Skill: Scan Logs

Use scripts/logscan.py to analyze large log files. Two composable modes — grep (find matching lines) and summarize (produce a markdown report) — used independently or piped together.

When to Use

Log files too large to read in context (>1000 lines)
Searching for errors, anomalies, or patterns in job/worker/controller logs
Triaging failures from Iris, Zephyr, or training jobs

Prerequisites

GEMINI_API_KEY must be set.

Modes

grep — find matching lines

Returns original log lines (with line numbers) matching a natural-language query. Uses small chunks (~5k tokens) for precision.

uv run scripts/logscan.py grep <logfile> "<query>"

Output goes to stdout as <line_number>: <line>, one per match.

summarize — produce a markdown report

Summarizes the log into a coherent narrative focused on the query. Uses larger chunks (~50k tokens) and hierarchically reduces per-chunk summaries into a final report.

uv run scripts/logscan.py summarize <logfile> "<query>"

Output is a markdown report on stdout.

Piping modes together

grep's stdout feeds directly into summarize via --stdin — narrow to relevant lines first, then summarize:

uv run scripts/logscan.py grep log.txt "errors" \
  | uv run scripts/logscan.py summarize --stdin "summarize these errors"

Arguments

Argument	Description
`mode`	`grep` or `summarize`
`logfile`	Path to the log file (optional if `--stdin`)
`query`	Natural language description of what to look for
`--chunk-tokens N`	Tokens per chunk (default: 5000 for grep, 50000 for summarize)
`--concurrency N`	Max parallel requests (default: 16)
`--model NAME`	Gemini model (default: `gemini-2.5-flash-lite`)
`-v, --verbose`	Print per-chunk results to stderr
`--stdin`	Read input from stdin instead of a file

Examples

# Find OOM errors in a training log
uv run scripts/logscan.py grep /tmp/train.log "out of memory errors or OOM kills"

# Summarize TPU failures
uv run scripts/logscan.py summarize /tmp/worker.log "TPU errors, device failures, or FAILED_PRECONDITION"

# grep then summarize for focused analysis
uv run scripts/logscan.py grep /tmp/controller.log "timeout" \
  | uv run scripts/logscan.py summarize --stdin "what caused the timeouts?"

# Use a more capable model for complex analysis
uv run scripts/logscan.py summarize /tmp/big.log "race conditions or deadlocks" --model gemini-2.5-flash

Output

grep: Line-numbered matching lines to stdout. Progress to stderr.
summarize: Markdown report to stdout. Progress and token usage to stderr.

Both modes print token usage stats to stderr when complete.

Integration with Other Skills

babysit-*: analyze logs from failed jobs before deciding on recovery
debug: use grep to find the failure region, then Read specific line ranges
triage-canary: use summarize to scan canary ferry logs for the root cause

Tips

grep first to narrow down, then summarize the filtered output via --stdin
For very large files (>100k lines), summarize handles hierarchical reduction automatically
Add -v to see per-chunk results as they complete