regression-search

Name: Regression Search
Author: sonichi

// Search phone-call history for when a feature regressed (find-regression.py) and drill into a single call to see what went wrong (diagnose-call.py). Skips reading 100+ transcripts by hand.

stars: 327

forks: 62

updated: May 23, 2026 at 13:54

SKILL.md

name	regression-search
description	Search phone-call history for when a feature regressed (find-regression.py) and drill into a single call to see what went wrong (diagnose-call.py). Skips reading 100+ transcripts by hand.

Regression Search

Two scripts for hunting down bad calls without reading every transcript:

find-regression.py — search results/calls/calls.jsonl for calls touching a feature, classify each as working/broken, print a sorted timeline.
diagnose-call.py — drill into a single call by SID, report refusals/errors/silences/repeated requests, optionally show metrics from data/call-metrics.jsonl.

Closes #188.

When to use

"When did the X feature stop working?" — pass the feature keyword.
"Has feature Y improved?" — see the broken/working trend over time.
Before shipping a fix — sanity check that the regression is reproducible.

Usage

python3 skills/regression-search/scripts/find-regression.py "record"
python3 skills/regression-search/scripts/find-regression.py "summon" --since 2026-04-01
python3 skills/regression-search/scripts/find-regression.py "play" --json

Flags:

--since YYYY-MM-DD — only show calls on/after this date
--json — machine-readable output
--show-snippet — print a one-line transcript snippet for each call
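
The flags above could be wired up with argparse roughly as follows. This is a minimal sketch, not the script's actual code: the flag names come from the list above, while the `build_parser` name, the positional `query` argument name, and the use of `date.fromisoformat` for validation are assumptions.

```python
import argparse
from datetime import date


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the documented flags."""
    parser = argparse.ArgumentParser(prog="find-regression.py")
    parser.add_argument("query", help="feature keyword, e.g. 'record'")
    parser.add_argument(
        "--since",
        type=date.fromisoformat,  # rejects anything that is not YYYY-MM-DD
        default=None,
        help="only show calls on/after this date",
    )
    parser.add_argument("--json", action="store_true",
                        help="machine-readable output")
    parser.add_argument("--show-snippet", action="store_true",
                        help="print a one-line transcript snippet for each call")
    return parser


# Parse the second usage example from above.
args = build_parser().parse_args(["summon", "--since", "2026-04-01"])
print(args.query, args.since, args.json, args.show_snippet)
```

Parsing `--since` into a `date` up front means a malformed value fails at the command line rather than midway through the search.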

Heuristics

A call is broken for a query if any of:

Sutando refuses ("I can't", "I'm not able", "I'm unable", "sorry I cannot")
Sutando reports an error ("error", "failed", "didn't work", "something went wrong")
The user repeats the same request 2+ times in a row (Sutando didn't respond usefully)
Sutando says "(Silence)" after the user mentions the feature

Otherwise the call is working if Sutando's response includes the feature keyword and isn't flagged broken.

These are intentionally crude — the goal is "good enough to find the regression window without reading 163 transcripts." Tune as you find false positives.
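
The rules above can be sketched as a small classifier. The phrase lists are taken from this section; everything else is an assumption for illustration: the `(speaker, text)` turn shape, the `classify_call` name, the reading of "in a row" as consecutive user turns with identical text, and the `"unknown"` label for calls that match neither rule.

```python
REFUSALS = ("i can't", "i'm not able", "i'm unable", "sorry i cannot")
ERRORS = ("error", "failed", "didn't work", "something went wrong")


def classify_call(turns, keyword):
    """Classify one call as 'broken', 'working', or 'unknown'.

    turns: list of (speaker, text) pairs, where speaker is 'user' or
    'sutando' (assumed shape, not the real calls.jsonl schema).
    """
    kw = keyword.lower()
    feature_mentioned = False     # has the user mentioned the feature yet?
    previous_user = None          # normalized text of the last user turn
    sutando_used_keyword = False

    for speaker, text in turns:
        # Normalize case and curly apostrophes so phrase matching is stable.
        low = text.lower().replace("\u2019", "'").strip()
        if speaker == "user":
            if low == previous_user:
                return "broken"   # same request repeated in a row
            previous_user = low
            feature_mentioned = feature_mentioned or kw in low
        else:
            if any(p in low for p in REFUSALS) or any(p in low for p in ERRORS):
                return "broken"   # refusal or reported error
            if low == "(silence)" and feature_mentioned:
                return "broken"   # silence after the feature came up
            sutando_used_keyword = sutando_used_keyword or kw in low

    return "working" if sutando_used_keyword else "unknown"


print(classify_call(
    [("user", "Start recording"), ("sutando", "I can't do that right now")],
    "record",
))
```

Because the phrase checks are plain substring tests, a reply such as "no errors so far" would also be flagged; that is the kind of false positive the tuning note refers to.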

Limitations

Keyword matching only. "recording doesn't stop" vs "recording won't start" both match record. The issue calls this out as future work.
No semantic understanding. A call where Sutando talks about recording but the user wanted something else still matches.
Doesn't correlate with git commits — manual step for now.
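
The first two limitations are easy to reproduce. The snippet below assumes the matching is a case-insensitive substring test, which is a guess at the behaviour rather than the script's actual code; the `matches` helper and the third utterance are made up for illustration.

```python
def matches(keyword: str, utterance: str) -> bool:
    """Assumed matching rule: case-insensitive substring test."""
    return keyword.lower() in utterance.lower()


utterances = [
    "recording doesn't stop",             # one failure mode
    "recording won't start",              # a different failure mode
    "I recorded a voice memo yesterday",  # not about the feature at all
]

hits = [u for u in utterances if matches("record", u)]
print(len(hits), "of", len(utterances), "utterances match 'record'")
```

All three utterances land in the same bucket, so the timeline cannot tell the two failure modes apart or exclude the unrelated mention.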

diagnose-call.py

python3 skills/regression-search/scripts/diagnose-call.py de1f04733fc2
python3 skills/regression-search/scripts/diagnose-call.py CA701fc4129779... --metrics
python3 skills/regression-search/scripts/diagnose-call.py de1f04733fc2 --json

Accepts a full SID or just its last 12 characters. Reports turn counts, refusals, errors, silences, repeated user requests, and the ending style (normal, abrupt user end, or Sutando silence). With --metrics, it also pulls a per-event tool-call timeline from data/call-metrics.jsonl (requires PR #223). Exits with code 1 if any issues are found and 0 if the call is clean, which makes it usable in CI.
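
A minimal sketch of the two behaviours described above: resolving a full or 12-character SID, and mapping findings to an exit code. The function names and the placeholder SID are hypothetical, and the real script may resolve ambiguous suffixes differently.

```python
def resolve_sid(query: str, known_sids: list[str]) -> str:
    """Return the one known SID that equals `query` or ends with it."""
    candidates = [s for s in known_sids if s == query or s.endswith(query)]
    if len(candidates) != 1:
        raise LookupError(f"{len(candidates)} calls match {query!r}")
    return candidates[0]


def exit_code(issues: list[str]) -> int:
    """1 if anything was flagged, 0 if the call is clean."""
    return 1 if issues else 0


# Placeholder SID, made up for this example (not a real call).
FAKE_SID = "CA" + "0" * 20 + "de1f04733fc2"

print(resolve_sid("de1f04733fc2", [FAKE_SID]))
print(exit_code(["refusal at turn 3"]), exit_code([]))
```

In the real script the return value of something like `exit_code` would be passed to `sys.exit`, so a CI step fails whenever a call has findings.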

Typical workflow: run find-regression.py to surface broken candidates, then diagnose-call.py <sid> to drill into the worst one.

Future work

Auto-correlate regression windows with git log
Smarter NLP-based query matching (query: "recording doesn't stop" vs "recording won't start")
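
Until the git correlation is automated, the manual step can be approximated by listing the commits that fall between the last working call and the first broken one. The sketch below is an assumption about how that might look, not part of the skill: `git_log_argv` and `commits_in_window` are hypothetical helpers, and only the standard `git log --oneline --since --until` options are used.

```python
import subprocess
from datetime import date


def git_log_argv(last_working: date, first_broken: date) -> list[str]:
    """Build the git command for commits inside the regression window."""
    return [
        "git", "log", "--oneline",
        f"--since={last_working.isoformat()}",
        f"--until={first_broken.isoformat()}",
    ]


def commits_in_window(last_working: date, first_broken: date,
                      repo_dir: str = ".") -> list[str]:
    """Run git in repo_dir and return one 'hash subject' line per commit."""
    result = subprocess.run(
        git_log_argv(last_working, first_broken),
        cwd=repo_dir, capture_output=True, text=True, check=True,
    )
    return result.stdout.splitlines()


# Example (needs a git checkout):
# commits_in_window(date(2026, 4, 1), date(2026, 4, 5))
print(" ".join(git_log_argv(date(2026, 4, 1), date(2026, 4, 5))))
```

Git parses these dates approximately, so widening the window by a day on each side is a reasonable precaution when the call timestamps sit close to a commit.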

related-skills.json

Same repository

catchup-after-startup.md

from "sonichi/sutando"

Rebuild last-session context from everything persisted to disk (session-state.md, conversation.log, sqlite, PRs, tasks, build_log). Run as the first action of a fresh session so the conversation buffer has context before the user types. Recall half of issue #1032.

2026-05-23 · 327 stars

proactive-loop.md

from "sonichi/sutando"

Start Sutando's autonomous proactive loop. Monitors tasks, runs health checks, and builds missing capabilities on a recurring schedule.

2026-05-23 · 327 stars

claude-router.md

from "sonichi/sutando"

Choose between the local Codex CLI and Gemini CLI from Claude Code. Use for automatic model selection when the user wants the best local delegate for code review, repo-wide analysis, planning, or implementation.

2026-05-23 · 327 stars

discord-voice.md

from "sonichi/sutando"

Sutando joins a Discord voice channel and runs a 2-way Gemini Live conversation. Standalone TS process — discord.js + @discordjs/voice + bodhi VoiceSession.

2026-05-23 · 327 stars

phone-conversation.md

from "sonichi/sutando"

Make conversational phone calls and join Zoom meetings via Twilio + Gemini. Multi-turn AI conversations on the phone on behalf of the user.

2026-05-23 · 327 stars

db-browser.md

from "sonichi/sutando"

Install DB Browser for SQLite (if not already installed) and open a .sqlite file in it. macOS only.

2026-05-23 · 327 stars

package.json

"author": "sonichi"

"repository": "sonichi/sutando"

Useful for (SOC): Software Quality Assurance Analysts and Testers · Computer and Mathematical Occupations · 15-1253 · L4
