Exécutez n'importe quel Skill dans Manus
en un clic

Exécutez n'importe quel Skill dans Manus en un clic

$pwd:

medkit-attending-debrief

Name: Medkit Attending Debrief
Author: bedriyan

// Reference for the DEBRIEF MODE that lives inside the `medkit-attending` Managed Agent. Use this skill when debugging why the agent emitted a particular `render_case_evaluation` payload, when modifying the rubric scoring logic, or when a citation appears unresolved in the UI. NOT used to author new rubrics — see medkit-rubric-author for that.

Exécuter dans Manus

$ git log --oneline --stat

stars:272

forks:56

updated:26 avril 2026 à 00:40

SKILL.md

readonly

name	medkit-attending-debrief
description	Reference for the DEBRIEF MODE that lives inside the `medkit-attending` Managed Agent. Use this skill when debugging why the agent emitted a particular `render_case_evaluation` payload, when modifying the rubric scoring logic, or when a citation appears unresolved in the UI. NOT used to author new rubrics — see medkit-rubric-author for that.

medkit-attending — DEBRIEF MODE reference

The medkit-attending Managed Agent (Opus 4.7) runs in two modes during a session:

Live mode — observes the encounter, optionally emits render_triage_badge on ER arrivals.
DEBRIEF MODE — kicks in when the trainee submits a [debrief request] message at end of encounter; emits exactly one render_case_evaluation and stops.

This skill documents mode 2. The full system prompt is in backend/server.py under MEDKIT_ATTENDING_SYSTEM_PROMPT.

Trigger contract

The trainee's frontend posts a user.message whose body starts with the literal header [debrief request], followed by a JSON block produced by buildDebriefRequest(). The block contains:

case_id and case_summary (chief complaint, correct diagnosis, severity)
rubric — full CaseRubric object (data_gathering, clinical_management, interpersonal, optional safety_netting)
registry_slice — only the guidelines + recommendations cited by the rubric; acts as both context AND an allowlist
encounter_log — chronological history Q&A, ordered tests with timestamps, treatments given, prescriptions, submitted diagnosis, correctness flag

Output contract

A single render_case_evaluation tool use whose input validates against caseEvaluationInput (Zod schema mirrors the JSON schema in backend/server.py:MEDKIT_CUSTOM_TOOLS). Required fields:

case_id, global_rating, domain_scores, criteria, highlights, improvements, narrative

Optional: safety_breach (object or null).

Hard rules baked into the agent prompt

Cite, don't invent. Every clinical_management criterion's guideline_ref must appear in registry_slice.recommendations[].recId. If no rec applies, the agent drops the criterion. Never fabricates.
Specific evidence. "You missed ICE" is not enough. The expected bar is a transcript-quoted observation tied to the case (e.g. patient hinted at father's stroke, trainee didn't pick it up).
Safety first. A contraindicated drug, missed red-flag escalation, or no safety-netting on a high-risk diagnosis sets safety_breach and the narrative leads with it regardless of total score.
Bands for verdict (per domain and global): ≥0.85 excellent, ≥0.70 good, ≥0.55 satisfactory, ≥0.40 borderline, otherwise clear-fail.
No clinical advice for real patients. Output framed as training only.

Files in the chain

File	Role
backend/server.py	`MEDKIT_ATTENDING_SYSTEM_PROMPT` (DEBRIEF MODE section) + `MEDKIT_CUSTOM_TOOLS[render_case_evaluation]` JSON schema
src/agents/customTools.ts	`caseEvaluationInput` Zod mirror; validates the tool-use input before render
src/agents/debriefRequest.ts	Packs the encounter into the `[debrief request]` payload
src/agents/useAttendingDebrief.ts	Hook: bootstrap → session → message → stream → eval emit
src/components/CaseEvaluationCard.tsx	Standalone card (used elsewhere if needed)
src/components/DebriefScreen.tsx	Cozy-cartoon screen consuming the live evaluation

Deploying changes to the agent

The agent definition lives on Anthropic's platform. After editing the system prompt or tool schema in backend/server.py:

Restart the FastAPI backend (so the Python module re-loads the constants).
curl -X POST http://127.0.0.1:8787/agent/refresh -H "Origin: http://localhost:5173".
The response shows the new agent version. Existing sessions keep their pinned version; new sessions pick up the latest.

If the schema or system prompt drifts between code and platform, the agent might emit shapes the Zod parser rejects. The useAttendingDebrief hook returns a validation error in that case; check the browser console.

Smoke tests

scripts/verify/rubric-smoke.ts — every cited guideline_ref in every authored rubric resolves; auto-rubric fallback works.
scripts/verify/evaluation-flow.ts — end-to-end with a synthetic encounter; Zod validates a hand-crafted evaluation.
scripts/verify/live-debrief.ts — drives a live agent session through the running backend; takes ~60–90 s per case.

Run these whenever you touch the rubric schema, the registry, or the agent prompt.

Anti-patterns

Editing the Zod schema without updating the matching JSON schema in backend/server.py (or vice versa) — they MUST match.
Letting the agent emit render_case_grade (the legacy flat-score tool) — it was deprecated when DEBRIEF MODE shipped; the system prompt should never mention it.
Treating an unresolved citation in the UI as an agent bug. It usually means the rubric cites a recId that isn't in the registry: either author the recommendation in guidelines.ts (via medkit-guideline-curator) or remove the citation from the rubric.

related-skills.json

même dépôt

attending-debrief.md

from "bedriyan/medkit-app"

Scores a completed OSCE encounter against the case rubric and produces a structured debrief. Cites guideline IDs from registry. Never invents recommendation classes or LoE.

2026-04-26272

case-generator.md

from "bedriyan/medkit-app"

Generates OSCE-style GP case JSON variants from an authoritative guideline + a variant brief. Outputs strict JSON conforming to cases/case.schema.json. Every clinical fact and rubric item is traceable back to a registry guideline ID.

2026-04-26272

guideline-curator.md

from "bedriyan/medkit-app"

Drafts entries for guidelines/registry.json by fetching authoritative society/agency guidelines via WebFetch, restricted to the source whitelist. Output is always verificationStatus="auto-fetched" — only the MD signs off "verified". Designed to run on a weekly /loop to keep the registry current.

2026-04-26272

patient-roleplay.md

from "bedriyan/medkit-app"

Voices the patient in real time during a MedKit OSCE encounter. Reads hidden.history_facts + personality + planted_cues + lies. Outputs naturalistic spoken utterances; never breaks character; never reveals more than a real standardised patient would.

2026-04-26272

simulation-tick.md

from "bedriyan/medkit-app"

World physics for MedKit. For every student action (order test, prescribe, refer, examine, advise) returns strict JSON describing validity, what the student sees, time cost, and any change to patient state. Never speaks to the student.

2026-04-26272

medkit-guideline-curator.md

from "bedriyan/medkit-app"

Curate or refresh entries in `src/data/guidelines.ts` from authoritative society sources via WebFetch. Use whenever the registry needs a new condition (because a hero case rubric needs a citation that doesn't exist yet) or whenever existing entries should be checked for newer guideline versions. Designed to run on a `/loop 7d /medkit-guideline-curator` weekly schedule. Output is always `verificationStatus: "auto-fetched"` — a clinician must sign off "verified" by hand.

2026-04-26272

package.json

"author": "bedriyan"

"repository": "bedriyan/medkit-app"

Ouvrir le dépôt GitHub Voir les dépôts du créateur

$ install --global

$ download --local

Exécuter dans Manus

$ useful --forSOC

Développeurs de logicielsProfessions informatiques et mathématiques15-1252L4

name	medkit-attending-debrief
description	Reference for the DEBRIEF MODE that lives inside the `medkit-attending` Managed Agent. Use this skill when debugging why the agent emitted a particular `render_case_evaluation` payload, when modifying the rubric scoring logic, or when a citation appears unresolved in the UI. NOT used to author new rubrics — see medkit-rubric-author for that.

medkit-attending — DEBRIEF MODE reference

The medkit-attending Managed Agent (Opus 4.7) runs in two modes during a session:

Live mode — observes the encounter, optionally emits render_triage_badge on ER arrivals.
DEBRIEF MODE — kicks in when the trainee submits a [debrief request] message at end of encounter; emits exactly one render_case_evaluation and stops.

This skill documents mode 2. The full system prompt is in backend/server.py under MEDKIT_ATTENDING_SYSTEM_PROMPT.

Trigger contract

The trainee's frontend posts a user.message whose body starts with the literal header [debrief request], followed by a JSON block produced by buildDebriefRequest(). The block contains:

case_id and case_summary (chief complaint, correct diagnosis, severity)
rubric — full CaseRubric object (data_gathering, clinical_management, interpersonal, optional safety_netting)
registry_slice — only the guidelines + recommendations cited by the rubric; acts as both context AND an allowlist
encounter_log — chronological history Q&A, ordered tests with timestamps, treatments given, prescriptions, submitted diagnosis, correctness flag

Output contract

A single render_case_evaluation tool use whose input validates against caseEvaluationInput (Zod schema mirrors the JSON schema in backend/server.py:MEDKIT_CUSTOM_TOOLS). Required fields:

case_id, global_rating, domain_scores, criteria, highlights, improvements, narrative

Optional: safety_breach (object or null).

Hard rules baked into the agent prompt

Cite, don't invent. Every clinical_management criterion's guideline_ref must appear in registry_slice.recommendations[].recId. If no rec applies, the agent drops the criterion. Never fabricates.
Specific evidence. "You missed ICE" is not enough. The expected bar is a transcript-quoted observation tied to the case (e.g. patient hinted at father's stroke, trainee didn't pick it up).
Safety first. A contraindicated drug, missed red-flag escalation, or no safety-netting on a high-risk diagnosis sets safety_breach and the narrative leads with it regardless of total score.
Bands for verdict (per domain and global): ≥0.85 excellent, ≥0.70 good, ≥0.55 satisfactory, ≥0.40 borderline, otherwise clear-fail.
No clinical advice for real patients. Output framed as training only.

Files in the chain

File	Role
backend/server.py	`MEDKIT_ATTENDING_SYSTEM_PROMPT` (DEBRIEF MODE section) + `MEDKIT_CUSTOM_TOOLS[render_case_evaluation]` JSON schema
src/agents/customTools.ts	`caseEvaluationInput` Zod mirror; validates the tool-use input before render
src/agents/debriefRequest.ts	Packs the encounter into the `[debrief request]` payload
src/agents/useAttendingDebrief.ts	Hook: bootstrap → session → message → stream → eval emit
src/components/CaseEvaluationCard.tsx	Standalone card (used elsewhere if needed)
src/components/DebriefScreen.tsx	Cozy-cartoon screen consuming the live evaluation

Deploying changes to the agent

The agent definition lives on Anthropic's platform. After editing the system prompt or tool schema in backend/server.py:

Restart the FastAPI backend (so the Python module re-loads the constants).
curl -X POST http://127.0.0.1:8787/agent/refresh -H "Origin: http://localhost:5173".
The response shows the new agent version. Existing sessions keep their pinned version; new sessions pick up the latest.

Smoke tests

scripts/verify/rubric-smoke.ts — every cited guideline_ref in every authored rubric resolves; auto-rubric fallback works.
scripts/verify/evaluation-flow.ts — end-to-end with a synthetic encounter; Zod validates a hand-crafted evaluation.
scripts/verify/live-debrief.ts — drives a live agent session through the running backend; takes ~60–90 s per case.

Run these whenever you touch the rubric schema, the registry, or the agent prompt.

Anti-patterns

Editing the Zod schema without updating the matching JSON schema in backend/server.py (or vice versa) — they MUST match.
Letting the agent emit render_case_grade (the legacy flat-score tool) — it was deprecated when DEBRIEF MODE shipped; the system prompt should never mention it.
Treating an unresolved citation in the UI as an agent bug. It usually means the rubric cites a recId that isn't in the registry: either author the recommendation in guidelines.ts (via medkit-guideline-curator) or remove the citation from the rubric.

medkit-attending-debrief

medkit-attending — DEBRIEF MODE reference

Trigger contract

Output contract

Hard rules baked into the agent prompt

Files in the chain

Deploying changes to the agent

Smoke tests

Anti-patterns

Plus depuis ce dépôt

Plus depuis ce dépôt

medkit-attending — DEBRIEF MODE reference

Trigger contract

Output contract

Hard rules baked into the agent prompt

Files in the chain

Deploying changes to the agent

Smoke tests

Anti-patterns