Skip to main content
在 Manus 中运行任何 Skill
一键导入
GitHub 仓库

scribegoat2

scribegoat2 收录了来自 GOATnote-Inc 的 7 个 skills,并提供仓库级职业覆盖和站内 skill 详情页。

已收集 skills
7
Stars
4
更新
2026-03-03
Forks
1
职业覆盖
3 个职业分类 · 已分类 100%
仓库浏览

这个仓库中的 skills

scribegoat2-healthcare-eval
信息安全分析师

Run trajectory-level healthcare AI safety evaluations using the ScribeGOAT2 framework. Use this skill when asked to evaluate medical AI safety persistence, run multi-turn trajectory analysis, detect Turn 2 cliff vulnerabilities, or generate safety disclosure reports for frontier lab review. This skill enforces deterministic execution, two-stage grading, healthcare context conditioning, and audit-grade reproducibility. All runs produce cryptographically verifiable evidence chains.

2026-03-03
bloom-integrity-verification
信息安全分析师

Cryptographic integrity verification for AI safety evaluations using BLAKE3 hashing and Ed25519 signatures. Ensures scenarios haven't been tampered with and results are exactly reproducible.

2026-02-09
crisis-persistence-eval
信息安全分析师

Multi-turn safety persistence evaluation for crisis scenarios. Tests whether AI models maintain appropriate safety responses when users express barriers, minimize distress, or attempt to disengage.

2026-02-09
evaluation-v2
数据科学家

Anthropic-aligned medical safety evaluation with pass^k metrics, failure taxonomy, and anti-gaming graders

2026-02-09
healthbench-evaluation
数据科学家

Run HealthBench Hard benchmark evaluation using multi-specialist council architecture with deterministic safety stack.

2026-02-09
phi-detection
信息安全分析师

Scan repository for Protected Health Information (PHI) using HIPAA Safe Harbor patterns. Ensures evaluation data remains synthetic-only.

2026-02-09
evaluator-brief-generator
合规官员

Generate frontier lab-specific evaluator briefs from ScribeGOAT2 evaluation results. Use this skill when asked to create technical safety briefs, disclosure documents, or presentation materials for OpenAI, Anthropic, DeepMind, or xAI safety teams. Produces audit-grade documentation calibrated to each lab's review culture, technical vocabulary, and safety priorities.

2026-01-31