Skip to main content
Manusで任意のスキルを実行
ワンクリックで

isc-bench

// Guide for running ISC-Bench jailbreak evaluation against any LLM. Use this whenever someone wants to evaluate LLM safety with ISC-Bench, run the TVD (Task-Validator-Data) benchmark pipeline, test model robustness against structural safety collapse, or compare safety scores across models and benchmarks (JailbreakBench, HarmBench, AdvBench, StrongREJECT). Also use when someone asks about ISC attack success rates, harmful content extraction, or safety scoring on the 1-5 scale.

$ git log --oneline --stat
stars:775
forks:119
updated:2026年5月29日 10:43
SKILL.md
readonly