Skip to main content
Exécutez n'importe quel Skill dans Manus
en un clic

isc-bench

// Guide for running ISC-Bench jailbreak evaluation against any LLM. Use this whenever someone wants to evaluate LLM safety with ISC-Bench, run the TVD (Task-Validator-Data) benchmark pipeline, test model robustness against structural safety collapse, or compare safety scores across models and benchmarks (JailbreakBench, HarmBench, AdvBench, StrongREJECT). Also use when someone asks about ISC attack success rates, harmful content extraction, or safety scoring on the 1-5 scale.

$ git log --oneline --stat
stars:775
forks:119
updated:29 mai 2026 à 10:43
SKILL.md
readonly