Skip to main content
Ejecuta cualquier Skill en Manus
con un clic
$pwd:

isc-bench

// Guide for running ISC-Bench jailbreak evaluation against any LLM. Use this whenever someone wants to evaluate LLM safety with ISC-Bench, run the TVD (Task-Validator-Data) benchmark pipeline, test model robustness against structural safety collapse, or compare safety scores across models and benchmarks (JailbreakBench, HarmBench, AdvBench, StrongREJECT). Also use when someone asks about ISC attack success rates, harmful content extraction, or safety scoring on the 1-5 scale.

$ git log --oneline --stat
stars:775
forks:119
updated:29 de mayo de 2026, 10:43
SKILL.md
readonly