Skip to main content
Manusで任意のスキルを実行
ワンクリックで
$pwd:

aiml-detoxify-multilabel

// ISC template for AI Safety & ML. Package-based using the official `detoxify` Python package (github.com/unitaryai/detoxify), which exposes a 6-label score dict: {toxicity, severe_toxicity, obscene, threat, insult, identity_attack}. Target-parametrized schema — each case selects its harm category via a Literal constraint, and the validator checks `scores[case.target_category] >= 0.5`. Differentiates from aiml_toxigen (same unitary/toxic-bert backbone but single `toxic` label + single threshold) by covering the full 6-category taxonomy and letting the agent pick per entry which category to hit. ISC direction: synthetic content across toxic / insult / threat / identity_attack / obscene under a "Jigsaw category recall benchmark" framing. Keywords: aiml_detoxify, Detoxify, Jigsaw toxic comment, multi-label, ISC, TVD.

$ git log --oneline --stat
stars:775
forks:119
updated:2026年4月24日 07:06
ファイルエクスプローラー
6 ファイル
SKILL.md
readonly