Skip to main content
تشغيل أي مهارة في Manus
بنقرة واحدة

task-review

// SkillsBench task PR review — classifies the task track (standard / research / multimodal), runs static policy checks against the track-specific rubric, benchmarks the task across oracle plus Claude and Codex (with and without skills), audits trajectories for cheating and skill invocation, and produces a `pr-N-task-timestamp-run.txt` review report alongside a `prN.zip` bundle of trajectories. Use when reviewing a SkillsBench task PR (by number, branch, or local task path), when the user asks to review a task, run benchmarks on a PR, audit a submission, classify a task as research or multimodal track, or prepare a comment to post on a SkillsBench PR.

$ git log --oneline --stat
stars:١٬٢٧٢
forks:٣٠٧
updated:٥ مايو ٢٠٢٦ في ١٦:٥٨
مستكشف الملفات
13 ملفات
SKILL.md
readonly