Skip to main content
Run any Skill in Manus
with one click

agentic-eval

Design and implement evaluation loops for AI agents, including reflection, evaluator-optimizer patterns, rubric scoring, LLM-as-judge review, test-driven refinement, convergence checks, and iteration logging.

Overview

Design and implement evaluation loops for AI agents, including reflection, evaluator-optimizer patterns, rubric scoring, LLM-as-judge review, test-driven refinement, convergence checks, and iteration logging.

Install command
npx skills add https://github.com/MarieLynneBlock/arcanum-artifex --skill agentic-eval

Copy and paste this command into Claude Code to install the skill

Stars2
Forks0
UpdatedMay 20, 2026 at 13:27
SKILL.md
readonly