Skip to main content
Run any Skill in Manus
with one click

sigil-experiments

Run any Python LLM agent as a Sigil offline-evaluation experiment using the core sigil-sdk (no framework adapter required): record generations, run over a dataset (or A/B two versions), grade locally, and publish scores to Sigil. Use when a user wants to evaluate/compare agent runs, gate a PR on agent quality, or upload an old eval run to Sigil and is NOT using a supported framework adapter (LangGraph, LangChain, etc.) — for those, prefer the framework skill.

Stars49
Forks11
UpdatedJune 6, 2026 at 01:51
SKILL.md
readonly