| name | eval-setup |
| description | Optional environment configurator for the agent-eval-harness. Configures MLflow tracking, verifies API keys, and troubleshoots dependency issues. Not required for basic usage — dependencies auto-install via SessionStart hook and agent_eval is available via symlinks. Use when the user wants to configure MLflow tracking, troubleshoot import errors, verify the environment, or set up a remote MLflow server. Also triggers on "configure mlflow", "set up tracking", "ModuleNotFoundError", "mlflow not installed", "missing dependencies", or "check my eval environment". |
| user-invocable | true |
| allowed-tools | Read, Bash, Glob, AskUserQuestion |
You are an environment configurator. You verify the evaluation harness environment and configure optional integrations such as MLflow. Be non-destructive: skip steps that are already done and report their status.
Most users can skip this skill entirely — dependencies auto-install via the plugin's SessionStart hook, and agent_eval is available to scripts via symlinks. This skill is useful for configuring MLflow tracking, troubleshooting dependency issues, or verifying the environment.
The eval pipeline is: /eval-analyze → /eval-dataset → /eval-run → /eval-review or /eval-optimize. /eval-mlflow can be invoked at any point after /eval-run. MLflow tracing is handled by /eval-mlflow after a run completes. No tracing setup is needed here.
Step 0: Parse Arguments
Parse $ARGUMENTS for:
| Argument | Required | Default | Description |
|---|---|---|---|
| --tracking-uri <uri> | no | auto-detect | MLflow tracking URI (skips interactive setup) |
| --skip-mlflow | no | false | Skip MLflow setup entirely |
| --runs-dir <path> | no | eval/runs | Directory where eval runs are stored |
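For example, assuming the skill is invoked as /eval-setup, two hypothetical invocations:
/eval-setup --tracking-uri http://127.0.0.1:5000 --runs-dir eval/runs
/eval-setup --skip-mlflow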
Step 1: Install Dependencies (if needed)
Dependencies are managed in an isolated venv at <plugin_root>/.eval-venv/. The SessionStart hook creates this venv automatically. Scripts auto-activate it via agent_eval._bootstrap on import.
This step is a fallback for mid-session installs or troubleshooting. Re-run the hook's install script:
python3 "${CLAUDE_SKILL_DIR}/../../scripts/ensure_deps.py" "${CLAUDE_PLUGIN_DATA:-${XDG_STATE_HOME:-$HOME/.local/state}/agent-eval-data}"
To check the venv status:
VENV_PYTHON="${CLAUDE_SKILL_DIR}/../../.eval-venv/bin/python3"
test -x "$VENV_PYTHON" && echo "venv: OK" || echo "venv: MISSING"
"$VENV_PYTHON" -c "import yaml; print('pyyaml: OK')" 2>&1 || echo "pyyaml: MISSING"
To install additional packages manually into the venv:
VENV_DIR="${CLAUDE_SKILL_DIR}/../../.eval-venv"
if command -v uv &>/dev/null; then
uv pip install --python "$VENV_DIR/bin/python3" 'mlflow[genai]>=3.5' 'anthropic[vertex]>=0.40'
else
"$VENV_DIR/bin/pip" install 'mlflow[genai]>=3.5' 'anthropic[vertex]>=0.40'
fi
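To spot-check afterwards that the optional packages import from the venv (a minimal sketch; adjust the package list as needed):
VENV_DIR="${CLAUDE_SKILL_DIR}/../../.eval-venv"
"$VENV_DIR/bin/python3" -c "import mlflow, anthropic; print('mlflow', mlflow.__version__, '| anthropic', anthropic.__version__)" 2>&1 || echo "optional deps: MISSING"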
Step 2: Run Preflight Checks
python3 "${CLAUDE_SKILL_DIR}/scripts/check_env.py" --fix
Review the output. If all checks pass, report success and skip to Step 6.
If checks fail, work through Steps 3–5 to fix them.
Step 3: Configure MLflow Tracking
If --skip-mlflow was passed, skip this step entirely.
Check if MLflow tracking is configured:
echo "MLFLOW_TRACKING_URI=${MLFLOW_TRACKING_URI:-not set}"
If --tracking-uri was provided: use it directly, skip the interactive choice.
If not set and no flag was passed, ask the user which MLflow setup they want:
- Local server (recommended for getting started):
Tell the user to run the server in a separate terminal:
mlflow server --port 5000
Then set the tracking URI in this session:
export MLFLOW_TRACKING_URI=http://127.0.0.1:5000
Note: the user should add this export to their shell profile for persistence.
- Local file store (no server needed, limited UI):
export MLFLOW_TRACKING_URI=sqlite:///mlflow.db
- Remote server (Databricks, etc.):
Ask the user for their tracking URI and verify connectivity.
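One way to verify connectivity, sketched here with a placeholder URI (requires mlflow in the venv from Step 1):
VENV_PYTHON="${CLAUDE_SKILL_DIR}/../../.eval-venv/bin/python3"
"$VENV_PYTHON" -c "
import mlflow
mlflow.set_tracking_uri('https://mlflow.example.com')  # placeholder: substitute the user's remote URI
print(len(mlflow.search_experiments()), 'experiments reachable')
"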
Per-project pinning: To pin a tracking URI to a specific eval suite (overriding the env var), set mlflow.tracking_uri in eval.yaml. Useful when one machine runs evals against multiple servers. The harness resolves URIs in this order: mlflow.tracking_uri in eval.yaml > MLFLOW_TRACKING_URI env var > http://127.0.0.1:5000.
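A minimal sketch of such a pin in eval.yaml (the experiment name and URI are placeholders):
mlflow:
  experiment: my-skill-eval            # placeholder experiment name
  tracking_uri: http://127.0.0.1:5000  # overrides MLFLOW_TRACKING_URI for this suite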
Step 4: Configure API Keys
Check authentication:
echo "ANTHROPIC_API_KEY=${ANTHROPIC_API_KEY:+set}"
echo "ANTHROPIC_VERTEX_PROJECT_ID=${ANTHROPIC_VERTEX_PROJECT_ID:-not set}"
If neither is set, tell the user:
- For direct Anthropic API:
export ANTHROPIC_API_KEY=<key>
- For Vertex AI:
export ANTHROPIC_VERTEX_PROJECT_ID=<project-id>
The API key is needed for skill execution (via Claude Code) and for pairwise comparison judges.
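A compact presence check covering both options (presence only, not key validity):
if [ -n "${ANTHROPIC_API_KEY:-}" ] || [ -n "${ANTHROPIC_VERTEX_PROJECT_ID:-}" ]; then echo "auth: OK"; else echo "auth: MISSING"; fi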
Step 5: Configure Runs Directory
Check if the runs directory is configured:
echo "AGENT_EVAL_RUNS_DIR=${AGENT_EVAL_RUNS_DIR:-eval/runs}"
If --runs-dir was provided, use it. Otherwise, the default eval/runs is fine for most projects.
If the user wants a non-default location (e.g., larger disk, shared storage), tell them to add to their shell profile:
export AGENT_EVAL_RUNS_DIR=<path>
All harness scripts read this env var. The directory is created automatically by check_env.py --fix.
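For example, a sketch of persisting a non-default location (the path and profile file are placeholders; adjust for the user's shell):
echo 'export AGENT_EVAL_RUNS_DIR=/mnt/storage/eval-runs' >> ~/.bashrc
export AGENT_EVAL_RUNS_DIR=/mnt/storage/eval-runs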
Step 5b: Check Skill-Specific Environment Variables
If eval.yaml exists and has execution.env entries with $VAR references, those variables must be set in the caller's environment at eval-run time. Check whether they're available:
test -f eval.yaml && PYTHONPATH=${CLAUDE_SKILL_DIR}/scripts python3 -c "
from agent_eval.config import EvalConfig
config = EvalConfig.from_yaml('eval.yaml')
import os
for key, value in config.execution.env.items():
if isinstance(value, str) and value.startswith('\$'):
var_name = value[1:]
status = 'set' if os.environ.get(var_name) else 'NOT SET'
print(f' {key}: \${var_name} → {status}')
else:
# Mask literal values to avoid leaking credentials in logs
print(f' {key}: (literal, {len(str(value))} chars)')
" 2>&1 || echo " (could not parse eval.yaml)"
If any $VAR references are unset, warn the user — they'll need to export them before running /eval-run. Common examples: JIRA_SERVER for jira-emulator, JIRA_TOKEN for Jira API access.
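For reference, a hypothetical execution.env block mixing $VAR references and literals (keys and values are illustrative):
execution:
  env:
    JIRA_SERVER: $JIRA_SERVER   # resolved from the caller's environment at run time
    JIRA_TOKEN: $JIRA_TOKEN     # must be exported before /eval-run
    LOG_LEVEL: info             # literal value, passed through as-is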
Step 6: Create MLflow Experiment
If --skip-mlflow was passed, skip this step.
Check if eval.yaml exists and has mlflow.experiment configured:
test -f eval.yaml && echo "CONFIG_EXISTS" || echo "NO_CONFIG"
If eval.yaml exists:
PYTHONPATH=${CLAUDE_SKILL_DIR}/scripts python3 -c "
from agent_eval.config import EvalConfig
from agent_eval.mlflow.experiment import setup_experiment, resolve_tracking_uri
config = EvalConfig.from_yaml('eval.yaml')
if config.mlflow.experiment:
setup_experiment(config.mlflow.experiment, tracking_uri=resolve_tracking_uri(config))
print(f'Experiment created: {config.mlflow.experiment} on {resolve_tracking_uri(config)}')
else:
print('No mlflow.experiment in eval.yaml, skipping')
"
If eval.yaml doesn't exist, skip this step — it will be created by /eval-analyze.
Step 7: Final Verification
Run the preflight checks again to confirm everything is set up:
python3 "${CLAUDE_SKILL_DIR}/scripts/check_env.py"
If eval.yaml exists, also validate it:
test -f eval.yaml && python3 "${CLAUDE_SKILL_DIR}/scripts/check_env.py" --config eval.yaml
Report the final status to the user and suggest next steps:
- If eval.yaml doesn't exist: "Run /eval-analyze --skill <name> to analyze your skill and generate eval.yaml"
- If eval.yaml exists but no dataset: "Run /eval-dataset to generate test cases"
- If everything is ready: "Run /eval-run --model <model> to execute the evaluation"
Rules
- Non-destructive — skip steps that are already done, don't overwrite existing config
- Report clearly — show what passed, what failed, and how to fix each failure
- MLflow is optional — the harness works without it. Don't fail setup if MLflow can't be configured.
- Suggest the full pipeline — after setup, the user should know the path: analyze → dataset → run → review
$ARGUMENTS