langgraph-testing-evaluation

Overview

Use this skill when you need to test or evaluate LangGraph/LangChain agents: writing unit or integration tests, generating test scaffolds, mocking LLM/tool behavior, running trajectory evaluation (match or LLM-as-judge), running LangSmith dataset evaluations, and comparing two agent versions with A/B-style offline analysis. Use it for Python and JavaScript/TypeScript workflows, evaluator design, experiment setup, regression gates, and debugging flaky/incorrect evaluation results.
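As a rough illustration of the "match" style of trajectory evaluation mentioned above, here is a minimal sketch in plain Python. It does not use the skill's own code or any LangChain/LangSmith API. The `ToolCall` shape and the `call`, `strict_match`, and `unordered_match` helpers are hypothetical names introduced only for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ToolCall:
    """One step in an agent trajectory (hypothetical shape for this sketch)."""
    name: str
    args: tuple  # sorted (key, value) pairs, so calls compare by value


def call(name, **args):
    """Build a ToolCall with its arguments in a canonical order."""
    return ToolCall(name, tuple(sorted(args.items())))


def strict_match(actual, expected):
    """True only if the same tool calls occur in the same order."""
    return list(actual) == list(expected)


def unordered_match(actual, expected):
    """True if the same tool calls occur, regardless of order."""
    return sorted(actual, key=repr) == sorted(expected, key=repr)


# Reference trajectory the agent is expected to follow (made-up example data).
expected = [
    call("search", query="weather in Paris"),
    call("summarize", max_words=50),
]
# A run that made the same calls in the opposite order.
reordered = list(reversed(expected))

print(strict_match(reordered, expected))
print(unordered_match(reordered, expected))
```

A strict match suits regression gates where call order matters. The unordered variant tolerates agents that reach the same result by reordering independent tool calls.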

Install command
npx skills add https://github.com/Lubu-Labs/langchain-agent-skills --skill langgraph-testing-evaluation

Copy and paste this command into Claude Code to install the skill.

Stars: 97
Forks: 13
Updated: February 10, 2026, 15:48