Skip to main content
在 Manus 中运行任何 Skill
一键导入
$pwd:
langwatch
GitHub 创作者资料

langwatch

按仓库查看 3 个 GitHub 仓库中的 19 个已收集 skills,并展示近似职业覆盖。

已收集 skills
19
仓库
3
职业领域
1
更新
2026-04-28
职业覆盖
该创作者主要覆盖的职业大类。
仓库浏览

仓库与代表性 skills

#001
skills
13 个 skills21更新于 2026-04-28
占该创作者 68%
datasets
软件质量保证分析师与测试员

Generate realistic synthetic evaluation datasets by analyzing the user's codebase, prompts, production traces, and reference materials. Interactive, consultant-style — asks clarifying questions, proposes a plan, generates a preview for approval, then delivers a complete dataset uploaded to LangWatch. Use when user asks to generate, create, or build a dataset for evaluation, testing, or benchmarking.

2026-04-28
analytics
数据科学家

Analyze your AI agent's performance using LangWatch analytics. Use when the user wants to understand costs, latency, error rates, usage trends, or debug specific traces. Works with any LangWatch-instrumented agent.

2026-04-24
evaluations
软件质量保证分析师与测试员

Set up comprehensive evaluations for your AI agent with LangWatch — experiments (batch testing), evaluators (scoring functions), datasets, online evaluation (production monitoring), and guardrails (real-time blocking). Supports both code (SDK) and platform (CLI) approaches. Use when the user wants to evaluate, test, benchmark, monitor, or safeguard their agent.

2026-04-24
level-up
软件开发工程师

Take your AI agent to the next level with full LangWatch integration. Adds tracing, prompt versioning, evaluation experiments, and simulation tests in one go. Use when the user wants comprehensive observability, testing, and prompt management for their agent.

2026-04-24
prompts
软件开发工程师

Version and manage your agent's prompts with LangWatch Prompts CLI. Use for both onboarding (set up prompt versioning for an entire codebase) and targeted operations (version a specific prompt, create a new prompt version). Supports Python and TypeScript.

2026-04-24
debug-instrumentation
软件开发工程师

Debug and improve your LangWatch traces. Inspects production traces for missing input/output, disconnected spans, unlabeled traces, and missing metadata. Use when traces look broken or incomplete.

2026-04-24
evaluate-multimodal
软件质量保证分析师与测试员

Evaluate multimodal AI agents that process images, audio, PDFs, or other files. Sets up evaluations using LangWatch's LLM-as-judge with image inputs, Scenario's multimodal testing, and document parsing evaluation patterns. Use when your agent handles non-text inputs.

2026-04-24
generate-rag-dataset
数据科学家

Generate a synthetic evaluation dataset from your RAG knowledge base. Creates diverse Q&A pairs with expected answers and relevant context, ready for LangWatch experiments and platform import. Use when you need test data for your RAG pipeline.

2026-04-24
当前展示该仓库 Top 8 / 13 个已收集 skills。
#002
langwatch
5 个 skills3.3k321更新于 2026-04-20
占该创作者 26%
已展示 3 / 3 个仓库
已展示全部仓库