Skip to main content
Run any Skill in Manus
with one click
$pwd:
langwatch
GitHub creator profile

langwatch

Repository-level view of 19 collected skills across 3 GitHub repositories, including approximate occupation coverage.

skills collected
19
repositories
3
occupation fields
1
updated
2026-04-28
occupation focus
Major fields detected across this creator.
repository explorer

Repositories and representative skills

#001
skills
13 skills21updated 2026-04-28
68% of creator
datasets
Software Quality Assurance Analysts & Testers

Generate realistic synthetic evaluation datasets by analyzing the user's codebase, prompts, production traces, and reference materials. Interactive, consultant-style — asks clarifying questions, proposes a plan, generates a preview for approval, then delivers a complete dataset uploaded to LangWatch. Use when user asks to generate, create, or build a dataset for evaluation, testing, or benchmarking.

2026-04-28
analytics
Data Scientists

Analyze your AI agent's performance using LangWatch analytics. Use when the user wants to understand costs, latency, error rates, usage trends, or debug specific traces. Works with any LangWatch-instrumented agent.

2026-04-24
evaluations
Software Quality Assurance Analysts & Testers

Set up comprehensive evaluations for your AI agent with LangWatch — experiments (batch testing), evaluators (scoring functions), datasets, online evaluation (production monitoring), and guardrails (real-time blocking). Supports both code (SDK) and platform (CLI) approaches. Use when the user wants to evaluate, test, benchmark, monitor, or safeguard their agent.

2026-04-24
level-up
Software Developers

Take your AI agent to the next level with full LangWatch integration. Adds tracing, prompt versioning, evaluation experiments, and simulation tests in one go. Use when the user wants comprehensive observability, testing, and prompt management for their agent.

2026-04-24
prompts
Software Developers

Version and manage your agent's prompts with LangWatch Prompts CLI. Use for both onboarding (set up prompt versioning for an entire codebase) and targeted operations (version a specific prompt, create a new prompt version). Supports Python and TypeScript.

2026-04-24
debug-instrumentation
Software Developers

Debug and improve your LangWatch traces. Inspects production traces for missing input/output, disconnected spans, unlabeled traces, and missing metadata. Use when traces look broken or incomplete.

2026-04-24
evaluate-multimodal
Software Quality Assurance Analysts & Testers

Evaluate multimodal AI agents that process images, audio, PDFs, or other files. Sets up evaluations using LangWatch's LLM-as-judge with image inputs, Scenario's multimodal testing, and document parsing evaluation patterns. Use when your agent handles non-text inputs.

2026-04-24
generate-rag-dataset
Data Scientists

Generate a synthetic evaluation dataset from your RAG knowledge base. Creates diverse Q&A pairs with expected answers and relevant context, ready for LangWatch experiments and platform import. Use when you need test data for your RAG pipeline.

2026-04-24
Showing top 8 of 13 collected skills in this repository.
#002
langwatch
5 skills3.3k321updated 2026-04-20
26% of creator
Showing 3 of 3 repositories
All repositories loaded
langwatch GitHub Skills | SkillsMP