تشغيل أي مهارة في Manus بنقرة واحدة

$pwd:

aio11y

Name: Aio11y
Author: grafana

// Use when the user wants to list or search AI Observability conversations, inspect generations, manage evaluators (create, test, delete), set up evaluation rules, check scores, or browse evaluator templates. Trigger on phrases like "list conversations", "search generations", "what did the agent do", "debug LLM conversation", "create evaluator", "set up evaluation rule", "test evaluator", "check scores", "evaluate generation quality", or "set up online evaluation".

تشغيل في Manus

$ git log --oneline --stat

stars:٣٠٢

forks:٢١

updated:٢٠ مايو ٢٠٢٦ في ٠٥:٤٠

مستكشف الملفات

2 ملفات

SKILL.md

readonly

related-skills.json

نفس المستودع

diagnose-entity-graph.md

from "grafana/gcx"

Diagnose Entity Graph problems: missing entities, missing edges, disconnected clusters, or filtering issues. Use when the user reports that Entity Graph doesn't look right, services are missing, edges aren't appearing, or environments can't be filtered. Triggers for: "entity graph is empty", "services missing from entity graph", "no edges in entity graph", "disconnected services", "can't filter entity graph", "entity graph not working", "diagnose entity graph", "debug knowledge graph".

2026-05-22302

create-dashboard.md

from "grafana/gcx"

Use when the user wants to create a new Grafana dashboard, design dashboard panels, variables, queries, or layout, or make a material visual redesign of an existing dashboard. This skill uses gcx plus `gcx dashboards snapshot` as a visual feedback loop. Triggers on "create dashboard", "new dashboard", "build dashboard", "dashboard for <service>", "improve this dashboard", or "iterate on a dashboard".

2026-05-22302

generate-resource-stubs.md

from "grafana/gcx"

Use when the user explicitly wants typed Go stub files, generated resource skeletons, or grafana-foundation-sdk builder boilerplate for dashboards or alert rules. This is scaffolding only; for designing or creating a usable dashboard with datasource discovery and snapshot iteration, use the create-dashboard skill instead. Triggers on "generate stub", "dashboard stub", "create alert rule stub", "foundation-sdk builder", or "builder boilerplate".

2026-05-22302

manage-dashboards.md

from "grafana/gcx"

Use for operational management of existing Grafana dashboards: list, get, search, create or update from an already-authored manifest, delete, inspect and restore versions, pull/push/validate/promote dashboard resource files, manage dashboard folders, or render PNG snapshots. For designing or creating a new dashboard, or for material visual/dashboard UX changes, use the create-dashboard skill instead.

2026-05-22302

debug-with-grafana.md

from "grafana/gcx"

Structured diagnostic workflow for debugging application issues using Grafana observability data. Use when the user reports errors, latency spikes, service degradation, HTTP 500s, or wants to investigate why a service is behaving unexpectedly. Triggers for: "my API is returning 500 errors", "latency is spiking", "service seems down", "help me debug using Grafana", "investigate why requests are failing", "something is wrong with my service".

2026-05-22302

oncall-triage.md

from "grafana/gcx"

Use when the user is triaging what is actively paging in Grafana OnCall, or asks about active alert groups, acknowledging or silencing or resolving fires, on-call queue, or "what's paging right now". Trigger on phrases like "what's paging", "on-call alerts", "ack this", "silence the page", "what's firing in OnCall", "show me active pages", or any reference to OnCall alert groups. For root cause of why a Grafana alert rule is evaluating (rule-side, pre-routing) use investigate-alert. For schedules, integrations, or escalation chains use the gcx skill.

2026-05-22302

package.json

"author": "grafana"

"repository": "grafana/gcx"

فتح مستودع GitHub عرض مستودعات المنشئ

$ install --global

$ download --local

تشغيل في Manus

$ useful --forSOC

مديرو الشبكات وأنظمة الحاسوبمهن الحاسوب والرياضيات15-1244L4

name	aio11y
description	Use when the user wants to list or search AI Observability conversations, inspect generations, manage evaluators (create, test, delete), set up evaluation rules, check scores, or browse evaluator templates. Trigger on phrases like "list conversations", "search generations", "what did the agent do", "debug LLM conversation", "create evaluator", "set up evaluation rule", "test evaluator", "check scores", "evaluate generation quality", or "set up online evaluation".
allowed-tools	["gcx","Bash","Read","Write","Edit"]

AI Observability

AI Observability is Grafana's AI observability platform. It records what LLM-powered applications do in production and scores the quality of their output.

Applications send generations (individual LLM API calls — request, response, model, tokens, tool calls) to AI Observability. Generations belonging to the same user session are grouped into a conversation.

Evaluators are scoring functions (LLM judge, regex, heuristic, JSON schema, etc.) that assess generation quality. Rules bind evaluators to production traffic, they select which generations to evaluate (e.g. only user-visible turns), filter by agent/model, and control sampling rate. When a rule matches a generation, AI Observability runs the bound evaluators and writes scores.

All commands live under gcx aio11y. Use gcx aio11y <subcommand> --help for flags and usage.

Command Groups

Group	Purpose
`conversations`	List, get, search conversations
`generations`	Get a single generation
`agents`	List agents, get details, view versions
`evaluators`	List, get, create, delete, test evaluators
`rules`	List, get, create, update, delete evaluation rules
`templates`	List, get built-in evaluator templates
`scores`	List scores for a generation
`judge`	List judge providers and models
`experiments`	List, get, create, update, cancel runs; inspect scores and reports

Delete commands (evaluators delete, rules delete) require -f to skip confirmation in agent mode. List first to confirm the target ID:

gcx aio11y evaluators list
gcx aio11y evaluators delete <id> -f

gcx aio11y rules list
gcx aio11y rules delete <id> -f

Deleting an evaluator referenced by a rule may leave the rule pointing at a missing evaluator — check gcx aio11y rules list after.

Conversation Search

Defaults to last 24 hours. Filter syntax: key operator "value", space-separated.

gcx aio11y conversations search --filters 'agent = "my-agent" status = "error"'
gcx aio11y conversations search --filters 'agent = "my-agent"' --from 2026-04-01T00:00:00Z --to 2026-04-14T00:00:00Z

Filter keys: model, provider, agent, agent.version, status, error.type, error.category, duration, tool.name, operation, namespace, cluster, service, generation_count, eval.passed, eval.evaluator_id, eval.score_key, eval.score

Operators: =, !=, >, <, >=, <=, =~ (regex)

Evaluator Kind Decision Table

User describes	Kind
"check if response is helpful / toxic / grounded"	`llm_judge`
"combined quality score with explanation"	`llm_judge`
"validate JSON output format"	`json_schema`
"check if response contains / doesn't contain X"	`regex`
"response must be non-empty and at least N chars"	`heuristic`
"check multiple conditions (non-empty AND has greeting)"	`heuristic`

Input Format

gcx aio11y evaluators get -o yaml and gcx aio11y rules get -o yaml emit K8s-style manifests (apiVersion/kind/metadata/spec). The create -f and update -f commands expect top-level fields only. Do not round-trip get output into create/update.

Evaluator definition:

evaluator_id: my-evaluator
kind: llm_judge
description: "..."
config:
  provider: openai
  model: gpt-4o
  system_prompt: "..."
  user_prompt: "..."
  temperature: 0
  max_tokens: 256
output_keys:
  - key: score
    type: number
    min: 1
    max: 10
    pass_threshold: 4

Rule definition:

rule_id: my-rule
enabled: true
selector: user_visible_turn
sample_rate: 1.0
evaluator_ids:
  - my-evaluator
match:
  agent_name:
    - my-agent

Evaluators use create-or-update semantics: re-creating with the same evaluator_id updates it.

Setting Up Online Evaluation

Pick a template: gcx aio11y templates list, then gcx aio11y templates get <id> -o yaml. Template output includes kind, config, and output_keys — copy these into a new evaluator definition and add your own evaluator_id. Do not pass the template output directly to evaluators create.
Write an evaluator YAML using the input format above, create: gcx aio11y evaluators create -f evaluator.yaml
Test against a real generation: gcx aio11y evaluators test -e <evaluator-id> -g <generation-id>
Iterate until the evaluator scores as expected
Write a rule YAML (see references/rule-templates.md), create: gcx aio11y rules create -f rule.yaml
Verify: gcx aio11y rules list

Rule Selectors

Selector	What it evaluates
`user_visible_turn`	Final assistant generation visible to the user
`all_assistant_generations`	Every assistant generation in the conversation
`tool_call_steps`	Tool call generations

Rule Match Keys

All values are arrays. Glob-capable keys support *, ?, [...] patterns.

Key	Glob	Description
`agent_name`	yes	Agent name
`agent_version`	yes	Agent version string
`operation_name`	yes	Operation name
`model.provider`	yes	Model provider (e.g. `openai`, `anthropic`)
`model.name`	yes	Model name (e.g. `gpt-4o`, `claude-sonnet-4-5-20250514`)
`mode`	no	`SYNC` or `STREAM`
`error.type`	no	Error type (also accepts `present`/`absent`)
`error.category`	no	Error category (also accepts `present`/`absent`)
`tags.<key>`	no	Custom tag value (e.g. `tags.env`)

aio11y

المزيد من هذا المستودع

المزيد من هذا المستودع

AI Observability

Command Groups

Conversation Search

Evaluator Kind Decision Table

Input Format

Setting Up Online Evaluation

Rule Selectors

Rule Match Keys

AI Observability

Command Groups

Conversation Search

Evaluator Kind Decision Table

Input Format

Setting Up Online Evaluation

Rule Selectors

Rule Match Keys