Name: Qa Runner
Author: dynatrace-oss

// AI-guided QA walkthrough for DSOA releases. Automates version detection, deployment commands, notebook deployment, and interactive test walkthrough. Use when a QA engineer needs to execute the DSOA release test suite.

updated: 2026-04-14 08:06

Repository: dynatrace-oss/dynatrace-snowflake-observability-agent

name	qa-runner
description	AI-guided QA walkthrough for DSOA releases. Automates version detection, deployment commands, notebook deployment, and interactive test walkthrough. Use when a QA engineer needs to execute the DSOA release test suite.
license	MIT
compatibility	opencode
metadata	{"audience":"qa-engineers, developers"}

Skill: DSOA Release QA Runner

Use this skill when asked to:

Start the QA process for a DSOA release
Walk a QA engineer through the release test suite
Deploy and open the QA test notebook
Generate a QA signoff summary

Overview

The QA runner executes five sequential phases, plus an auto-evaluation step (Phase 3.5) that runs between Phases 3 and 4. Complete each phase fully before moving to the next. Do not skip phases.

Phase	Name	Who acts	Output
1	Version discovery	AI (automated)	Verified version tags + config files
2	Deployment guidance	Human (AI provides commands)	Both environments running
3	Notebook deployment	AI (runs script)	Notebook URL
3.5	Auto-evaluation	AI (runs DQL via MCP)	Pass/fail for auto-evaluable tests
4	Test walkthrough	Interactive	Pass/fail per checklist item
5	QA signoff	AI	Markdown report file

Phase 1 — Version Discovery

Run all of the following automatically without waiting for the human.

1a. Determine current version

grep '^VERSION' src/dtagent/version.py | head -1

Store as CURR_VERSION (e.g. 0.9.4).

1b. Derive version tags

The 3-digit tag is: printf "%03d" $((minor * 10 + patch))

bash -c '
v="'"${CURR_VERSION}"'"
minor=$(echo "$v" | cut -d. -f2)
patch=$(echo "$v" | cut -d. -f3)
printf "%03d\n" $(( minor * 10 + patch ))
'

Store as CURR_TAG (e.g. 094). The deployment environment is DEV-{CURR_TAG}.

1c. Determine previous version

Default rule: decrement the patch component of CURR_VERSION by 1. Example: 0.9.4 → 0.9.3 → tag 093.

Override: If the human specifies a different previous version (e.g. because the previous release was a hotfix like 0.9.3.1), use that version instead. Ask:

"The default previous version is {auto_prev}. Is that correct, or should I use a different version (e.g. 0.9.3.1)? Type the version or press Enter to accept the default."

Store as PREV_VERSION and derive PREV_TAG using the same algorithm.
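The default rule above can be sketched as a small shell function. The name default_prev_version is hypothetical, and the handling of a patch component of 0 (return an error so the human is asked) is an assumption, since the rule does not say what to do in that case.

```shell
# Hypothetical helper: prints the default previous version for a
# three-part version string such as 0.9.4.
# Assumption: when the patch component is 0 there is no automatic
# default, so the function returns 1 and the human must be asked.
default_prev_version() {
    major=$(echo "$1" | cut -d. -f1)
    minor=$(echo "$1" | cut -d. -f2)
    patch=$(echo "$1" | cut -d. -f3)
    if [ "$patch" -le 0 ]; then
        return 1
    fi
    echo "${major}.${minor}.$(( patch - 1 ))"
}
```

For example, default_prev_version 0.9.4 prints 0.9.3, which maps to tag 093.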

1d. Verify config files

ls conf/config-dev-{CURR_TAG}.yml conf/config-dev-{PREV_TAG}.yml 2>&1

If conf/config-dev-{CURR_TAG}.yml is missing: stop and instruct the human to create it (pointing to the current Snowflake account and Dynatrace tenant).
If conf/config-dev-{PREV_TAG}.yml is missing: warn the human that cross-version comparison tiles will show only the current environment. Ask whether to proceed or to create the file first.

1e. Extract tenant info

yq '.core.dynatrace_tenant_address' conf/config-dev-{CURR_TAG}.yml
yq '.core.deployment_environment'   conf/config-dev-{CURR_TAG}.yml
yq '.core.dynatrace_tenant_address' conf/config-dev-{PREV_TAG}.yml
yq '.core.deployment_environment'   conf/config-dev-{PREV_TAG}.yml

Verify both configs point to the same dynatrace_tenant_address. If they differ, warn the human — both environments must send data to the same tenant for comparison tiles to work.

Phase 1 output

Report the following before proceeding:

Current version:   {CURR_VERSION}  (tag: {CURR_TAG},  env: DEV-{CURR_TAG})
Previous version:  {PREV_VERSION}  (tag: {PREV_TAG},  env: DEV-{PREV_TAG})
Dynatrace tenant:  {TENANT_ADDR}
Config files:      conf/config-dev-{CURR_TAG}.yml  ✓
                   conf/config-dev-{PREV_TAG}.yml  ✓ / ⚠ missing

Ask the human to confirm before proceeding to Phase 2.

Phase 2 — Deployment Guidance

Instruct the human to run the following commands. Both use --scope=all for a fresh, complete deployment of the agent into each environment.

Deploy the current version

./scripts/deploy/deploy.sh dev-{CURR_TAG} --scope=all --options=skip_confirm

Deploy the previous version

./scripts/deploy/deploy.sh dev-{PREV_TAG} --scope=all --options=skip_confirm

Important notes to share:

Both deployments must target the same Snowflake account (different schemas/roles differentiated by deployment_environment tag, not by database name).
Both deployments must target the same Dynatrace tenant.
Wait for each deployment to complete and the Snowflake task scheduler to run at least one execution cycle before proceeding.
Timing expectations:
- Most plugins start emitting telemetry within 30 minutes of the first task run.
- Some plugins (e.g. those querying heavy Snowflake views) may take several hours before their first successful execution.
- Budget-related plugins run at most once per day — their data will not appear until the daily schedule fires.
- Recommendation: deploy today, then come back the next day to perform the full test walkthrough once all plugin data is available.

After both deploys, ask:

"Have both deployments completed successfully and is telemetry appearing in Dynatrace? (yes / no / need help)"

If the human says "need help":

Check Snowflake task history for the DTAGENT task
Check agent operational logs: fetch logs | filter dsoa.run.context == "self_monitoring"
Check for ERROR-level log entries from the agent

Phase 3 — Notebook Deployment

Before running the deploy script, verify that every type: dql tile in test/qa/test-suite/test-suite.yml has showInput: false set. This hides the DQL code in the rendered notebook (the "Hide Input" option in the UI). The count of showInput: false lines must equal the count of type: dql lines:

grep -c "type: dql"      test/qa/test-suite/test-suite.yml
grep -c "showInput: false" test/qa/test-suite/test-suite.yml

If any tile is missing it, add showInput: false on the line immediately after type: dql. The command uses perl rather than sed because in-place editing and \n in replacement text behave differently in GNU and BSD sed; it also skips tiles that already have the setting:

perl -0777 -pi -e 's/^(    type: dql\n)(?!    showInput: false\b)/$1    showInput: false\n/mg' \
    test/qa/test-suite/test-suite.yml

Then run the deploy script:

./scripts/test/deploy_test_notebook.sh \
    --curr-version={CURR_VERSION} \
    --prev-version={PREV_VERSION}

The script:

Reads conf/config-dev-{CURR_TAG}.yml to get the tenant address
Finds the matching dtctl context
Converts test/qa/test-suite/test-suite.yml → JSON and injects the notebook name
Deploys via dtctl apply and prints the notebook URL
Writes the assigned notebook ID back into the YAML for future runs

If dtctl is not authenticated, instruct the human to run:

dtctl auth login

Then retry the script.

After a successful deploy, share the notebook URL with the human and ask them to confirm it opens in Dynatrace. If the notebook ID needs to be committed to the YAML, remind the human to do so after the QA session.
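The showInput count comparison from this phase could be wrapped in a function so that it yields a single verdict. This is a minimal sketch; check_show_input is a hypothetical name and is not part of the DSOA scripts.

```shell
# Hypothetical helper: compares the number of DQL tiles with the number
# of "showInput: false" lines in a notebook YAML file.
check_show_input() {
    file="$1"
    # grep -c exits non-zero when the count is 0, so guard with || true
    dql=$(grep -c "type: dql" "$file" || true)
    hidden=$(grep -c "showInput: false" "$file" || true)
    if [ "$dql" -eq "$hidden" ]; then
        echo "OK: $dql DQL tiles, all with showInput: false"
    else
        echo "MISMATCH: $dql DQL tiles, $hidden showInput: false lines"
        return 1
    fi
}
```

It would be called as check_show_input test/qa/test-suite/test-suite.yml before running the deploy script. Equal counts do not prove that every tile has the setting in the right place, so the sketch is only as strong as the manual check it mirrors.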

Phase 3.5 — Auto-Evaluation (AI runs DQL via MCP)

Run all auto-evaluable tests using the execute_dql MCP tool without waiting for the human. Do not cap at 10 — run every test in this section.

Due to the MCP rate limit (5 calls per 20 seconds), send tests in batches of 5 with a brief pause between batches if needed.
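The pacing can be illustrated with a short shell loop. In practice the AI calls the execute_dql MCP tool rather than a shell command, so this is only a sketch of the batching logic; run_in_batches, BATCH_SIZE and BATCH_PAUSE are hypothetical names.

```shell
# Hypothetical helper: runs "$runner <id>" for each test ID and pauses
# after every full batch so the rate limit is not exceeded.
# BATCH_SIZE defaults to 5 calls, BATCH_PAUSE to 20 seconds.
run_in_batches() {
    runner="$1"
    shift
    batch_size="${BATCH_SIZE:-5}"
    pause="${BATCH_PAUSE:-20}"
    sent=0
    for test_id in "$@"; do
        # Pause before starting a new batch, but not before the first one
        if [ "$sent" -gt 0 ] && [ $(( sent % batch_size )) -eq 0 ]; then
            sleep "$pause"
        fi
        "$runner" "$test_id"
        sent=$(( sent + 1 ))
    done
}
```

With the defaults, eleven tests run as batches of 5, 5 and 1 with two pauses in between.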

Substitutions:

DEV-{CURR_TAG} → current deployment environment (e.g. DEV-094)
DEV-{PREV_TAG} → previous deployment environment (e.g. DEV-093)
Default timeframe: now()-24h unless noted per test.

Each test specifies a DQL and a Pass condition. Record each result as PASS, FAIL, or SKIP (with reason). Reference the matching checklist ID (e.g. C4.9) in the report.

Should-be-empty checks (0 rows = PASS)

AE-C4.9 — No supportability.non_persisted_attribute_keys

fetch spans
| filter deployment.environment == "DEV-{CURR_TAG}"
| filter isNotNull(supportability.non_persisted_attribute_keys)
| summarize count = count()

Pass: count == 0.

AE-C5.4 — Timestamps in BizEvents are current

fetch bizevents
| filter telemetry.exporter.name == "dynatrace.snowagent"
| filter deployment.environment == "DEV-{CURR_TAG}"
| filter dsoa.run.context == "self_monitoring"
| filter dsoa.task.exec.status == "STARTED"
| fields timestamp, dsoa.run.id
| join [
  fetch logs
  | filter db.system == "snowflake"
  | filter deployment.environment == "DEV-{CURR_TAG}"
  | filter dsoa.run.context == "self_monitoring"
  | filter isNotNull(dsoa.run.id)
  | summarize {min_timestamp = min(timestamp)}, by: {dsoa.run.id}
]
, kind:leftOuter
, on: {left[dsoa.run.id] == right[dsoa.run.id]}
, fields: {min_timestamp}
| filter isNotNull(min_timestamp)
| fieldsAdd timeShift = abs(timestamp - min_timestamp)
| filterOut timeShift < 10min
| summarize count = count()

Pass: count == 0.

AE-C4.4 — Completeness span.events

fetch spans, from: now()-7d
| filter dsoa.run.context == "query_history"
| filter deployment.environment == "DEV-{CURR_TAG}"
| fields events_count = arraySize(span.events),
         events_added = snowagent.debug.span.events.added,
         events_failed = snowagent.debug.span.events.failed,
         supportability.dropped_events_count
| filter events_count + coalesce(supportability.dropped_events_count, 0) != events_added
| summarize count = count()

Pass: count == 0.

AE-C4.7 — No missing span.parent_id for child queries in same DSOA run

fetch spans, from: now()-24h
| filter db.system == "snowflake"
| filter isNotNull(dsoa.run.context)
| filter isNotNull(snowflake.query.parent_id)
| filter deployment.environment == "DEV-{CURR_TAG}"
| joinNested parent_spans = [
  fetch spans
  | filter db.system == "snowflake"
  | filter isNotNull(dsoa.run.context)
  | filter isNotNull(snowflake.query.parent_id)
  | filter deployment.environment == "DEV-{CURR_TAG}"
  | fields span.id, snowflake.query.id, dsoa.run.id
], on: {left[snowflake.query.parent_id] == right[snowflake.query.id]}
, executionOrder:leftFirst
| filterOut isNull(parent_spans)
| expand parent_spans
| fieldsFlatten parent_spans, prefix: "parent."
| filter dsoa.run.id == parent.dsoa.run.id
| filter isNull(span.parent_id)
| summarize count = count()

Pass: count == 0.

AE-C1.2 — No ERROR-level agent logs

fetch logs
| filter deployment.environment == "DEV-{CURR_TAG}"
| filter loglevel == "ERROR"
| filter db.system == "snowflake"
| summarize count = count(), by: {content}
| sort count desc

Pass: 0 rows. If rows appear, list the content values as FAIL notes.

Data-presence checks (data returned = PASS)

AE-C2.3 — Query history metrics are reported

timeseries avg(snowflake.time.execution), by: {deployment.environment}
| filter deployment.environment == "DEV-{CURR_TAG}"
| summarize count = count()

Pass: count > 0.

AE-C5.5 — Process metrics are reported

timeseries avg(process.cpu.utilization), by: {deployment.environment}
| filter deployment.environment == "DEV-{CURR_TAG}"
| summarize count = count()

Pass: count > 0.

AE-C5.7 — Self-monitoring BizEvents delivered for all plugins

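fetch bizevents
| filter db.system == "snowflake"
| filter deployment.environment == "DEV-{CURR_TAG}"
| filter dsoa.run.context == "self_monitoring"
| filter in(dsoa.task.exec.status, {"STARTED", "FINISHED"})
| summarize started = countIf(dsoa.task.exec.status == "STARTED"),
            finished = countIf(dsoa.task.exec.status == "FINISHED"),
            by: {dsoa.task.name}
| filter started == 0 or finished == 0

Pass: 0 rows (every plugin that reported has at least one STARTED and one FINISHED event). Counting per status within each task is needed because a grouped count() never yields a group with count 0. A plugin that emitted no self-monitoring events at all produces no row, so this query alone cannot detect it.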

AE-C4.6 — span.parent_id present for child queries

fetch spans
| filter db.system == "snowflake"
| filter deployment.environment == "DEV-{CURR_TAG}"
| filter isNotNull(dsoa.run.context)
| filter isNotNull(snowflake.query.parent_id)
| filter isNotNull(span.parent_id)
| summarize count = count()

Pass: count > 0.

AE-C2.1 — Budget metrics are reported

timeseries avg(snowflake.credits.limit), by: {deployment.environment}
| filter deployment.environment == "DEV-{CURR_TAG}"
| summarize count = count()

Pass: count > 0. Timing rule: The budget plugin runs at most once per day. Do not skip this check — instead, wait until at least 24 hours after deployment before evaluating. If the environment is < 24h old, defer the check and come back later.

Cross-version comparison checks

AE-C1.3 — No increase in dt.ingest.warnings (5% tolerance)

Run the query below, which returns warning counts for both environments, then compare them per warning type:

fetch logs
| filter telemetry.exporter.name == "dynatrace.snowagent"
| filter in(deployment.environment, {"DEV-{PREV_TAG}", "DEV-{CURR_TAG}"})
| filter isNotNull(dt.ingest.warnings)
| expand warning = dt.ingest.warnings
| summarize count = count(), by: {deployment.environment, warning}
| sort deployment.environment, warning

Pass condition: For each warning type, the count in DEV-{CURR_TAG} must not exceed the count in DEV-{PREV_TAG} by more than 5%. A count that is equal or lower always passes.

Formally: curr_count <= prev_count * 1.05 for each warning type. If DEV-{PREV_TAG} has 0 of a given warning and DEV-{CURR_TAG} has any, that is a FAIL. Record which warning types failed and their counts.
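The tolerance rule can be checked with integer arithmetic, which avoids floating point in the shell. This is a minimal sketch; compare_warning_counts is a hypothetical name, and the counts are assumed to be non-negative integers taken from the query result.

```shell
# Hypothetical helper: prints PASS or FAIL for one warning type,
# given the previous and the current count.
# curr <= prev * 1.05 is evaluated as curr * 100 <= prev * 105.
compare_warning_counts() {
    prev="$1"
    curr="$2"
    if [ $(( curr * 100 )) -le $(( prev * 105 )) ]; then
        echo PASS
    else
        echo FAIL
    fi
}
```

compare_warning_counts 100 105 prints PASS, compare_warning_counts 100 106 prints FAIL, and compare_warning_counts 0 1 prints FAIL, which matches the rule for a warning type that is new in the current environment.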

Auto-evaluation output

After running all tests, present the results in a table using the checklist IDs:

## Auto-Evaluation Results — DEV-{CURR_TAG}

| Test      | Description                                            | Result     | Notes |
|-----------|--------------------------------------------------------|------------|-------|
| AE-C4.9   | No non_persisted_attribute_keys                        | PASS/FAIL  |       |
| AE-C5.4   | BizEvent timestamps are current                        | PASS/FAIL  |       |
| AE-C4.4   | Completeness span.events                               | PASS/FAIL  |       |
| AE-C4.7   | No missing span.parent_id for child queries            | PASS/FAIL  |       |
| AE-C1.2   | No ERROR-level agent logs                              | PASS/FAIL  |       |
| AE-C2.3   | Query history metrics reported                         | PASS/FAIL  |       |
| AE-C5.5   | Process metrics reported                               | PASS/FAIL  |       |
| AE-C5.7   | Self-monitoring BizEvents all delivered                | PASS/FAIL  |       |
| AE-C4.6   | span.parent_id present for child queries               | PASS/FAIL  |       |
| AE-C2.1   | Budget metrics reported                                | PASS/FAIL  |       |
| AE-C1.3   | No increase in dt.ingest.warnings (5% tolerance)       | PASS/FAIL  |       |

Auto-evaluated: {N}/11 — {n} passed, {f} failed, {s} skipped

Include the full table in the Phase 5 markdown report.

Phase 4 — Test Walkthrough

Walk through test/qa/RELEASE-CHECKLIST.md section by section. For each item:

State the item description clearly
For Section A (offline): ask [PASS], [FAIL], or [SKIP reason]
For Section B (deployment): provide the exact command, ask the human to run it, then confirm the result
For Section C (live telemetry): name the exact notebook tile to check, note whether it is a [COMPARE] tile (both DEV-{PREV_TAG} and DEV-{CURR_TAG} expected), and ask for the result

Keep a running tally in memory. Only proceed to the next item after recording the current item's result.

Tile navigation hints

Tell the human to open the notebook at the URL from Phase 3. The tiles are grouped by test theme matching the checklist sections. Within each group tiles appear in checklist order.

For [COMPARE] tiles, both series must be visible. If only one series appears, it likely means the other environment has not completed a run yet — ask the human to wait and refresh.

Handling failures

When a test fails:

Ask the human to describe what they see
Suggest the most likely investigation steps (e.g. check logs for that plugin, verify the Snowflake view exists, check task is not suspended)
Record the failure with a brief note
Continue to the next item — do not block the session on a single failure

Phase 5 — QA Signoff

Generate the result summary table:

Section              | Pass | Fail | Skip | Total
---------------------|------|------|------|------
A — Offline          |      |      |      |   5
B — Deployment       |      |      |      |  10
C1 — Data Volume     |      |      |      |   8
C2 — Metrics         |      |      |      |   8
C3 — Logs            |      |      |      |   6
C4 — Spans           |      |      |      |   9
C5 — Events          |      |      |      |   7
C6 — Active Queries  |      |      |      |   4
C7 — Shares          |      |      |      |   4
C8 — Plugin Lifecycle|      |      |      |   2
Total                |      |      |      |  63

List all failed and skipped items with the human's notes.

Generate the signoff line:

DSOA {CURR_VERSION} QA — {DATE} — {PASS}/{TOTAL} items passed
Tester: {human name or "QA"}
Notebook: {NOTEBOOK_URL}

Write the markdown report

Always write the full results to a file (do not just offer — write it):

mkdir -p test/qa/results
# file: test/qa/results/qa-{CURR_VERSION}-{YYYYMMDD}.md
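The file name pattern can be filled in with a small function. This is a sketch under the assumption that the date part is the day the report is written; qa_report_path is a hypothetical name.

```shell
# Hypothetical helper: builds the report path for a version.
# The second argument (YYYYMMDD) is optional and defaults to today.
qa_report_path() {
    version="$1"
    day="${2:-$(date +%Y%m%d)}"
    echo "test/qa/results/qa-${version}-${day}.md"
}
```

qa_report_path 0.9.4 20260414 prints test/qa/results/qa-0.9.4-20260414.md.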

The report file must have the following structure:

# DSOA {CURR_VERSION} QA Report — {DATE}

**Tester:** {NAME}
**Notebook:** [{NOTEBOOK_URL}]({NOTEBOOK_URL})
**Environment:** DEV-{CURR_TAG} vs DEV-{PREV_TAG}
**Tenant:** {TENANT_ADDR}

## Signoff

> DSOA {CURR_VERSION} QA — {DATE} — {PASS}/{TOTAL} items passed

## Auto-Evaluation (AI)

| Test    | Description                                          | Result |
|---------|------------------------------------------------------|--------|
| AE-C4.9 | No non_persisted_attribute_keys                      | ...    |
...

Auto-evaluated: {N}/11 — {n} passed, {f} failed, {s} skipped

## Section Results

| Section              | Pass | Fail | Skip | Total |
|----------------------|------|------|------|-------|
| A — Offline          |      |      |      |   5   |
| B — Deployment       |      |      |      |  10   |
| C1 — Data Volume     |      |      |      |   8   |
| C2 — Metrics         |      |      |      |   8   |
| C3 — Logs            |      |      |      |   6   |
| C4 — Spans           |      |      |      |   9   |
| C5 — Events          |      |      |      |   7   |
| C6 — Active Queries  |      |      |      |   4   |
| C7 — Shares          |      |      |      |   4   |
| C8 — Plugin Lifecycle|      |      |      |   2   |
| **Total**            |      |      |      | **63**|

## Failures and Skips

### Failed items

- **{ID}** — {title}: {human's note}

### Skipped items

- **{ID}** — {title}: {reason}

## Notes

{any additional observations from the QA session}

Helper Reference

Version-to-tag algorithm (bash)

version_to_tag() {
    local version="$1"
    local minor patch
    minor=$(echo "$version" | cut -d. -f2)
    patch=$(echo "$version" | cut -d. -f3)
    printf "%03d" $(( minor * 10 + patch ))
}

Examples: 0.9.4 → 094 | 0.9.3.1 → 093 | 0.9.10 → 100
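The examples can be reproduced with a quick loop. The sketch repeats the function so that it runs on its own, and adds 0.10.0 as an extra input to show that the mapping is not one-to-one: it yields the same tag as 0.9.10, so tags alone may not distinguish such versions.

```shell
# Same algorithm as above, repeated so the sketch is self-contained.
version_to_tag() {
    minor=$(echo "$1" | cut -d. -f2)
    patch=$(echo "$1" | cut -d. -f3)
    printf "%03d" $(( minor * 10 + patch ))
}

# 0.10.0 is an extra input chosen for illustration only.
for v in 0.9.4 0.9.3.1 0.9.10 0.10.0; do
    echo "$v -> $(version_to_tag "$v")"
done
```

The loop prints 094, 093, 100 and 100 for the four inputs.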

DQL semantics — `dsoa.run.plugin` vs `dsoa.run.context`

These two attributes have distinct meanings and must never be used interchangeably in DQL queries:

Attribute	Meaning	Example values
`dsoa.run.plugin`	The plugin that emitted the telemetry	`"shares"`, `"query_history"`
`dsoa.run.context`	The specific context (sub-task) within a plugin run	`"inbound_shares"`, `"outbound_shares"`, `"shares"`

Rule: Use dsoa.run.plugin when filtering for all telemetry produced by a plugin regardless of which context within that plugin emitted it. Use dsoa.run.context only when you need to target a specific named context.

Some plugins have a single context whose name matches the plugin name — in that case both filters return the same data. However, you must still use dsoa.run.plugin when the intent is to select by plugin, to keep semantics correct and future-proof against the plugin gaining additional contexts.

Example — correct (shares events from any context):

fetch events
| filter dsoa.run.plugin == "shares"

Example — correct (shares logs from specific inbound/outbound contexts):

fetch logs
| filter in(dsoa.run.context, {"inbound_shares", "outbound_shares"})

Example — wrong (uses context instead of plugin for a plugin-level query):

fetch events
| filter dsoa.run.context == "shares"   // WRONG — should be dsoa.run.plugin

B2 — Manual agent invocation

For B2 (manual execution test), call DTAGENT once per plugin using separate CALL APP.DTAGENT(ARRAY_CONSTRUCT('<plugin>')) statements — one for each plugin. Never call with all plugins in a single ARRAY_CONSTRUCT — the snow sql CLI has a hard 2-minute timeout and a full 16-plugin run will always exceed it.

The commented call template in src/dtagent.sql/agents/700_dtagent.sql already contains the correct separate-call form. Run each line individually:

use role DTAGENT_VIEWER; use database DTAGENT_DB; use warehouse DTAGENT_WH;
call APP.DTAGENT(ARRAY_CONSTRUCT('active_queries'));
call APP.DTAGENT(ARRAY_CONSTRUCT('budgets'));
-- ... one per plugin ...
call APP.DTAGENT(ARRAY_CONSTRUCT('warehouse_usage'));

Pre-requisite — snow sql timeout configuration (MANDATORY): Before running B2, ensure the connection profile in ~/.snowflake/config.toml has both timeout values set to at least 300 seconds (5 minutes) to handle cold starts on data-heavy plugins:

[connections.snow_agent_dev-{CURR_TAG}]
# ... existing settings ...
login_timeout = 300
network_timeout = 300

login_timeout covers the initial connection handshake; network_timeout covers query execution. Both are needed. Without these, the CLI silently cuts off at ~120 seconds and returns no output or error.

Run each plugin call via:

snow sql -c snow_agent_dev-{CURR_TAG} \
    --role DTAGENT_{TAG}_OWNER \
    --warehouse DTAGENT_{TAG}_WH \
    --database DTAGENT_{TAG}_DB \
    --schema APP \
    -q "call APP.DTAGENT(ARRAY_CONSTRUCT('<plugin>'));"

After all calls, verify telemetry arrives by checking for recent FINISHED biz events per plugin:

fetch bizevents, from: now()-30m
| filter deployment.environment == "DEV-{CURR_TAG}"
| filter dsoa.task.exec.status == "FINISHED"
| summarize count = count(), by: {dsoa.task.name}
| sort dsoa.task.name asc

All 16 plugins should appear with count >= 1. Any missing plugin is a FAIL.

Note on dsoa.task.name format: Some plugins (e.g. snowpipes) are invoked with individual contexts using the $plugin:$context format, e.g. "snowpipes:snowpipes" or "snowpipes:snowpipes_copy_history,snowpipes_usage_history". This is correct and expected — do not flag these as malformed.
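When comparing the returned dsoa.task.name values against the plugin list, the plugin part can be separated from the optional context part with shell parameter expansion. This is a minimal sketch; task_plugin is a hypothetical name.

```shell
# Hypothetical helper: prints the plugin part of a dsoa.task.name value,
# which is either "plugin" or "plugin:context[,context...]".
task_plugin() {
    # Remove the longest suffix that starts with the first colon
    echo "${1%%:*}"
}
```

task_plugin "snowpipes:snowpipes_copy_history,snowpipes_usage_history" prints snowpipes, and a plain name such as budgets is returned unchanged.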

Notebook tile format notes

Markdown tiles — use the markdown: key (NOT text:). The text: key is silently ignored by Dynatrace; the rendered content comes from markdown:.

- id: my-markdown-tile
  type: markdown
  markdown: |
    ## Section heading

    Some **bold** description.

DQL tiles — always include showInput: false to hide the code by default.

Path	Purpose
`src/dtagent/version.py`	Source of truth for current version
`conf/config-dev-{TAG}.yml`	Per-environment configuration
`test/qa/RELEASE-CHECKLIST.md`	Full checklist with all items
`test/qa/test-suite/test-suite.yml`	Notebook YAML template
`scripts/test/deploy_test_notebook.sh`	Notebook deploy script
`test/qa/results/`	QA result files (create as needed)

Deploy commands quick reference

# Deploy both environments (fresh) — human only on dev-* profiles (requires DTAGENT_TOKEN)
./scripts/deploy/deploy.sh dev-{CURR_TAG} --scope=all --options=skip_confirm
./scripts/deploy/deploy.sh dev-{PREV_TAG} --scope=all --options=skip_confirm

# test-qa: AI can and must use --scope=all
./scripts/deploy/deploy.sh test-qa --scope=all --options=skip_confirm

# AI-safe re-deploy on dev-* profiles (no token needed) — plugins + agents + config only
./scripts/deploy/deploy.sh dev-{CURR_TAG} --scope=plugins,agents,config --options=skip_confirm

# Config-only update (no SQL changes)
./scripts/deploy/deploy.sh dev-{CURR_TAG} --scope=config --options=skip_confirm

# Deploy the test notebook
./scripts/test/deploy_test_notebook.sh \
    --curr-version={CURR_VERSION} \
    --prev-version={PREV_VERSION}

# Preview notebook deploy without applying
./scripts/test/deploy_test_notebook.sh --dry-run

CRITICAL — scope rules for AI-assisted deploys:

On test-qa: the AI has full DTAGENT access and must use --scope=all for fresh deployments — this is required and correct.
On dev-* profiles: --scope=all requires DTAGENT_TOKEN env-var (sends deployment biz events). The AI does not have this token on dev profiles. Use --scope=plugins,agents,config instead for AI-run deploys on dev environments.
Always run build.sh first if build artifacts are missing. Otherwise deploy will error with Build file missing: build/....

Deploy log monitoring

Never let deploy output stream directly to the tool — the log is very large and will cause tool aborts. Always background the process and tail the log:

./scripts/deploy/deploy.sh dev-{CURR_TAG} --scope=all --options=skip_confirm \
    > /tmp/deploy-{CURR_TAG}.log 2>&1 &
PID=$!   # capture the background job's PID so it can be polled
# then poll:
sleep 30 && ps -p $PID && tail -10 /tmp/deploy-{CURR_TAG}.log

Key strings to grep for in deploy logs:

String	Meaning
`Filtering out disabled plugins: ...`	Which plugins were excluded
`UPDATE_FROM_CONFIGURATIONS | OK`	Config applied successfully
`successfully created` / `successfully resumed`	Tasks are live
`^ERROR:`	Fatal deploy error

Note: snow sql garbles wide-table output (SHOW TASKS, TASK_HISTORY). Never use these to verify task state — use deploy log evidence instead.
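The key strings can be combined into a single verdict for a finished or running deploy. This is a sketch only: scan_deploy_log is a hypothetical name, and the patterns assume the log lines contain the strings exactly as listed in the table.

```shell
# Hypothetical helper: prints a short verdict for a deploy log file
# and returns 1 when a fatal error line is present.
scan_deploy_log() {
    log="$1"
    if grep -q '^ERROR:' "$log"; then
        echo "FAIL: fatal deploy error found"
        return 1
    fi
    # Success requires the config step and at least one task marker
    if grep -q 'UPDATE_FROM_CONFIGURATIONS.*OK' "$log" &&
       grep -q -e 'successfully created' -e 'successfully resumed' "$log"; then
        echo "OK: config applied and tasks live"
    else
        echo "INCOMPLETE: expected success markers not found yet"
    fi
}
```

It would be called as scan_deploy_log /tmp/deploy-{CURR_TAG}.log after the background process has exited, or repeatedly while polling.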

B-section deployment scenarios (B8–B10)

These tests use conf/config-dev-{CURR_TAG}.yml changes + redeploy on the current environment. The AI updates the config file; the human runs the deploy (or the AI runs --scope=plugins,agents,config if no token is needed).

Always restore the config to a clean state after each B8–B10 scenario before the next one. A fresh --scope=all (by the human) is the safest restore path.

B8 — Selected plugins only

Config pattern:

plugins:
  disabled_by_default: true
  deploy_disabled_plugins: false
  query_history:
    is_enabled: true
  data_volume:
    is_enabled: true
  shares:
    is_enabled: true

Deploy: --scope=all (human). Verify:

Deploy log shows Filtering out disabled plugins: <13 plugins>
Trigger enabled tasks manually: EXECUTE TASK DTAGENT_{TAG}_DB.APP.TASK_DTAGENT_QUERY_HISTORY; etc.
DQL: zero telemetry from any non-enabled plugin over last 15 min.

B9 — Config-only update

Make a non-structural config change (e.g. log_level: DEBUG → INFO). Deploy: --scope=config only (AI can run this). Verify:

No CREATE PROCEDURE / CREATE VIEW / CREATE TASK in deploy log.
UPDATE_FROM_CONFIGURATIONS → OK.
New value confirmed in Snowflake: SELECT PATH, VALUE FROM CONFIG.CONFIGURATIONS WHERE PATH = 'core.log_level';
Trigger tasks and confirm FINISHED biz events still appear.

B10 — Disabled plugin not callable

Remove a previously-enabled plugin from config (e.g. remove shares: is_enabled: true). Deploy: --scope=all (human) or --scope=plugins,agents,config (AI).

Correct verification method: Call the agent directly for the disabled plugin:

snow sql -c snow_agent_dev-{CURR_TAG} \
    --role DTAGENT_{TAG}_OWNER \
    --warehouse DTAGENT_{TAG}_WH \
    --database DTAGENT_{TAG}_DB \
    --schema APP \
    -q "CALL APP.DTAGENT(ARRAY_CONSTRUCT('shares'));"

Expected result: "not_implemented" — the plugin code is excluded from the DTAGENT procedure at compile time. Do not check for absent SQL objects — the deploy does not DROP pre-existing views or tasks; enforcement is at the Python code level inside the stored procedure.

Confirm no telemetry from the plugin in the 5 minutes after deploy:

fetch logs, from: now()-5m
| filter deployment.environment == "DEV-{CURR_TAG}"
| filter dsoa.run.plugin == "shares"
| filter dsoa.run.context != "self_monitoring"
| summarize count = count()

Pass: count == 0.

name	qa-runner
description	AI-guided QA walkthrough for DSOA releases. Automates version detection, deployment commands, notebook deployment, and interactive test walkthrough. Use when a QA engineer needs to execute the DSOA release test suite.
license	MIT
compatibility	opencode
metadata	{"audience":"qa-engineers, developers"}

Skill: DSOA Release QA Runner

Use this skill when asked to:

Start the QA process for a DSOA release
Walk a QA engineer through the release test suite
Deploy and open the QA test notebook
Generate a QA signoff summary

Overview

The QA runner executes in five sequential phases. Complete each phase fully before moving to the next. Do not skip phases.

Phase	Name	Who acts	Output
1	Version discovery	AI (automated)	Verified version tags + config files
2	Deployment guidance	Human (AI provides commands)	Both environments running
3	Notebook deployment	AI (runs script)	Notebook URL
3.5	Auto-evaluation	AI (runs DQL via MCP)	Pass/fail for auto-evaluable tests
4	Test walkthrough	Interactive	Pass/fail per checklist item
5	QA signoff	AI	Markdown report file

Phase 1 — Version Discovery

Run all of the following automatically without waiting for the human.

1a. Determine current version

grep '^VERSION' src/dtagent/version.py | head -1

Store as CURR_VERSION (e.g. 0.9.4).

1b. Derive version tags

The 3-digit tag is: printf "%03d" $((minor * 10 + patch))

bash -c '
v="'"${CURR_VERSION}"'"
minor=$(echo "$v" | cut -d. -f2)
patch=$(echo "$v" | cut -d. -f3)
printf "%03d\n" $(( minor * 10 + patch ))
'

Store as CURR_TAG (e.g. 094). The deployment environment is DEV-${CURR_TAG}.

1c. Determine previous version

Default rule: decrement the patch component of CURR_VERSION by 1. Example: 0.9.4 → 0.9.3 → tag 093.

Override: If the human specifies a different previous version (e.g. because the previous release was a hotfix like 0.9.3.1), use that version instead. Ask:

"The default previous version is {auto_prev}. Is that correct, or should I use a different version (e.g. 0.9.3.1)? Type the version or press Enter to accept the default."

Store as PREV_VERSION and derive PREV_TAG using the same algorithm.

1d. Verify config files

ls conf/config-dev-{CURR_TAG}.yml conf/config-dev-{PREV_TAG}.yml 2>&1

If conf/config-dev-{CURR_TAG}.yml is missing: stop and instruct the human to create it (pointing to the current Snowflake account and Dynatrace tenant).
If conf/config-dev-{PREV_TAG}.yml is missing: warn the human that cross-version comparison tiles will show only the current environment. Ask whether to proceed or to create the file first.

1e. Extract tenant info

yq '.core.dynatrace_tenant_address' conf/config-dev-{CURR_TAG}.yml
yq '.core.deployment_environment'   conf/config-dev-{CURR_TAG}.yml
yq '.core.dynatrace_tenant_address' conf/config-dev-{PREV_TAG}.yml
yq '.core.deployment_environment'   conf/config-dev-{PREV_TAG}.yml

Verify both configs point to the same dynatrace_tenant_address. If they differ, warn the human — both environments must send data to the same tenant for comparison tiles to work.

Phase 1 output

Report the following before proceeding:

Current version:   {CURR_VERSION}  (tag: {CURR_TAG},  env: DEV-{CURR_TAG})
Previous version:  {PREV_VERSION}  (tag: {PREV_TAG},  env: DEV-{PREV_TAG})
Dynatrace tenant:  {TENANT_ADDR}
Config files:      conf/config-dev-{CURR_TAG}.yml  ✓
                   conf/config-dev-{PREV_TAG}.yml  ✓ / ⚠ missing

Ask the human to confirm before proceeding to Phase 2.

Phase 2 — Deployment Guidance

Instruct the human to run the following commands. Both use --scope=all for a fresh, complete deployment of the agent into each environment.

Deploy the current version

./scripts/deploy/deploy.sh dev-{CURR_TAG} --scope=all --options=skip_confirm

Deploy the previous version

./scripts/deploy/deploy.sh dev-{PREV_TAG} --scope=all --options=skip_confirm

Important notes to share:

Both deployments must target the same Snowflake account (different schemas/roles differentiated by deployment_environment tag, not by database name).
Both deployments must target the same Dynatrace tenant.
Wait for each deployment to complete and the Snowflake task scheduler to run at least one execution cycle before proceeding.
Timing expectations:
- Most plugins start emitting telemetry within 30 minutes of the first task run.
- Some plugins (e.g. those querying heavy Snowflake views) may take several hours before their first successful execution.
- Budget-related plugins run at most once per day — their data will not appear until the daily schedule fires.
- Recommendation: deploy today, then come back the next day to perform the full test walkthrough once all plugin data is available.

After both deploys, ask:

"Have both deployments completed successfully and is telemetry appearing in Dynatrace? (yes / no / need help)"

If the human says "need help":

Check Snowflake task history for the DTAGENT task
Check agent operational logs: fetch logs | filter dsoa.run.context == "self_monitoring"
Check for ERROR-level log entries from the agent

Phase 3 — Notebook Deployment

Run the deploy script:

./scripts/test/deploy_test_notebook.sh \
    --curr-version={CURR_VERSION} \
    --prev-version={PREV_VERSION}

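Before running the script, confirm that every DQL tile in the notebook template sets showInput: false. The two counts below should match:

grep -c "type: dql"      test/qa/test-suite/test-suite.yml
grep -c "showInput: false" test/qa/test-suite/test-suite.yml

If any tile is missing it, add showInput: false on the line immediately after type: dql. The command below leaves tiles that already have the setting untouched:

perl -0777 -pi -e 's/^(    type: dql\n)(?!    showInput: false)/$1    showInput: false\n/mg' \
    test/qa/test-suite/test-suite.yml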

The script:

Reads conf/config-dev-{CURR_TAG}.yml to get the tenant address
Finds the matching dtctl context
Converts test/qa/test-suite/test-suite.yml → JSON and injects the notebook name
Deploys via dtctl apply and prints the notebook URL
Writes the assigned notebook ID back into the YAML for future runs

If dtctl is not authenticated, instruct the human to run:

dtctl auth login

Then retry the script.

Phase 3.5 — Auto-Evaluation (AI runs DQL via MCP)

Run all auto-evaluable tests using the execute_dql MCP tool without waiting for the human. Do not cap at 10 — run every test in this section.

Due to the MCP rate limit (5 calls per 20 seconds), send tests in batches of 5 with a brief pause between batches if needed.
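
The pacing logic can be sketched in shell. This only illustrates the batching: execute_dql is an MCP tool, not a shell command, so run_test below is a hypothetical placeholder for one call, and run_in_batches is likewise an illustrative name.

```shell
# Placeholder for one execute_dql MCP call; replace with the real invocation.
run_test() {
    echo "running $1"
}

# Run the given test IDs, pausing before each new batch.
# $1 = batch size, $2 = pause in seconds, remaining args = test IDs
run_in_batches() {
    local batch_size="$1" pause="$2"
    shift 2
    local n=0 test_id
    for test_id in "$@"; do
        # Pause before starting each new batch (but not before the first).
        if [ "$n" -gt 0 ] && [ $(( n % batch_size )) -eq 0 ]; then
            sleep "$pause"
        fi
        run_test "$test_id"
        n=$(( n + 1 ))
    done
}

# Example: run_in_batches 5 20 AE-C4.9 AE-C5.4 AE-C4.4 AE-C4.7 AE-C1.2 AE-C2.3
```

Pausing a full 20 seconds between batches of 5 is deliberately conservative: it keeps any 20-second window at or below 5 calls regardless of how quickly each call returns.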

Substitutions:

DEV-{CURR_TAG} → current deployment environment (e.g. DEV-094)
DEV-{PREV_TAG} → previous deployment environment (e.g. DEV-093)
Default timeframe: now()-24h unless noted per test.

Each test specifies a DQL and a Pass condition. Record each result as PASS, FAIL, or SKIP (with reason). Reference the matching checklist ID (e.g. C4.9) in the report.

Should-be-empty checks (0 rows = PASS)

AE-C4.9 — No supportability.non_persisted_attribute_keys

fetch spans
| filter deployment.environment == "DEV-{CURR_TAG}"
| filter isNotNull(supportability.non_persisted_attribute_keys)
| summarize count = count()

Pass: count == 0.

AE-C5.4 — Timestamps in BizEvents are current

fetch bizevents
| filter telemetry.exporter.name == "dynatrace.snowagent"
| filter deployment.environment == "DEV-{CURR_TAG}"
| filter dsoa.run.context == "self_monitoring"
| filter dsoa.task.exec.status == "STARTED"
| fields timestamp, dsoa.run.id
| join [
  fetch logs
  | filter db.system == "snowflake"
  | filter deployment.environment == "DEV-{CURR_TAG}"
  | filter dsoa.run.context == "self_monitoring"
  | filter isNotNull(dsoa.run.id)
  | summarize {min_timestamp = min(timestamp)}, by: {dsoa.run.id}
]
, kind:leftOuter
, on: {left[dsoa.run.id] == right[dsoa.run.id]}
, fields: {min_timestamp}
| filter isNotNull(min_timestamp)
| fieldsAdd timeShift = abs(timestamp - min_timestamp)
| filterOut timeShift < 10min
| summarize count = count()

Pass: count == 0.

AE-C4.4 — Completeness span.events

fetch spans, from: now()-7d
| filter dsoa.run.context == "query_history"
| filter deployment.environment == "DEV-{CURR_TAG}"
| fields events_count = arraySize(span.events),
         events_added = snowagent.debug.span.events.added,
         events_failed = snowagent.debug.span.events.failed,
         supportability.dropped_events_count
| filter events_count + coalesce(supportability.dropped_events_count, 0) != events_added
| summarize count = count()

Pass: count == 0.

AE-C4.7 — No missing span.parent_id for child queries in same DSOA run

fetch spans, from: now()-24h
| filter db.system == "snowflake"
| filter isNotNull(dsoa.run.context)
| filter isNotNull(snowflake.query.parent_id)
| filter deployment.environment == "DEV-{CURR_TAG}"
| joinNested parent_spans = [
  fetch spans
  | filter db.system == "snowflake"
  | filter isNotNull(dsoa.run.context)
  | filter isNotNull(snowflake.query.parent_id)
  | filter deployment.environment == "DEV-{CURR_TAG}"
  | fields span.id, snowflake.query.id, dsoa.run.id
], on: {left[snowflake.query.parent_id] == right[snowflake.query.id]}
, executionOrder:leftFirst
| filterOut isNull(parent_spans)
| expand parent_spans
| fieldsFlatten parent_spans, prefix: "parent."
| filter dsoa.run.id == parent.dsoa.run.id
| filter isNull(span.parent_id)
| summarize count = count()

Pass: count == 0.

AE-C1.2 — No ERROR-level agent logs

fetch logs
| filter deployment.environment == "DEV-{CURR_TAG}"
| filter loglevel == "ERROR"
| filter db.system == "snowflake"
| summarize count = count(), by: {content}
| sort count desc

Pass: 0 rows. If rows appear, list the content values as FAIL notes.

Data-presence checks (data returned = PASS)

AE-C2.3 — Query history metrics are reported

timeseries avg(snowflake.time.execution), by: {deployment.environment}
| filter deployment.environment == "DEV-{CURR_TAG}"
| summarize count = count()

Pass: count > 0.

AE-C5.5 — Process metrics are reported

timeseries avg(process.cpu.utilization), by: {deployment.environment}
| filter deployment.environment == "DEV-{CURR_TAG}"
| summarize count = count()

Pass: count > 0.

AE-C5.7 — Self-monitoring BizEvents delivered for all plugins

fetch bizevents
| filter db.system == "snowflake"
| filter deployment.environment == "DEV-{CURR_TAG}"
| filter dsoa.run.context == "self_monitoring"
| filter in(dsoa.task.exec.status, {"STARTED", "FINISHED"})
| summarize count = count(), by: {dsoa.task.name, dsoa.task.exec.status}
| filter count == 0

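Pass: 0 rows. Unlike the other checks in this group, an empty result is the passing outcome here. Because summarize only emits groups that contain at least one record, a plugin that sent no events at all produces no row, so also confirm that every expected dsoa.task.name appears with both STARTED and FINISHED.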

AE-C4.6 — span.parent_id present for child queries

fetch spans
| filter db.system == "snowflake"
| filter deployment.environment == "DEV-{CURR_TAG}"
| filter isNotNull(dsoa.run.context)
| filter isNotNull(snowflake.query.parent_id)
| filter isNotNull(span.parent_id)
| summarize count = count()

Pass: count > 0.

AE-C2.1 — Budget metrics are reported

timeseries avg(snowflake.credits.limit), by: {deployment.environment}
| filter deployment.environment == "DEV-{CURR_TAG}"
| summarize count = count()

Pass: count > 0.

Cross-version comparison checks

AE-C1.3 — No increase in dt.ingest.warnings (5% tolerance)

Run the query below, which covers both environments, then compare the counts per warning:

fetch logs
| filter telemetry.exporter.name == "dynatrace.snowagent"
| filter in(deployment.environment, {"DEV-{PREV_TAG}", "DEV-{CURR_TAG}"})
| filter isNotNull(dt.ingest.warnings)
| expand warning = dt.ingest.warnings
| summarize count = count(), by: {deployment.environment, warning}
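| sort deployment.environment, warning

Pass: for each warning, the count in DEV-{CURR_TAG} is no more than 5% above the count in DEV-{PREV_TAG}.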
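
The 5% tolerance comparison can be expressed with integer arithmetic. This is a sketch under an assumption: the tolerance is read as relative to the previous environment's count for the same warning. The helper name within_tolerance is hypothetical.

```shell
# Hypothetical helper: succeed when curr is at most pct percent above prev.
# Uses integer arithmetic only: curr * 100 <= prev * (100 + pct)
# $1 = count in previous env, $2 = count in current env, $3 = tolerance (default 5)
within_tolerance() {
    local prev="$1" curr="$2" pct="${3:-5}"
    [ $(( curr * 100 )) -le $(( prev * (100 + pct) )) ]
}
```

With this reading, within_tolerance 100 105 succeeds and within_tolerance 100 106 fails. A warning that is absent in the previous environment (prev = 0) fails as soon as it appears once in the current one.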

Auto-evaluation output

After running all tests, present the results in a table using the checklist IDs:

## Auto-Evaluation Results — DEV-{CURR_TAG}

| Test      | Description                                            | Result     | Notes |
|-----------|--------------------------------------------------------|------------|-------|
| AE-C4.9   | No non_persisted_attribute_keys                        | PASS/FAIL  |       |
| AE-C5.4   | BizEvent timestamps are current                        | PASS/FAIL  |       |
| AE-C4.4   | Completeness span.events                               | PASS/FAIL  |       |
| AE-C4.7   | No missing span.parent_id for child queries            | PASS/FAIL  |       |
| AE-C1.2   | No ERROR-level agent logs                              | PASS/FAIL  |       |
| AE-C2.3   | Query history metrics reported                         | PASS/FAIL  |       |
| AE-C5.5   | Process metrics reported                               | PASS/FAIL  |       |
| AE-C5.7   | Self-monitoring BizEvents all delivered                | PASS/FAIL  |       |
| AE-C4.6   | span.parent_id present for child queries               | PASS/FAIL  |       |
| AE-C2.1   | Budget metrics reported                                | PASS/FAIL  |       |
| AE-C1.3   | No increase in dt.ingest.warnings (5% tolerance)       | PASS/FAIL  |       |

Auto-evaluated: {N}/11 — {n} passed, {f} failed

Include the full table in the Phase 5 markdown report.

Phase 4 — Test Walkthrough

Walk through test/qa/RELEASE-CHECKLIST.md section by section. For each item:

State the item description clearly
For Section A (offline): ask [PASS], [FAIL], or [SKIP reason]
For Section B (deployment): provide the exact command, ask the human to run it, then confirm the result
For Section C (live telemetry): name the exact notebook tile to check, note whether it is a [COMPARE] tile (both DEV-{PREV} and DEV-{CURR} expected), and ask for the result

Keep a running tally in memory. Only proceed to the next item after recording the current item's result.
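
If the tally is also kept in a scratch file, it can be summarised mechanically. This is a sketch assuming a hypothetical one-line-per-item format of "<item-id> <PASS|FAIL|SKIP> [note]"; neither the format nor the name tally_results comes from the repository.

```shell
# Hypothetical helper: summarise recorded results.
# stdin: one line per item, "<item-id> <PASS|FAIL|SKIP> [note...]"
tally_results() {
    awk '
        { count[$2]++ }
        END {
            printf "pass=%d fail=%d skip=%d total=%d\n",
                count["PASS"], count["FAIL"], count["SKIP"], NR
        }'
}
```

Piping the recorded lines into tally_results yields the Pass, Fail, Skip, and Total figures needed for the Phase 5 summary.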

Tile navigation hints

Tell the human to open the notebook at the URL from Phase 3. The tiles are grouped by test theme matching the checklist sections. Within each group tiles appear in checklist order.

For [COMPARE] tiles, both series must be visible. If only one series appears, it likely means the other environment has not completed a run yet — ask the human to wait and refresh.

Handling failures

When a test fails:

Ask the human to describe what they see
Suggest the most likely investigation steps (e.g. check logs for that plugin, verify the Snowflake view exists, check task is not suspended)
Record the failure with a brief note
Continue to the next item — do not block the session on a single failure

Phase 5 — QA Signoff

Generate the result summary table:

Section              | Pass | Fail | Skip | Total
---------------------|------|------|------|------
A — Offline          |      |      |      |   5
B — Deployment       |      |      |      |  10
C1 — Data Volume     |      |      |      |   8
C2 — Metrics         |      |      |      |   8
C3 — Logs            |      |      |      |   6
C4 — Spans           |      |      |      |   9
C5 — Events          |      |      |      |   7
C6 — Active Queries  |      |      |      |   4
C7 — Shares          |      |      |      |   4
C8 — Plugin Lifecycle|      |      |      |   2
Total                |      |      |      |  63

List all failed and skipped items with the human's notes.

Generate the signoff line:

DSOA {CURR_VERSION} QA — {DATE} — {PASS}/{TOTAL} items passed
Tester: {human name or "QA"}
Notebook: {NOTEBOOK_URL}

Write the markdown report

Always write the full results to a file (do not just offer — write it):

mkdir -p test/qa/results
# file: test/qa/results/qa-{CURR_VERSION}-{YYYYMMDD}.md
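
The file name pattern can be filled in with a small helper. This is a sketch only; report_path is a hypothetical name, and defaulting the date to today is an assumption.

```shell
# Hypothetical helper: build the report path for a version and date.
# $1 = CURR_VERSION, $2 = date as YYYYMMDD (defaults to today)
report_path() {
    local version="$1" day="${2:-$(date +%Y%m%d)}"
    echo "test/qa/results/qa-${version}-${day}.md"
}
```

For example, report_path 0.9.4 20260414 prints test/qa/results/qa-0.9.4-20260414.md.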

The report file must have the following structure:

# DSOA {CURR_VERSION} QA Report — {DATE}

**Tester:** {NAME}
**Notebook:** [{NOTEBOOK_URL}]({NOTEBOOK_URL})
**Environment:** DEV-{CURR_TAG} vs DEV-{PREV_TAG}
**Tenant:** {TENANT_ADDR}

## Signoff

> DSOA {CURR_VERSION} QA — {DATE} — {PASS}/{TOTAL} items passed

## Auto-Evaluation (AI)

| Test    | Description                                          | Result |
|---------|------------------------------------------------------|--------|
| AE-C4.9 | No non_persisted_attribute_keys                      | ...    |
...

Auto-evaluated: {N}/11 — {n} passed, {f} failed, {s} skipped

## Section Results

| Section              | Pass | Fail | Skip | Total |
|----------------------|------|------|------|-------|
| A — Offline          |      |      |      |   5   |
| B — Deployment       |      |      |      |  10   |
| C1 — Data Volume     |      |      |      |   8   |
| C2 — Metrics         |      |      |      |   8   |
| C3 — Logs            |      |      |      |   6   |
| C4 — Spans           |      |      |      |   9   |
| C5 — Events          |      |      |      |   7   |
| C6 — Active Queries  |      |      |      |   4   |
| C7 — Shares          |      |      |      |   4   |
| C8 — Plugin Lifecycle|      |      |      |   2   |
| **Total**            |      |      |      | **63**|

## Failures and Skips

### Failed items

- **{ID}** — {title}: {human's note}

### Skipped items

- **{ID}** — {title}: {reason}

## Notes

{any additional observations from the QA session}

Helper Reference

Version-to-tag algorithm (bash)

version_to_tag() {
    local version="$1"
    local minor patch
    minor=$(echo "$version" | cut -d. -f2)
    patch=$(echo "$version" | cut -d. -f3)
    printf "%03d" $(( minor * 10 + patch ))
}

Examples: 0.9.4 → 094 | 0.9.3.1 → 093 | 0.9.10 → 100

DQL semantics — `dsoa.run.plugin` vs `dsoa.run.context`

These two attributes have distinct meanings and must never be used interchangeably in DQL queries:

| Attribute | Meaning | Example values |
|-----------|---------|----------------|
| `dsoa.run.plugin` | The plugin that emitted the telemetry | `"shares"`, `"query_history"` |
| `dsoa.run.context` | The specific context (sub-task) within a plugin run | `"inbound_shares"`, `"outbound_shares"`, `"shares"` |

Example — correct (shares events from any context):

fetch events
| filter dsoa.run.plugin == "shares"

Example — correct (shares logs from specific inbound/outbound contexts):

fetch logs
| filter in(dsoa.run.context, {"inbound_shares", "outbound_shares"})

Example — wrong (uses context instead of plugin for a plugin-level query):

fetch events
| filter dsoa.run.context == "shares"   // WRONG — should be dsoa.run.plugin

B2 — Manual agent invocation

The commented call template in src/dtagent.sql/agents/700_dtagent.sql already contains the correct separate-call form. Run each line individually:

use role DTAGENT_VIEWER; use database DTAGENT_DB; use warehouse DTAGENT_WH;
call APP.DTAGENT(ARRAY_CONSTRUCT('active_queries'));
call APP.DTAGENT(ARRAY_CONSTRUCT('budgets'));
-- ... one per plugin ...
call APP.DTAGENT(ARRAY_CONSTRUCT('warehouse_usage'));

Set generous timeouts on the Snowflake CLI connection used for these calls:

[connections.snow_agent_dev-{CURR_TAG}]
# ... existing settings ...
login_timeout = 300
network_timeout = 300

Run each plugin call via:

snow sql -c snow_agent_dev-{CURR_TAG} \
    --role DTAGENT_{TAG}_OWNER \
    --warehouse DTAGENT_{TAG}_WH \
    --database DTAGENT_{TAG}_DB \
    --schema APP \
    -q "call APP.DTAGENT(ARRAY_CONSTRUCT('<plugin>'));"

After all calls, verify telemetry arrives by checking for recent FINISHED biz events per plugin:

fetch bizevents, from: now()-30m
| filter deployment.environment == "DEV-{CURR_TAG}"
| filter dsoa.task.exec.status == "FINISHED"
| summarize count = count(), by: {dsoa.task.name}
| sort dsoa.task.name asc

All 16 plugins should appear with count >= 1. Any missing plugin is a FAIL.
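
Spotting a missing plugin is a set difference between the expected names and the names the query returned. This is a sketch under assumptions: both lists are saved one name per line, dsoa.task.name values are directly comparable to the expected names, and missing_plugins is a hypothetical helper.

```shell
# Hypothetical helper: print expected plugin names that were not observed.
# $1 = file listing expected plugin names, one per line
# $2 = file listing dsoa.task.name values returned by the query, one per line
missing_plugins() {
    # -v: invert, -x: whole-line match, -F: fixed strings, -f: patterns from file
    grep -vxF -f "$2" "$1" || true
}
```

Empty output means every expected plugin reported a FINISHED event; each printed name is a FAIL.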

Notebook tile format notes

Markdown tiles — use the markdown: key (NOT text:). The text: key is silently ignored by Dynatrace; the rendered content comes from markdown:.

- id: my-markdown-tile
  type: markdown
  markdown: |
    ## Section heading

    Some **bold** description.

DQL tiles — always include showInput: false to hide the code by default.

Key files and directories:

| Path | Purpose |
|------|---------|
| `src/dtagent/version.py` | Source of truth for current version |
| `conf/config-dev-{TAG}.yml` | Per-environment configuration |
| `test/qa/RELEASE-CHECKLIST.md` | Full checklist with all items |
| `test/qa/test-suite/test-suite.yml` | Notebook YAML template |
| `scripts/test/deploy_test_notebook.sh` | Notebook deploy script |
| `test/qa/results/` | QA result files (create as needed) |

Deploy commands quick reference

# Deploy both environments (fresh) — human only on dev-* profiles (requires DTAGENT_TOKEN)
./scripts/deploy/deploy.sh dev-{CURR_TAG} --scope=all --options=skip_confirm
./scripts/deploy/deploy.sh dev-{PREV_TAG} --scope=all --options=skip_confirm

# test-qa: AI can and must use --scope=all
./scripts/deploy/deploy.sh test-qa --scope=all --options=skip_confirm

# AI-safe re-deploy on dev-* profiles (no token needed) — plugins + agents + config only
./scripts/deploy/deploy.sh dev-{CURR_TAG} --scope=plugins,agents,config --options=skip_confirm

# Config-only update (no SQL changes)
./scripts/deploy/deploy.sh dev-{CURR_TAG} --scope=config --options=skip_confirm

# Deploy the test notebook
./scripts/test/deploy_test_notebook.sh \
    --curr-version={CURR_VERSION} \
    --prev-version={PREV_VERSION}

# Preview notebook deploy without applying
./scripts/test/deploy_test_notebook.sh --dry-run

CRITICAL — scope rules for AI-assisted deploys:

On test-qa: the AI has full DTAGENT access and must use --scope=all for fresh deployments — this is required and correct.
On dev-* profiles: --scope=all requires DTAGENT_TOKEN env-var (sends deployment biz events). The AI does not have this token on dev profiles. Use --scope=plugins,agents,config instead for AI-run deploys on dev environments.
Always run build.sh first if build artifacts are missing; otherwise deploy fails with Build file missing: build/....

Deploy log monitoring

Never let deploy output stream directly to the tool — the log is very large and will cause tool aborts. Always background the process and tail the log:

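./scripts/deploy/deploy.sh dev-{CURR_TAG} --scope=all --options=skip_confirm \
    > /tmp/deploy-{CURR_TAG}.log 2>&1 &
PID=$!   # capture the background job's PID so it can be polled
# then poll (ps reports whether the deploy is still running; tail runs either way):
sleep 30; ps -p "$PID"; tail -n 10 /tmp/deploy-{CURR_TAG}.log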

Key strings to grep for in deploy logs:

| String | Meaning |
|--------|---------|
| `Filtering out disabled plugins: ...` | Which plugins were excluded |
| `UPDATE_FROM_CONFIGURATIONS \| OK` | Config applied successfully |
| `successfully created` / `successfully resumed` | Tasks are live |
| `^ERROR:` | Fatal deploy error |

Note: snow sql garbles wide-table output (SHOW TASKS, TASK_HISTORY). Never use these to verify task state — use deploy log evidence instead.
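
The key strings above can drive a simple status check on the log file. This is a sketch under assumptions: the success marker is taken to appear literally as UPDATE_FROM_CONFIGURATIONS | OK with single spaces, and the name deploy_log_status and the words it prints are hypothetical.

```shell
# Hypothetical helper: classify a deploy log by the key strings listed above.
# $1 = path to the deploy log
deploy_log_status() {
    local log="$1"
    if grep -q '^ERROR:' "$log"; then
        echo "error"
    elif grep -qF 'UPDATE_FROM_CONFIGURATIONS | OK' "$log"; then
        echo "config-applied"
    else
        echo "in-progress"
    fi
}
```

Checking for the error marker first means a log that contains both strings is still reported as failed.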

B-section deployment scenarios (B8–B10)

Always restore the config to a clean state after each B8–B10 scenario before the next one. A fresh --scope=all (by the human) is the safest restore path.

B8 — Selected plugins only

Config pattern:

plugins:
  disabled_by_default: true
  deploy_disabled_plugins: false
  query_history:
    is_enabled: true
  data_volume:
    is_enabled: true
  shares:
    is_enabled: true

Deploy: --scope=all (human). Verify:

Deploy log shows Filtering out disabled plugins: <13 plugins>
Trigger enabled tasks manually: EXECUTE TASK DTAGENT_{TAG}_DB.APP.TASK_DTAGENT_QUERY_HISTORY; etc.
DQL: zero telemetry from any non-enabled plugin over last 15 min.

B9 — Config-only update

Make a non-structural config change (e.g. log_level: DEBUG → INFO). Deploy: --scope=config only (AI can run this). Verify:

No CREATE PROCEDURE / CREATE VIEW / CREATE TASK in deploy log.
UPDATE_FROM_CONFIGURATIONS → OK.
New value confirmed in Snowflake: SELECT PATH, VALUE FROM CONFIG.CONFIGURATIONS WHERE PATH = 'core.log_level';
Trigger tasks and confirm FINISHED biz events still appear.
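
The first two log checks for B9 can be combined into one test against the deploy log. This is a sketch under assumptions: the DDL may appear with or without OR REPLACE and in any letter case, and config_only_deploy_ok is a hypothetical name.

```shell
# Hypothetical helper: succeed when the log looks like a config-only deploy.
# $1 = path to the deploy log
config_only_deploy_ok() {
    local log="$1"
    # Fail if any structural DDL appears (assumption: "OR REPLACE" may be present).
    if grep -qiE 'CREATE( OR REPLACE)? (PROCEDURE|VIEW|TASK)' "$log"; then
        return 1
    fi
    # Require evidence that the configuration update ran.
    grep -qF 'UPDATE_FROM_CONFIGURATIONS' "$log"
}
```

The Snowflake query and the FINISHED biz event check still need to be run separately; this helper only covers the log evidence.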

B10 — Disabled plugin not callable

Remove a previously-enabled plugin from config (e.g. remove shares: is_enabled: true). Deploy: --scope=all (human) or --scope=plugins,agents,config (AI).

Correct verification method: Call the agent directly for the disabled plugin:

snow sql -c snow_agent_dev-{CURR_TAG} \
    --role DTAGENT_{TAG}_OWNER \
    --warehouse DTAGENT_{TAG}_WH \
    --database DTAGENT_{TAG}_DB \
    --schema APP \
    -q "CALL APP.DTAGENT(ARRAY_CONSTRUCT('shares'));"

Confirm no telemetry from the plugin in the 5 minutes after deploy:

fetch logs, from: now()-5m
| filter deployment.environment == "DEV-{CURR_TAG}"
| filter dsoa.run.plugin == "shares"
| filter dsoa.run.context != "self_monitoring"
| summarize count = count()

Pass: count == 0.
