원클릭으로 Manus에서 모든 스킬 실행

$pwd:

examining-discovery-spaces

Name: Examining Discovery Spaces
Author: IBM

// End-to-end workflow to examine and summarise an ado discoveryspace — fetch space YAML, describe entity and measurement space structure, assess sampling coverage, export measurement data, and find related resources. Use when the user asks to inspect, summarise, debug, or analyse a discoveryspace; wants to understand dimensions, experiments, or sampling coverage; provides a space ID or asks to use --use-latest for the current space.

Manus에서 실행

$ git log --oneline --stat

stars:48

forks:5

updated:2026년 4월 17일 15:37

SKILL.md

readonly

name

examining-discovery-spaces

description

End-to-end workflow to examine and summarise an ado discoveryspace — fetch space YAML, describe entity and measurement space structure, assess sampling coverage, export measurement data, and find related resources. Use when the user asks to inspect, summarise, debug, or analyse a discoveryspace; wants to understand dimensions, experiments, or sampling coverage; provides a space ID or asks to use --use-latest for the current space.

Examining ado Discovery Spaces

Structured workflow for understanding what a discoveryspace contains, how covered its entity space is, and what data has been collected.

Run all commands from the repository root with uv run.
Write the report to reports/<ado_context_name>/ (create the directory if needed)
- where ado_context_name is the active ado metastore context (uv run ado context)
Write the report as <SPACEID>_<YYYY-MM-DD>_report.md

Related skills:

For CLI verification and command spelling, see using-ado-cli.
For metastore filtering and schemas, see query-ado-data.
For examining operations run on a space, see examining-ado-operations.
For a project/context wide view (all spaces and operations), see examining-ado-project.

Context

Operations and DiscoverySpaces

discoveryspaces (or spaces for short) define a set of points (entities) and how to measure them. They also contain the results of the measurements
operations operate on discovery spaces either selecting or measuring points or analysing existing measurements

Terminology: Distinguishing Entities in a DiscoverySpace

When working with the data from a discoveryspace the following distinctions are important.

Measured: These entities have been measured by an operation on the space
Unmeasured: These entities have not been measured by an operation on the space

The samplestore used by a discovery space is shared. This means there may be relevant measurement data in the samplestore for entities in the space but that measurement has not been performed by an operation on the space (it was performed on another).

Matching: Data in the spaces samplestore that matches the space definition - it includes measured entities
Missing: Entities that have no matching data in the samplestore.

Why is it useful to work with matching data?

Allows using the discoveryspace as a view to fetch particular data without having to perform operations on it
- Concrete example: You create a discoveryspace that is a subspace of another sampled spaced to analyze it. You can perform analysis on existing data even though no operation has been run on the new discoveryspace.
Memoization: You can understand if there are memoization opportunities that would speed up a operation on the space.

Pre-requisites: The Space Identifier

To apply this skill you need either:

(a) a space id; (b) explicit instruction to examine “the latest” space

In the case of (b) get the actual identifier:

uv run ado show related space --use-latest

Tips

Avoiding refetching YAML

ado get … -o yaml writes YAML to stdout by default. Prefer --output-file PATH (with the same -o yaml) to save it once and reuse the file instead of calling ado get repeatedly for the same resource.

Large output files

The output produced for a given -o/--output format can be very large (for example from show entities). Use --output-file with the path where the output should be saved, and when inspecting these files:

Use wc to count the file size first before using head/tail/cat etc. on it.
Use head -n1 to get column headers, this will not be large
Avoid head -n > 1 unless you have a specific need e.g. checking if file is corrupted
Avoid tail unless you have a specific need
Prefer python e.g. pandas.read_csv for any detailed analysis on the file.

Workflow

Run Step 2 and 3 first. Then steps 4,5 and 6 can be run in parallel.

Step 1: Get Space YAML

uv run ado get space SPACE_ID -o yaml --output-file SPACE_ID.yaml

Extract and summarise:

Resource identifier and metadata (name, description, labels)
sampleStoreIdentifier: the sample store backing this space
entitySpace: dimensions — property names, types (categorical / discrete / continuous), and their domains/values
experiments: actuator and experiment identifiers that define what can be measured, and which target properties each experiment produces

Step 2: Sampling coverage and related resources

Execute

uv run ado show details space SPACE_ID

This outputs two sections:

DETAILS — sampling coverage:

Total entities in the space
How many have been measured
How many have failed measurements
How many are unmeasured
How many are matching

Compare measured vs total to understand exploration progress. Compare measured vs matching to understand memoization opportunities. Also, a signal that other overlapping spaces exist.

RELATED RESOURCES — all operations and stores linked to this space.

Performance note: ado show details space is slow as it fetches and aggregates entity data. Use only when sampling coverage is needed.

Step 3: Check for existing report

Check if there is an existing report for this space in reports/<ado_context_name>/
If yes, check if either of the following are true:
- New operations have been run on space since report
- The number of measured entities has increased
If neither of above are true, ask the user if they want to write a new report or use existing
- As nothing has changed, the only purpose of creating a new report is if a different agent is being used

Step 4: Find Similar spaces

ado get space --matching-space-id SPACE_ID --details finds spaces with the same entity structure. Use this to understand research progression and why this space was created.

uv run ado get space --matching-space-id SPACE_ID --details

Step 5: Export Measurement Data

Note: Keep in mind the guidelines on large output files for the following.

uv run ado show entities space SPACE_ID \
  --include measured \
  --property-format target \
  -o csv --output-file SPACE_ID_entities.csv

This writes the data to SPACE_ID_entities.csv. If you find SPACE_ID_entities.csv already exists do not use it, as data may be stale

You can also get lists of all unmeasured or missing entities, though this is not typically required unless you want to analyse the unsampled portion.

Perform an analysis of the measurements, checking e.g. distributions of metrics, metric outliers, correlations between metrics. Take into account the domain of the experiment and meaning of metrics when looking for patterns.

Step 6: Examine Related Operations

For each related operation (output in step 2), use the examining-ado-operations skill to understand what each operation did and what it produced.

Note: Do not analyze the data in the operations, or do detailed diagnoses. Just enough for summary.

Producing a Report

Structure the report as:

Overview: What the space represents. Infer from metadata, dimensions, and experiments. Short and narrative.
- Space summary – ID, metadata, entity count, dimensions (parameters and their types/values)
- Measurement space – experiments, target properties
Related Spaces (Optional): If there are related spaces, describe them and how they relate.
Sampling coverage – sampled vs unsampled vs missing counts; progress assessment
Data summary – distributions of measured properties, notable performers, outliers, correlations
Related operations – which operations ran on this space and their status

related-skills.json

같은 저장소

using-ado-cli.md

from "IBM/ado"

Guidelines for using ado CLI commands and documenting them correctly. Use when writing documentation that includes ado commands, verifying CLI syntax, or explaining ado CLI usage patterns to users.

2026-05-0548

remote-execution.md

from "IBM/ado"

Run ado operations on remote Ray clusters using --remote execution context files. Use when the user wants to create an operation, asks about remote clusters, wants to ship local plugins or data files to a cluster, or asks about execution context YAML files. Also applies proactively when creating an operation if execution context files are present in the workspace.

2026-04-2148

examining-ado-operations.md

from "IBM/ado"

End-to-end workflow to examine and summarize an ado operation — fetch operation and space YAML, summarise configuration, export entities/requests/results to CSV, perform simple analysis, and interpret failures and data quality. Use when the user asks to summarize, analyse, debug, or review an operation; wants insights from measurement data; or provides an operation ID or asks to use --use-latest for the current operation.

2026-04-1748

examining-ado-project.md

from "IBM/ado"

Builds a picture of work in an ado project: activity volume, spaces and operations created over time, experiments and operation configs used etc. Use to create a project/context overview report, summarize what the team has been doing in an ado project, report trends across spaces/operations, or to onboard onto an ado project.

2026-04-1748

query-ado-data.md

from "IBM/ado"

Query ado metadata and measurement data using CLI commands. Use when the user needs to find resources, filter by metadata, retrieve entities and measurements, or get resource schemas. Covers metastore queries (operations, discoveryspaces, samplestores, datacontainers, actuatorconfigurations) and samplestore queries (entities and measurements).

2026-04-1748

formulate-discovery-problem.md

from "IBM/ado"

Formulates problems for execution with ado by creating discoveryspace and operation YAML files. Guides through experiment selection, space creation, validation, operation configuration, and parameterization. Use when the user wants to create discoveryspace or operation YAML files, configure experiments, set up entity spaces, or formulate research, benchmarking or search problems.

2026-04-1748

package.json

"author": "IBM"

"repository": "IBM/ado"

GitHub 저장소 열기 Creator 저장소 보기

$ install --global

$ download --local

Manus에서 실행

$ useful --forSOC

소프트웨어 개발자컴퓨터 및 수학직15-1252L4

name

examining-discovery-spaces

description

Examining ado Discovery Spaces

Structured workflow for understanding what a discoveryspace contains, how covered its entity space is, and what data has been collected.

Run all commands from the repository root with uv run.
Write the report to reports/<ado_context_name>/ (create the directory if needed)
- where ado_context_name is the active ado metastore context (uv run ado context)
Write the report as <SPACEID>_<YYYY-MM-DD>_report.md

Related skills:

For CLI verification and command spelling, see using-ado-cli.
For metastore filtering and schemas, see query-ado-data.
For examining operations run on a space, see examining-ado-operations.
For a project/context wide view (all spaces and operations), see examining-ado-project.

Context

Operations and DiscoverySpaces

discoveryspaces (or spaces for short) define a set of points (entities) and how to measure them. They also contain the results of the measurements
operations operate on discovery spaces either selecting or measuring points or analysing existing measurements

Terminology: Distinguishing Entities in a DiscoverySpace

When working with the data from a discoveryspace the following distinctions are important.

Measured: These entities have been measured by an operation on the space
Unmeasured: These entities have not been measured by an operation on the space

Matching: Data in the spaces samplestore that matches the space definition - it includes measured entities
Missing: Entities that have no matching data in the samplestore.

Why is it useful to work with matching data?

Allows using the discoveryspace as a view to fetch particular data without having to perform operations on it
- Concrete example: You create a discoveryspace that is a subspace of another sampled spaced to analyze it. You can perform analysis on existing data even though no operation has been run on the new discoveryspace.
Memoization: You can understand if there are memoization opportunities that would speed up a operation on the space.

Pre-requisites: The Space Identifier

To apply this skill you need either:

(a) a space id; (b) explicit instruction to examine “the latest” space

In the case of (b) get the actual identifier:

uv run ado show related space --use-latest

Tips

Avoiding refetching YAML

Large output files

Use wc to count the file size first before using head/tail/cat etc. on it.
Use head -n1 to get column headers, this will not be large
Avoid head -n > 1 unless you have a specific need e.g. checking if file is corrupted
Avoid tail unless you have a specific need
Prefer python e.g. pandas.read_csv for any detailed analysis on the file.

Workflow

Run Step 2 and 3 first. Then steps 4,5 and 6 can be run in parallel.

Step 1: Get Space YAML

uv run ado get space SPACE_ID -o yaml --output-file SPACE_ID.yaml

Extract and summarise:

Resource identifier and metadata (name, description, labels)
sampleStoreIdentifier: the sample store backing this space
entitySpace: dimensions — property names, types (categorical / discrete / continuous), and their domains/values
experiments: actuator and experiment identifiers that define what can be measured, and which target properties each experiment produces

Step 2: Sampling coverage and related resources

Execute

uv run ado show details space SPACE_ID

This outputs two sections:

DETAILS — sampling coverage:

Total entities in the space
How many have been measured
How many have failed measurements
How many are unmeasured
How many are matching

Compare measured vs total to understand exploration progress. Compare measured vs matching to understand memoization opportunities. Also, a signal that other overlapping spaces exist.

RELATED RESOURCES — all operations and stores linked to this space.

Performance note: ado show details space is slow as it fetches and aggregates entity data. Use only when sampling coverage is needed.

Step 3: Check for existing report

Check if there is an existing report for this space in reports/<ado_context_name>/
If yes, check if either of the following are true:
- New operations have been run on space since report
- The number of measured entities has increased
If neither of above are true, ask the user if they want to write a new report or use existing
- As nothing has changed, the only purpose of creating a new report is if a different agent is being used

Step 4: Find Similar spaces

ado get space --matching-space-id SPACE_ID --details finds spaces with the same entity structure. Use this to understand research progression and why this space was created.

uv run ado get space --matching-space-id SPACE_ID --details

Step 5: Export Measurement Data

Note: Keep in mind the guidelines on large output files for the following.

uv run ado show entities space SPACE_ID \
  --include measured \
  --property-format target \
  -o csv --output-file SPACE_ID_entities.csv

This writes the data to SPACE_ID_entities.csv. If you find SPACE_ID_entities.csv already exists do not use it, as data may be stale

You can also get lists of all unmeasured or missing entities, though this is not typically required unless you want to analyse the unsampled portion.

Step 6: Examine Related Operations

For each related operation (output in step 2), use the examining-ado-operations skill to understand what each operation did and what it produced.

Note: Do not analyze the data in the operations, or do detailed diagnoses. Just enough for summary.

Producing a Report

Structure the report as:

Overview: What the space represents. Infer from metadata, dimensions, and experiments. Short and narrative.
- Space summary – ID, metadata, entity count, dimensions (parameters and their types/values)
- Measurement space – experiments, target properties
Related Spaces (Optional): If there are related spaces, describe them and how they relate.
Sampling coverage – sampled vs unsampled vs missing counts; progress assessment
Data summary – distributions of measured properties, notable performers, outliers, correlations
Related operations – which operations ran on this space and their status

examining-discovery-spaces

Examining ado Discovery Spaces

Context

Operations and DiscoverySpaces

Terminology: Distinguishing Entities in a DiscoverySpace

Pre-requisites: The Space Identifier

Tips

Avoiding refetching YAML

Large output files

Workflow

Step 1: Get Space YAML

Step 2: Sampling coverage and related resources

Step 3: Check for existing report

Step 4: Find Similar spaces

Step 5: Export Measurement Data

Step 6: Examine Related Operations

Producing a Report

이 저장소의 다른 Skills

이 저장소의 다른 Skills

Examining ado Discovery Spaces

Context

Operations and DiscoverySpaces

Terminology: Distinguishing Entities in a DiscoverySpace

Pre-requisites: The Space Identifier

Tips

Avoiding refetching YAML

Large output files

Workflow

Step 1: Get Space YAML

Step 2: Sampling coverage and related resources

Step 3: Check for existing report

Step 4: Find Similar spaces

Step 5: Export Measurement Data

Step 6: Examine Related Operations

Producing a Report