generate-asset-actions

Updated March 25, 2026 at 02:28

Generate asset-actions.yaml from ASSETS.yaml by classifying assets into priority tiers. Use when the user asks to regenerate, update, or refresh the asset actions.

Source: UKGovernmentBEIS/inspect_evals

SKILL.md

name	generate-asset-actions
description	Generate asset-actions.yaml from ASSETS.yaml by classifying assets into priority tiers. Use when the user asks to regenerate, update, or refresh the asset actions.

Generate Asset Policy

Regenerate internal/audits/asset-actions.yaml and internal/audits/audit-summary.md from ASSETS.yaml.

If ASSETS.yaml may be stale, run uv run python tools/generate_asset_manifest.py first.

Run uv run python tools/summarise_asset_manifest.py to get aggregate counts (by type, by state, totals). Use these numbers when populating audit-summary.md.
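The two commands above can be sequenced with a small helper. This is only a sketch: the command lines are quoted from this skill, while `plan_commands` and `run_all` are hypothetical names introduced for illustration and are not part of the repository.

```python
import subprocess

# Commands quoted from the steps above; run them from the repository root.
GENERATE = ["uv", "run", "python", "tools/generate_asset_manifest.py"]
SUMMARISE = ["uv", "run", "python", "tools/summarise_asset_manifest.py"]


def plan_commands(manifest_may_be_stale: bool) -> list:
    """Return the commands to run, in order (hypothetical helper)."""
    commands = [GENERATE] if manifest_may_be_stale else []
    commands.append(SUMMARISE)
    return commands


def run_all(commands: list) -> None:
    """Run each command in turn, stopping at the first failure."""
    for command in commands:
        subprocess.run(command, check=True)
```

Calling `run_all(plan_commands(manifest_may_be_stale=True))` would regenerate the manifest before summarising it.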

Classification

Read ASSETS.yaml. For each asset, determine the target stage first, then the priority. Process every asset with state: floating, plus any asset with state: pinned that matches a known-unstable source: its target is controlled, so it has not yet reached its target stage.

Target stages (per ADR-0007)

The target stage depends on host reliability, not asset type:

controlled (Stage 2): any asset where upstream has broken before, the maintainer is unresponsive or the source is deprecated, OR the host is unreliable (personal repos, Google Drive, .edu domains, university servers, any host without version control). This applies to git_clone, direct_url, and huggingface alike.
pinned (Stage 1): assets on reliable, version-controlled hosts (GitHub, HuggingFace, well-known CDNs) with no history of breakage.

Per ADR-0007: "Anything hosted on a less reliable domain (personal websites, Google Drive, university servers, or any host without version control) should skip straight to Stage 2."
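The target-stage rule can be sketched as a small function. This is a minimal illustration under stated assumptions, not the repository's implementation: `KNOWN_UNSTABLE` is an abbreviated copy of the registry, the host-suffix heuristic is an assumption, and the `personal_repo` flag stands in for a judgment that cannot be read off a URL.

```python
from urllib.parse import urlparse

# Abbreviated, lower-cased sample of the known-unstable registry (assumption).
KNOWN_UNSTABLE = ("xlang-ai/osworld", "openai/evals", "corebench.cs.princeton.edu")

# Assumed heuristic for hosts that ADR-0007 treats as unreliable.
UNRELIABLE_HOST_SUFFIXES = ("drive.google.com", ".edu")


def hostname(source: str) -> str:
    """Extract the host from a URL or a bare 'host/path' string."""
    parsed = urlparse(source if "//" in source else "//" + source)
    return (parsed.hostname or "").lower()


def target_stage(source: str, personal_repo: bool = False) -> str:
    """Return 'controlled' (Stage 2) or 'pinned' (Stage 1) for a source."""
    lowered = source.lower()
    if any(entry in lowered for entry in KNOWN_UNSTABLE):
        return "controlled"  # upstream has broken before
    if personal_repo or hostname(source).endswith(UNRELIABLE_HOST_SUFFIXES):
        return "controlled"  # unreliable host skips straight to Stage 2
    return "pinned"  # reliable, version-controlled host
```

Note that asset type never enters the decision, which mirrors the rule that the target depends on host reliability only.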

Priority tiers

Urgent: floating refs on reliable hosts that match neither the High nor the Medium criteria. Target is pinned.
High: matches a known-unstable source (see the Known-Unstable Sources registry). Target is controlled.
Medium: unreliable host (drive.google.com, .edu domains, personal repos/websites) not already in the known-unstable registry. Target is controlled.

For assets with state: pinned and a {SHA} placeholder but no checksum, classify as Low (target: pinned with checksum).

Omit assets already at their target stage.

Every entry needs: eval, source, type, state, target, action, reason.
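The tier rules and the required entry fields can be sketched together. This is an illustration only: the `known_unstable`, `unreliable_host`, and `checksum` keys are assumed to have been computed or read beforehand and are not confirmed fields of ASSETS.yaml, and the action and reason strings are placeholders.

```python
from typing import Optional, Tuple


def classify(asset: dict) -> Optional[Tuple[str, dict]]:
    """Return (tier, entry) for an asset, or None if it is already at its target stage."""
    state = asset["state"]
    source = asset["source"]
    if asset.get("known_unstable") and state in ("floating", "pinned"):
        tier, target = "high", "controlled"
        action, reason = "mirror to a controlled location", "matches known-unstable registry"
    elif state == "floating" and asset.get("unreliable_host"):
        tier, target = "medium", "controlled"
        action, reason = "mirror to a controlled location", "unreliable host"
    elif state == "floating":
        tier, target = "urgent", "pinned"
        action, reason = "pin to a fixed revision", "floating ref on a reliable host"
    elif state == "pinned" and "{SHA}" in source and not asset.get("checksum"):
        tier, target = "low", "pinned"
        action, reason = "add a checksum", "pinned without checksum"
    else:
        return None  # already at target stage, so omit it
    # Every entry carries the seven required fields.
    entry = {key: asset[key] for key in ("eval", "source", "type", "state")}
    entry.update(target=target, action=action, reason=reason)
    return tier, entry
```

The known-unstable check comes first so that a pinned asset from the registry is still reported as High rather than silently omitted.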

Known-Unstable Sources

Update this list when new instability is discovered.

| Source | Eval | Incident |
| --- | --- | --- |
| `xlang-ai/OSWorld` | osworld | Files removed (PR #958) |
| `openai/evals` | makemesay | Deprecated upstream |
| `corebench.cs.princeton.edu` | core_bench | University server, no versioning |
| `epatey/fonts` | osworld | Personal repo |
| `ShishirPatil/gorilla` | bfcl | Data format issues (PR #954) |
| `yunx-z/MLRC-Bench` | mlrc_bench | Broken task |
| `LRudL/sad` | sad | Upstream bugs (issues #7, #8) |
| `meg-tong/sycophancy-eval` | sycophancy | Invalid JSON/NaN, workaround in code |
| `josancamon/paperbench` | paperbench | Paper ID mismatch (HF discussion #2) |
| `sentientfutures/moru-benchmark` | moru | Exact duplicate rows |
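One way to make the registry machine-checkable is a lookup table keyed by source. The data below is copied from the registry above; the `find_incident` helper and its case-insensitive substring matching are assumptions made for illustration.

```python
from typing import Optional, Tuple

# Registry rows: source -> (eval, incident), copied from the list above.
KNOWN_UNSTABLE = {
    "xlang-ai/OSWorld": ("osworld", "Files removed (PR #958)"),
    "openai/evals": ("makemesay", "Deprecated upstream"),
    "corebench.cs.princeton.edu": ("core_bench", "University server, no versioning"),
    "epatey/fonts": ("osworld", "Personal repo"),
    "ShishirPatil/gorilla": ("bfcl", "Data format issues (PR #954)"),
    "yunx-z/MLRC-Bench": ("mlrc_bench", "Broken task"),
    "LRudL/sad": ("sad", "Upstream bugs (issues #7, #8)"),
    "meg-tong/sycophancy-eval": ("sycophancy", "Invalid JSON/NaN, workaround in code"),
    "josancamon/paperbench": ("paperbench", "Paper ID mismatch (HF discussion #2)"),
    "sentientfutures/moru-benchmark": ("moru", "Exact duplicate rows"),
}


def find_incident(source: str) -> Optional[Tuple[str, str]]:
    """Return (eval, incident) if the source matches a registry key, else None."""
    lowered = source.lower()
    for key, row in KNOWN_UNSTABLE.items():
        if key.lower() in lowered:  # assumed matching rule: case-insensitive substring
            return row
    return None
```

A match here is what places an asset in the High tier, so a new incident only needs a new row in the table.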

Verification

asset-actions.yaml parses as valid YAML
Every floating asset in ASSETS.yaml appears in urgent, high, or medium
floating_assets + needing_checksums + no_action_needed == total_external_assets
Numbers in audit-summary.md match output of summarise_asset_manifest.py
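The coverage and count checks can be sketched over already-parsed data; loading asset-actions.yaml with any YAML parser covers the first check. The layout assumed here (top-level urgent, high, and medium lists of entries, and a flat dict of counts) is a guess at the file structure, not something this skill specifies.

```python
def verify(assets: list, actions: dict, counts: dict) -> list:
    """Return a list of problems; an empty list means the checks pass."""
    problems = []

    # Collect every (eval, source) pair listed under the three floating tiers.
    listed = {
        (entry["eval"], entry["source"])
        for tier in ("urgent", "high", "medium")
        for entry in (actions.get(tier) or [])
    }
    for asset in assets:
        key = (asset["eval"], asset["source"])
        if asset["state"] == "floating" and key not in listed:
            problems.append(f"floating asset missing from tiers: {key}")

    # floating_assets + needing_checksums + no_action_needed == total_external_assets
    total = (
        counts["floating_assets"]
        + counts["needing_checksums"]
        + counts["no_action_needed"]
    )
    if total != counts["total_external_assets"]:
        problems.append(
            f"counts sum to {total}, expected {counts['total_external_assets']}"
        )
    return problems
```

Returning a list of problems rather than raising on the first one makes it easier to fix everything in a single pass.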
