Crawl repository PRs, issues, and review comments to distill institutional knowledge into a shared knowledge base. Run periodically by "context agents" to maintain agent_artefacts/repo_context/REPO_CONTEXT.md. Trigger only on specific request.

2026-07-02

check-trajectories-workflow

软件开发工程师

Use Inspect Scout to analyze agent trajectories from evaluation log files. Runs default and custom scanners to detect external failures, formatting issues, reward hacking, and ethical refusals. Use when user asks to check/analyze agent trajectories. Trigger when the user asks you to run the "Check Agent Trajectories" workflow.

2026-07-02

ci-maintenance-workflow

软件质量保证分析师与测试员

CI and GitHub Actions maintenance workflows — fix a failing test from a CI URL, fix a failing smoke test, add @pytest.mark.slow markers to slow tests, or review a PR against agent-checkable standards. Use when user asks to fix a failing test, fix a smoke test, mark slow tests, or review a PR. Trigger when the user asks you to run the "Write a PR For A Failing Test", "Fix A Failing Smoke Test", "Mark Slow Tests", or "Review PR According to Agent-Checkable Standards" workflow.

2026-07-02

code-quality-fix-all

软件开发工程师

Fix code quality issues identified in a code quality review stored in agent_artefacts/code_quality/<topic>/. Systematically addresses issues found by the code-quality-review-all skill for ANY code quality topic, with validation and testing at each step. Use when user asks to fix issues from a code quality review, or asks to fix issues from agent_artefacts/code_quality/<topic>.

2026-07-02

code-quality-review-all

软件质量保证分析师与测试员

Review all evaluations in the repository against a single code quality standard. Checks ALL evals against ONE standard for periodic quality reviews. Use when user asks to review/audit/check all evaluations for a specific topic or standard. Do NOT use for reviewing a single eval (use eval-quality-workflow instead) or for test coverage (use ensure-test-coverage instead).

2026-07-02

eval-quality-workflow

软件质量保证分析师与测试员

Fix or review a single evaluation against all EVALUATION_CHECKLIST.md standards. Use "fix" mode to refactor an eval into compliance, or "review" mode to assess compliance without making changes. Use when user asks to fix, review, or check an evaluation's quality. Trigger when the user asks you to run the "Fix An Evaluation" or "Review An Evaluation" workflow. Do NOT use for reviewing ALL evals against a single code quality standard (use code-quality-review-all instead).

2026-07-02

eval-report-workflow

软件开发工程师

Create an evaluation report for a README by selecting models, estimating costs, running evaluations, and formatting results tables. Use when user asks to make/create/generate an evaluation report. Trigger when the user asks you to run the "Make An Evaluation Report" workflow.

2026-07-02

generate-asset-actions

软件开发工程师

Generate asset-actions.yaml from ASSETS.yaml by classifying assets into priority tiers. Use when the user asks to regenerate, update, or refresh the asset actions.

2026-07-02

当前展示该仓库 Top 8 / 17 个已收集 skills。

#002

inspect_ai

4 个 skills2.4k614更新于 2026-03-17

占该创作者 16%

skill

职业分类

描述

更新

disk-usage

网络与计算机系统管理员

Analyze disk space usage, filesystem mounts, and storage allocation on Linux systems. Identifies large files and directories, checks partition usage, and reports inode consumption. Use when the user asks about disk full errors, free space, storage usage, du/df output, finding large files, or checking which directories consume the most space.

2026-03-17

network-info

网络与计算机系统管理员

Gather network configuration and connectivity details on Linux including interfaces, IP addresses, routing tables, DNS settings, and listening ports. Use when the user asks about IP configuration, network interfaces, connection issues, DNS resolution, open ports, routing, or network troubleshooting.

2026-03-17

system-info

网络与计算机系统管理员

Retrieve detailed Linux system information including OS distribution, kernel version, CPU model and core count, memory usage, and uptime. Use when the user asks about system specs, hardware details, RAM, processor info, kernel version, or needs a host inventory summary.

2026-03-17

secret-code

软件开发工程师

Retrieve a secret code by reading a bundled asset file and executing a companion script. Use when the user asks to reveal, decode, or look up the secret code from this skill's assets.

2026-03-17

#003

sandbox_escape_bench

2 个 skills276更新于 2026-02-03

占该创作者 8.0%

skill

职业分类

描述

更新

inspect-ai

数据科学家

Analyze Inspect AI evaluation logs, understand EvalLog structure, extract samples, events, and scoring data using dataframes

2026-02-03

inspect-scout

软件开发工程师

Analyze AI agent transcripts using Inspect Scout scanners, grep patterns, and LLM-based analysis

2026-02-03

#004

vllm-lens

1 个 skills11711更新于 2026-04-14

占该创作者 4.0%

skill

职业分类

描述

更新

scientific-debug

软件开发工程师

Use this skill whenever the user asks to debug something, fix a bug, troubleshoot an issue, or mentions "scientific debugging". Any debugging request should use this skill.

2026-04-14

#005

MEES-Test-Automation

1 个 skills01更新于 2026-07-08

占该创作者 4.0%

skill

职业分类

描述

更新

update-test-cases

软件质量保证分析师与测试员

Use when: updating test cases in General_TestCases.csv, syncing test cases from spec files, adding new test cases to CSV, documenting automated tests, checking for duplicate test cases, updating test case results or status. Maintains Documentation/Test Cases/General_TestCases.csv in sync with Playwright .spec.ts test files. Use for: 'update test cases', 'sync test cases', 'add test case to CSV', 'document this test', 'record test cases from this file'.

2026-07-08

已展示 5 / 5 个仓库

已展示全部仓库