aris-compute-guard

Name: Aris Compute Guard
Author: OpenLAIR

// Mandatory pre-flight compute resource check before running experiments. Detects whether local/remote GPU or compute resources are actually available. If resources are unavailable, STOPS the experiment pipeline immediately and reports to the user — preventing the model from hallucinating fake experiment results. Use when: about to run experiments, deploy training, or any GPU-intensive task.

stars:974

forks:101

updated:April 23, 2026 at 15:15

SKILL.md


name	aris-compute-guard
description	Mandatory pre-flight compute resource check before running experiments. Detects whether local/remote GPU or compute resources are actually available. If resources are unavailable, STOPS the experiment pipeline immediately and reports to the user — preventing the model from hallucinating fake experiment results. Use when: about to run experiments, deploy training, or any GPU-intensive task.
argument-hint	["environment-type"]
allowed-tools	Bash(nvidia-smi), Bash(python), Bash(ssh), Bash(echo), Bash(which), Bash(command), Read, Grep, Glob
license	MIT
metadata	{"author":"wanshuiyin/ARIS","version":"1.0.0"}

Compute Resource Guard

MANDATORY pre-flight check before any experiment execution. This skill determines whether the required compute resources are actually available. If they are not, you MUST stop immediately and inform the user — do NOT proceed to run experiments, and do NOT imagine or fabricate experiment results.

Context: $ARGUMENTS

CRITICAL RULE

If this check determines compute resources are unavailable, you MUST:

STOP all experiment execution immediately
DO NOT attempt to run any training scripts, evaluation scripts, or experiment code
DO NOT fabricate, imagine, or hallucinate any experiment results
REPORT clearly to the user what resources are missing and what they need to do
MARK the experiment task as blocked (not failed, not done)

Workflow

Step 1: Detect Target Environment

Read the project's CLAUDE.md to determine the experiment environment:

Local GPU (gpu: local): Check local CUDA/MPS
Remote server (gpu: remote): Check SSH connectivity + remote GPU
Vast.ai (gpu: vast): Check for running instances
Modal (gpu: modal): Check Modal CLI + auth (Modal is serverless — always "available" if configured)

If no CLAUDE.md exists or no gpu: setting is found, assume local environment.
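The lookup described in Step 1 could be sketched as follows. This is a minimal illustration, not the skill's actual parser: the function names, the regular expression, and the choice to fall back to local for an unrecognized value are all assumptions.

```python
import re
from pathlib import Path

# The four environment values named in Step 1.
VALID_ENVIRONMENTS = {"local", "remote", "vast", "modal"}

def read_claude_md(path="CLAUDE.md"):
    """Return the file's text, or None when the file does not exist."""
    p = Path(path)
    return p.read_text(encoding="utf-8") if p.is_file() else None

def detect_environment(claude_md_text):
    """Return the first `gpu:` value found, defaulting to 'local'."""
    if not claude_md_text:
        return "local"  # no CLAUDE.md: assume local, as Step 1 states
    match = re.search(r"^\s*[-*]?\s*gpu:\s*([A-Za-z_.-]+)",
                      claude_md_text, re.MULTILINE)
    if not match:
        return "local"  # no gpu: setting: assume local, as Step 1 states
    value = match.group(1).lower()
    # Assumption: an unrecognized value is also treated as local.
    return value if value in VALID_ENVIRONMENTS else "local"
```

A caller would typically run `detect_environment(read_claude_md())` and branch on the returned string in Step 2.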

Step 2: Check Compute Availability

For Local GPU (Linux with CUDA):

# Check if nvidia-smi exists
which nvidia-smi 2>/dev/null
# If exists, check GPU status
nvidia-smi --query-gpu=index,name,memory.used,memory.total,utilization.gpu --format=csv,noheader 2>/dev/null

Available = nvidia-smi succeeds AND at least one GPU has memory.used < 500 MiB (free).
Unavailable = nvidia-smi is not found, returns an error, or ALL GPUs have memory.used >= memory.total * 0.9.
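One way to apply these thresholds to the CSV output of the query above is sketched here. It assumes each line has the five requested columns with memory values printed as `<number> MiB`, and that GPU names contain no commas. The two rules leave a middle band unclassified (for example, 8000 MiB used of 40960 MiB); returning `"undetermined"` for that case is an assumption of this sketch, not something the skill specifies.

```python
def parse_mib(field):
    """Convert a field such as '312 MiB' to an integer number of MiB."""
    return int(field.strip().split()[0])

def classify_gpus(csv_text, free_below_mib=500, full_ratio=0.9):
    """Classify nvidia-smi CSV output as available, unavailable, or undetermined."""
    gpus = []
    for line in csv_text.strip().splitlines():
        parts = [p.strip() for p in line.split(",")]
        if len(parts) < 4:
            continue  # skip lines that do not look like a GPU row
        gpus.append((parse_mib(parts[2]), parse_mib(parts[3])))
    if not gpus:
        return "unavailable"  # no rows at all: treat like a failed query
    if any(used < free_below_mib for used, _ in gpus):
        return "available"
    if all(used >= total * full_ratio for used, total in gpus):
        return "unavailable"
    return "undetermined"  # assumption: the source does not cover this band
```

Reporting the raw used and total figures alongside the classification lets the user judge the undetermined case themselves.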

For Local GPU (Mac with MPS):

python3 -c "
import torch
mps_available = hasattr(torch.backends, 'mps') and torch.backends.mps.is_available()
print(f'MPS_AVAILABLE={mps_available}')
if mps_available:
    print('COMPUTE_OK=true')
else:
    print('COMPUTE_OK=false')
" 2>/dev/null

Available = MPS is available (Apple Silicon with PyTorch MPS support).
Unavailable = No MPS, no CUDA, pure CPU only. Warn the user that experiments will be extremely slow or may not work.

For Local CPU-only (no GPU):

# Check if any GPU framework is available
python3 -c "
import torch
cuda = torch.cuda.is_available()
mps = hasattr(torch.backends, 'mps') and torch.backends.mps.is_available()
print(f'CUDA={cuda}, MPS={mps}')
if not cuda and not mps:
    print('COMPUTE_OK=false')
    print('REASON=No GPU available (no CUDA, no MPS). CPU-only execution is not suitable for ML training experiments.')
else:
    print('COMPUTE_OK=true')
" 2>&1

If python3 or torch is not installed:

# Fallback: check for nvidia-smi directly
# Print the failure lines only when nvidia-smi is missing or returns an error
if ! nvidia-smi 2>/dev/null; then
    echo "COMPUTE_OK=false"
    echo "REASON=Neither nvidia-smi nor PyTorch found. Cannot verify GPU availability."
fi

For Remote Server (SSH):

# Check SSH connectivity (timeout 10s)
ssh -o ConnectTimeout=10 -o BatchMode=yes <server> "echo CONNECTED" 2>/dev/null
# If connected, check GPU
ssh -o ConnectTimeout=10 -o BatchMode=yes <server> "nvidia-smi --query-gpu=index,memory.used,memory.total --format=csv,noheader" 2>/dev/null

Available = SSH connects AND GPU has free memory.
Unavailable = SSH fails (server down, auth issue, network) OR no free GPU.
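The two-stage remote probe can be sketched as a pair of helpers: one builds the ssh argument list, the other combines the results of both probes. The helper names are hypothetical, and since the source does not define "free memory" for the remote case, reusing the 500 MiB threshold from the local check is an assumption.

```python
GPU_QUERY = ("nvidia-smi --query-gpu=index,memory.used,memory.total "
             "--format=csv,noheader")

def ssh_argv(server, remote_command, connect_timeout=10):
    """Build the ssh argument list; BatchMode=yes makes ssh fail instead of prompting."""
    return ["ssh", "-o", f"ConnectTimeout={connect_timeout}",
            "-o", "BatchMode=yes", server, remote_command]

def remote_status(connect_rc, connect_out, gpu_rc, gpu_out, free_below_mib=500):
    """Combine the connectivity probe and the GPU probe into (available, reason)."""
    if connect_rc != 0 or "CONNECTED" not in connect_out:
        return False, "SSH connection failed"
    if gpu_rc != 0 or not gpu_out.strip():
        return False, "nvidia-smi failed on remote server"
    for line in gpu_out.strip().splitlines():
        index, used, total = [part.strip() for part in line.split(",")]
        # Assumption: "free" means less than free_below_mib MiB in use.
        if int(used.split()[0]) < free_below_mib:
            return True, f"GPU {index} is free ({used} of {total} used)"
    return False, "No free GPU on remote server"
```

The lists returned by `ssh_argv(server, "echo CONNECTED")` and `ssh_argv(server, GPU_QUERY)` can be passed directly to `subprocess.run`, which avoids shell quoting issues with the server name.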

For Vast.ai:

# Check for running instances
cat vast-instances.json 2>/dev/null
# Or query Vast.ai API
vastai show instances 2>/dev/null

Available = A running instance exists with SSH access.
Unavailable = No running instances (need to provision one first).

For Modal (serverless):

# Check Modal CLI is installed and authenticated
modal token verify 2>/dev/null || echo "MODAL_NOT_CONFIGURED"

Available = Modal CLI installed and authenticated.
Unavailable = Modal not installed or not authenticated.

Step 3: Decision Gate

Check Result	Action
COMPUTE_OK = true	Proceed with experiment. Print brief resource summary and continue.
COMPUTE_OK = false	STOP IMMEDIATELY. Do NOT run any experiments. Go to Step 4.
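The gate can be sketched as a small parser over the `KEY=VALUE` lines that the Step 2 checks print. The function names are hypothetical. Treating a missing `COMPUTE_OK` line as a stop follows the rule that a failed check counts as unavailable.

```python
def parse_check_output(text):
    """Collect KEY=VALUE lines from a check's output into a dict."""
    fields = {}
    for line in text.splitlines():
        key, sep, value = line.strip().partition("=")
        if sep and key:
            fields[key] = value
    return fields

def decide(text):
    """Return ('proceed', '') or ('stop', reason) for one check's output."""
    fields = parse_check_output(text)
    if fields.get("COMPUTE_OK") == "true":
        return "proceed", ""
    # Anything other than an explicit true, including no output, stops the run.
    reason = fields.get("REASON", "Check did not report COMPUTE_OK=true")
    return "stop", reason
```

Only an explicit `COMPUTE_OK=true` lets the pipeline continue; every other outcome leads to Step 4.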

Step 4: Stop and Report (when compute unavailable)

When compute resources are NOT available, respond with a clear, structured message:

⚠️ COMPUTE RESOURCES UNAVAILABLE — Experiment Stopped

I checked the compute resources and they are NOT available for running experiments.

**Environment:** [local / remote / vast.ai / modal]
**Issue:** [specific reason — e.g., "No GPU detected", "SSH connection failed", "All GPUs fully occupied"]

**What you need to do:**
- [Actionable step 1 — e.g., "Ensure your machine has a CUDA-compatible GPU"]
- [Actionable step 2 — e.g., "Free up GPU memory by stopping other processes"]
- [Actionable step 3 — e.g., "Configure a remote server in CLAUDE.md"]

**Alternative options:**
- Set `gpu: modal` in CLAUDE.md to use Modal serverless GPU (no local GPU needed)
- Set `gpu: vast` in CLAUDE.md to rent an on-demand GPU from Vast.ai
- Configure a remote GPU server with `gpu: remote` in CLAUDE.md

I will NOT proceed with running experiments or generating results, as doing so without actual compute resources would produce fabricated output. Please resolve the compute issue and try again.

After this message, STOP. Do not continue with any experiment workflow steps.
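Filling in the variable fields of the report could look like the sketch below. It covers only the Environment, Issue, and action-list fields; the fixed heading, alternative options, and closing paragraph of the template are left out for brevity. The function names and the dict shape are assumptions, with the `blocked` status taken from the CRITICAL RULE.

```python
def render_report_fields(environment, issue, steps):
    """Fill in the Environment, Issue, and action-list fields of the report."""
    lines = [
        f"**Environment:** {environment}",
        f"**Issue:** {issue}",
        "",
        "**What you need to do:**",
    ]
    lines.extend(f"- {step}" for step in steps)
    return "\n".join(lines)

def blocked_task(environment, issue, steps):
    """Bundle the report with the task status: blocked, not failed or done."""
    return {
        "status": "blocked",
        "report": render_report_fields(environment, issue, steps),
    }
```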

Step 5: Proceed Summary (when compute available)

When compute IS available, print a brief summary and return control:

✅ Compute resources verified:
- Environment: [local / remote / vast.ai / modal]
- GPU: [GPU name, count, free memory]
- Status: Ready for experiments

Proceeding with experiment execution.

Integration

This skill is called automatically by:

/aris-run-experiment (Step 0, before environment detection)
/aris-experiment-bridge (Phase 0, before parsing experiment plan)

It can also be called standalone:

/aris-compute-guard
/aris-compute-guard local
/aris-compute-guard remote

Rules

NEVER skip this check. It exists to prevent wasted time and hallucinated results.
If the check itself fails (e.g., python3 not found), treat it as unavailable and report.
For gpu: modal, the check is lenient — Modal handles GPU allocation automatically. Only fail if Modal CLI is not installed/authenticated.
For CPU-only environments, warn but allow if the experiment is explicitly CPU-compatible (e.g., small-scale testing, data preprocessing).
This check should complete in under 30 seconds. If SSH times out, report as unavailable.
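The rules about check failures and timeouts can be sketched as a wrapper around `subprocess.run`. The function name is hypothetical, and applying the 30-second budget to each command rather than to the whole check is a simplifying assumption.

```python
import subprocess
import sys

def run_check(argv, timeout_s=30):
    """Run one check command; any failure to run it counts as unavailable."""
    try:
        proc = subprocess.run(argv, capture_output=True, text=True,
                              timeout=timeout_s)
    except subprocess.TimeoutExpired:
        return False, f"Check timed out after {timeout_s}s"
    except OSError as exc:
        # Covers a missing binary (e.g. python3 not found) or no permission.
        return False, f"Could not run {argv[0]}: {exc}"
    if proc.returncode != 0:
        return False, f"{argv[0]} exited with status {proc.returncode}"
    return True, proc.stdout

if __name__ == "__main__":
    # Demonstration with the current interpreter standing in for a real check.
    print(run_check([sys.executable, "-c", "print('COMPUTE_OK=true')"]))
```

Returning a reason string on every failure path gives the report step something concrete to show the user.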

related-skills.json

same repository

inno-figure-gen.md

from "OpenLAIR/dr-claw"

Generate/edit images with OpenAI gpt-image-2 by default, falling back to Gemini (gemini-3.1-flash-image-preview) when OPENAI_API_KEY is unset. Supports text-to-image + image-to-image; 1K/2K/4K; use --input-image for editing, --provider to force a provider, --model to override the model.

2026-04-26, stars:974

aris-experiment-bridge.md

from "OpenLAIR/dr-claw"

Workflow 1.5: Bridge between idea discovery and auto review. Reads EXPERIMENT_PLAN.md, implements experiment code, deploys to GPU, collects initial results. Use when user says "实现实验", "implement experiments", "bridge", "从计划到跑实验", "deploy the plan", or has an experiment plan ready to execute.

2026-04-23, stars:974

aris-run-experiment.md

from "OpenLAIR/dr-claw"

Deploy and run ML experiments on local, remote, Vast.ai, or Modal serverless GPU. Use when user says "run experiment", "deploy to server", "跑实验", or needs to launch training jobs.

2026-04-23, stars:974

drclaw.md

from "OpenLAIR/dr-claw"

Dr. Claw workspace skill for project lookup, session inspection, TaskMaster progress, OpenClaw structured schema, and event-driven reporting

2026-04-11, stars:974

ds-analysis-campaign.md

from "OpenLAIR/dr-claw"

Use when a quest needs one or more follow-up runs such as ablations, robustness checks, error analysis, or failure analysis after a main experiment.

2026-04-08, stars:974

ds-baseline.md

from "OpenLAIR/dr-claw"

Use when a quest needs to attach, import, reproduce, repair, verify, compare, or publish a baseline and its metrics.

2026-04-08, stars:974

package.json

"author": "OpenLAIR"

"repository": "OpenLAIR/dr-claw"


