Run any Skill in Manus with one click

$pwd:

trial-costs

Name: Trial Costs
Author: harbor-framework

// Sum agent-trial spend ($) from TB3 PR comments and Modal compute spend. Reports totals by kind (/run vs /cheat) and provider (anthropic, openai, ...), top-spender PRs, per-task latest trial cost, and daily Modal billing. Use when the user asks how much was spent on agent trials, cost breakdown by provider, PR-level spend, or Modal usage.

Run Skill in Manus

$ git log --oneline --stat

stars:208

forks:260

updated:May 16, 2026 at 21:46

SKILL.md

readonly

name	trial-costs
description	Sum agent-trial spend ($) from TB3 PR comments and Modal compute spend. Reports totals by kind (/run vs /cheat) and provider (anthropic, openai, ...), top-spender PRs, per-task latest trial cost, and daily Modal billing. Use when the user asks how much was spent on agent trials, cost breakdown by provider, PR-level spend, or Modal usage.
allowed-tools	Bash

Run the trial-costs tool and summarize the output for the user.

Default invocation (last 7 days):

uv run tools/trial-costs/trial_costs.py

Flags:

--days N — change window (default 7)
--since YYYY-MM-DD — explicit start date
--top-threshold N — top-spender PR threshold in dollars (default 100)
--no-modal — skip the Modal billing report at the end

Pass through whatever window the user asked for. If they didn't specify, default to 7 days.

After running, summarize for the user:

Grand total and split by /run vs /cheat
Per-provider totals
Top-spender PRs above the threshold (with titles)
Per-task latest /run trial cost table (one row per PR)

Keep the summary tight — the script already prints formatted tables; don't re-format them all, just lift the key numbers and mention any notable patterns (e.g. one PR responsible for a large fraction).

related-skills.json

same repository

convert-separate-verifier.md

from "harbor-framework/terminal-bench-3"

Convert a TB3 task from Harbor's shared verifier mode (default) to separate verifier mode. Use when the user asks to "convert this task to separate verifier", "make the verifier run in its own container", or asks about Harbor's separate-verifier environment for a specific task.

2026-05-30208

deep-review-task.md

from "harbor-framework/terminal-bench-3"

Run the sandboxed deep-review tool against a benchmark task PR — launches Claude Code in Docker with pre-fetched PR artifacts and writes review-summary.md / issues-found.md

2026-05-16208

tb3-status.md

from "harbor-framework/terminal-bench-3"

Generate TB3 review status report and open in browser

2026-05-13208

update-rubric.md

from "harbor-framework/terminal-bench-3"

Propose new rubric criteria based on review findings — creates a branch and PR

2026-04-10208

package.json

"author": "harbor-framework"

"repository": "harbor-framework/terminal-bench-3"

View GitHub Repository View Creator Repositories

$ install --global

$ download --local

Run Skill in Manus

$ useful --forSOC

Cost EstimatorsBusiness and Financial Operations Occupations13-1051L4

name	trial-costs
description	Sum agent-trial spend ($) from TB3 PR comments and Modal compute spend. Reports totals by kind (/run vs /cheat) and provider (anthropic, openai, ...), top-spender PRs, per-task latest trial cost, and daily Modal billing. Use when the user asks how much was spent on agent trials, cost breakdown by provider, PR-level spend, or Modal usage.
allowed-tools	Bash

Run the trial-costs tool and summarize the output for the user.

Default invocation (last 7 days):

uv run tools/trial-costs/trial_costs.py

Flags:

--days N — change window (default 7)
--since YYYY-MM-DD — explicit start date
--top-threshold N — top-spender PR threshold in dollars (default 100)
--no-modal — skip the Modal billing report at the end

Pass through whatever window the user asked for. If they didn't specify, default to 7 days.

After running, summarize for the user:

Grand total and split by /run vs /cheat
Per-provider totals
Top-spender PRs above the threshold (with titles)
Per-task latest /run trial cost table (one row per PR)

trial-costs

More from this repository

More from this repository