Run any Skill in Manus with one click

$pwd:

run-bench

Name: Run Bench
Author: tv-labs

// Execute the benchmark harness under benchmarks/ and produce a markdown report comparing this build against the recorded baseline (and optionally against Luerl and PUC-Lua). Flag regressions over a configurable threshold. Use when the user asks for a benchmark run, after any executor or codegen change in Direction B, or when a Direction B plan's verification calls for it. STATUS: stub. Will be filled in when Direction B begins. The benchee harness is already in benchmarks/ (added in PR #143).

Run Skill in Manus

$ git log --oneline --stat

stars:167

forks:8

updated:May 4, 2026 at 19:44

SKILL.md

readonly

name

run-bench

description

Execute the benchmark harness under benchmarks/ and produce a markdown report comparing this build against the recorded baseline (and optionally against Luerl and PUC-Lua). Flag regressions over a configurable threshold. Use when the user asks for a benchmark run, after any executor or codegen change in Direction B, or when a Direction B plan's verification calls for it. STATUS: stub. Will be filled in when Direction B begins. The benchee harness is already in benchmarks/ (added in PR #143).

run-bench (stub)

This skill is a placeholder. It will be fleshed out when Direction B starts.

What's already in place (don't re-build)

benchmarks/ directory with benchee scripts: closures.exs, fibonacci.exs, oop.exs, string_ops.exs, table_ops.exs. (PR #143)
Comparison against Luerl and PUC-Lua via luerl and luaport deps.

What this skill needs to add when Direction B starts

A standard "run all benchmarks" entry point (probably a mix task or shell script).
Output to bench/results/<date>-<sha>.md so we have a history.
A baseline stored at bench/baseline.md that gets updated explicitly (not on every run).
A diff renderer: takes two result sets, produces a markdown table showing percent change, flags regressions over a threshold (default 20%).
Convention for capturing PR-relevant numbers in the PR body.

Until then

Run benchmarks manually:

mix run benchmarks/fibonacci.exs
mix run benchmarks/closures.exs
mix run benchmarks/oop.exs
mix run benchmarks/string_ops.exs
mix run benchmarks/table_ops.exs

Capture stdout and paste into the PR description. Until a baseline file exists, "no regression" is judged by re-running before and after on the same machine.

related-skills.json

same repository

triage-suite-failure.md

from "tv-labs/lua"

Diagnose a failing Lua 5.3 official test suite file. Isolates the failure, classifies it, decides whether to fix-now or defer, and produces either a unit test + plan file (for fix-now) or a deferred-with-comment skip (for out-of-scope). Use this when investigating a specific suite file (`literals.lua`, `bitwise.lua`, etc.), when a /next-plan ship reveals downstream suite failures, when /suite-status shows new failures, or when the user asks "why is X failing". This skill does not ship code itself — it produces the artifacts (plan file, unit test, or skip tag) that other plans then ship via ship-a-plan.

2026-05-27167

ship-a-plan.md

from "tv-labs/lua"

Execute one plan file from .agents/plans/ as a single PR against main. Reads the plan, verifies preconditions, implements only what's in scope, runs full validation, opens a PR, and updates the plan file's status. Stops before merging — review is human-gated. Use this skill when the user invokes /next-plan, asks to "ship the next plan", "start plan A1" or similar, or when picking up a specific plan file by id. One plan = one PR = one issue = one merge to main. Do not batch multiple plans into a single PR.

2026-05-07167

package.json

"author": "tv-labs"

"repository": "tv-labs/lua"

View GitHub Repository View Creator Repositories

$ install --global

$ download --local

Run Skill in Manus

$ useful --forSOC

Software DevelopersComputer and Mathematical Occupations15-1252L4

name

run-bench

description

run-bench (stub)

This skill is a placeholder. It will be fleshed out when Direction B starts.

What's already in place (don't re-build)

benchmarks/ directory with benchee scripts: closures.exs, fibonacci.exs, oop.exs, string_ops.exs, table_ops.exs. (PR #143)
Comparison against Luerl and PUC-Lua via luerl and luaport deps.

What this skill needs to add when Direction B starts

A standard "run all benchmarks" entry point (probably a mix task or shell script).
Output to bench/results/<date>-<sha>.md so we have a history.
A baseline stored at bench/baseline.md that gets updated explicitly (not on every run).
A diff renderer: takes two result sets, produces a markdown table showing percent change, flags regressions over a threshold (default 20%).
Convention for capturing PR-relevant numbers in the PR body.

Until then

Run benchmarks manually:

mix run benchmarks/fibonacci.exs
mix run benchmarks/closures.exs
mix run benchmarks/oop.exs
mix run benchmarks/string_ops.exs
mix run benchmarks/table_ops.exs

Capture stdout and paste into the PR description. Until a baseline file exists, "no regression" is judged by re-running before and after on the same machine.

run-bench

run-bench (stub)

What's already in place (don't re-build)

What this skill needs to add when Direction B starts

Until then

More from this repository

run-bench (stub)

What's already in place (don't re-build)

What this skill needs to add when Direction B starts

Until then

More from this repository