Skip to main content
Run any Skill in Manus
with one click
GitHub repository

skillsbench

skillsbench contains 208 collected skills from benchflow-ai, with repository-level occupation coverage and site-owned skill detail pages.

skills collected
208
Stars
1.4k
updated
2026-06-16
Forks
318
Occupation coverage
39 occupation categories · 100% classified
repository explorer

Skills in this repository

skillsbench
software-developers

SkillsBench contribution workflow. Use when: (1) Creating benchmark tasks, (2) Understanding repo structure, (3) Preparing PRs for task submission.

2026-06-16
task-creator
software-developers

SkillsBench task authoring — walk a contributor from idea to submission-ready task following CONTRIBUTING.md and the task-implementation rubric. Use when the user wants to create a new SkillsBench task, scaffold a task from an existing workflow (notebook, Excel workbook, document, dataset), convert a prompt or a benchmark item into a SkillsBench task, write skills for a task, or prepare a SkillsBench PR. Pairs with `task-review` (run that as a self-check before submitting).

2026-06-16
task-review
software-quality-assurance-analysts-and-testers

SkillsBench task PR review — classifies the task track (standard / research / multimodal), runs static policy checks against the track-specific rubric, benchmarks the task across oracle plus Claude and Codex (with and without skills), audits trajectories for cheating and skill invocation, and produces a `pr-N-task-timestamp-run.txt` review report alongside a `prN.zip` bundle of trajectories. Use when reviewing a SkillsBench task PR (by number, branch, or local task path), when the user asks to review a task, run benchmarks on a PR, audit a submission, classify a task as research or multimodal track, or prepare a comment to post on a SkillsBench PR.

2026-06-16
contract-playbook-review
paralegals-and-legal-assistants-232011lawyers

Methodology for clause-by-clause review of a contract against a structured deviation policy ("playbook"). Covers how to walk a playbook, locate the matching provision in the contract, apply rule types (max-value, must-be-present, must-be-absent, acceptable-set, must-have-feature), classify the result (ok / risk / reject), choose the prescribed action, and ground each finding in a verbatim excerpt. Use whenever reviewing any contract — NDA, MSA, vendor DD questionnaire, lease, DPA — against a structured rules-based playbook.

2026-06-11
nda-clause-taxonomy
paralegals-and-legal-assistants-232011lawyers

Reference for the standard clauses found in commercial non-disclosure agreements (mutual and one-way) — what each clause does, the surface forms it appears in, and how to recognise it in unfamiliar drafting. Use when reviewing, comparing, or extracting provisions from any confidentiality / NDA / mutual NDA / standstill-and-confidentiality agreement.

2026-06-11
xlsx-parsing
software-developers

Read Microsoft Excel (.xlsx) files robustly with `openpyxl` (or `pandas`). Covers multi-sheet workbooks, header rows, empty cells, merged cells, comma-separated list cells, and converting a sheet to a list-of-dicts the rest of your code can consume. Use when a task input or reference document is an `.xlsx` file rather than JSON/CSV.

2026-06-11
hierarchical-taxonomy-clustering
software-developers

Build unified multi-level category taxonomy from hierarchical product category paths from any e-commerce companies using embedding-based recursive clustering with intelligent category naming via weighted word frequency analysis.

2026-06-08
d3-visualization
software-developers

Build deterministic, verifiable data visualizations with D3.js (v6). Generate standalone HTML/SVG (and optional PNG) from local data files without external network dependencies. Use when tasks require charts, plots, axes/scales, legends, tooltips, or data-driven SVG output.

2026-06-05
dialogue-graph
software-developers

A library for building, validating, visualizing, and serializing dialogue graphs. Use this when parsing scripts or creating branching narrative structures.

2026-06-05
senior-java
software-developers

World-class Java and Spring Boot development skill for enterprise applications, microservices, and cloud-native systems. Expertise in Spring Framework, Spring Boot 3.x, Spring Cloud, JPA/Hibernate, and reactive programming with WebFlux. Includes project scaffolding, dependency management, security implementation, and performance optimization.

2026-06-05
senior-data-engineer
software-developers

World-class data engineering skill for building scalable data pipelines, ETL/ELT systems, real-time streaming, and data infrastructure. Expertise in Python, SQL, Spark, Airflow, dbt, Kafka, Flink, Kinesis, and modern data stack. Includes data modeling, pipeline orchestration, data quality, streaming quality monitoring, and DataOps. Use when designing data architectures, building batch or streaming data pipelines, optimizing data workflows, or implementing data governance.

2026-06-05
reflow-machine-maintenance-guidance
electrical-and-electronic-engineering-technologists-and-technicians

This skill should be considered when you need to answer reflow machine maintenance questions or provide detailed guidance based on thermocouple data, MES data or defect data and reflow technical handbooks. This skill covers how to obtain important concepts, calculations, definitions, thresholds, and others from the handbook and how to do cross validations between handbook and datasets.

2026-06-05
image-editing
software-developers

Comprehensive command-line tools for modifying and manipulating images, such as resize, blur, crop, flip, and many more.

2026-06-05
object-counter
software-developers

Count occurrences of an object in the image using computer vision algorithm.

2026-06-05
ffmpeg-audio-processing
software-developers

Extract, normalize, mix, and process audio tracks - audio manipulation and analysis

2026-06-05
ffmpeg-format-conversion
software-developers

Convert media files between formats - video containers, audio formats, and codec transcoding

2026-06-05
ffmpeg-media-info
software-developers

Analyze media file properties - duration, resolution, bitrate, codecs, and stream information

2026-06-05
ffmpeg-video-editing
software-developers

Cut, trim, concatenate, and split video files - basic video editing operations

2026-06-05
ffmpeg-video-filters
software-developers

Apply video filters - scale, crop, watermark, speed, blur, and visual effects

2026-06-05
text-to-speech
software-developers

Practical mastering steps for TTS audio: cleanup, loudness normalization, alignment, and delivery specs.

2026-06-05
data-reconciliation
software-developers

Recover missing spreadsheet values from row and column totals, percentage shares, year-over-year changes, CAGR relationships, and cross-sheet constraints.

2026-06-05
comp3-packed-decimal
software-developers

Reference for COBOL COMP-3 (packed decimal / BCD) numeric storage -- layout on disk, declaring it in the FD, and pitfalls when carrying the decoded value through working storage and into later arithmetic. Useful when an input record holds a balance, rate, or carry-forward as packed decimal rather than a printable digit string.

2026-06-04
ebcdic-overpunch-decoding
software-developers

Reference for the EBCDIC "overpunch" / zoned-decimal sign convention where the units position of a numeric field is replaced with a letter that encodes both a digit and a sign. Useful when reading mainframe-style fixed-length tapes whose amount fields appear as digits followed by a letter (e.g. "0000000000123D" or "0000000000045M").

2026-06-04
gl-posting-codes
software-developers

Background on general-ledger batch posting codes (HD/DR/CR/RV plus signed "other" rows). Reference only -- follow the task instruction for the exact rules, which may differ per account type.

2026-06-04
gnucobol-mainframe-batch
software-developers

GnuCOBOL toolchain pointers for mainframe-style batch programs (LINE/RECORD SEQUENTIAL files, PIC clauses, cobc build flags).

2026-06-04
paired-reversal-cancellation
software-developers

Reference for paired-reversal handling on GL batch tapes: an RV row whose ref-trace points at an earlier row cancels BOTH legs (the RV and the row it references) — but ONLY when the two value-dates fall within the shop's reversal settlement window; an RV that references a too-old posting is NOT a cancellation and posts one-sided. Useful when the posting tape carries a reference-trace column alongside reversal codes and the running balance is off by twice the reversal magnitude, or by a whole leg on stale references.

2026-06-04
triangulated-fx-rates
software-developers

Reference for multi-hop / triangulated currency conversion on batch rate tapes. When a rate row carries an explicit "via" currency, the row's rate is only one leg of the conversion, and the via currency may ITSELF be quoted through another via — so the effective rate is the product of every leg, resolved by walking the chain until a direct quote is reached. Useful when an FX feed lists some pairs directly to the reporting currency and others as triangulated (or doubly-triangulated) quotes through vehicle currencies.

2026-06-04
fragmentation-aware-packing
software-developers

Choose placements that preserve useful residual capacity. Use for bin packing, GPU sharing, accelerator placement, and multi-resource scheduling where stranded capacity hurts future fit.

2026-05-30
multi-resource-allocation-validation
software-developers

Validate and repair proposed resource allocations by replaying them against temporary capacity. Use when actions consume several resource dimensions such as CPU, memory, GPUs, or accelerators.

2026-05-30
online-resource-scheduling
software-developers

Design deterministic online scheduling policies from current observations. Use when assigning arriving work to limited resources without seeing future requests.

2026-05-30
threejs
software-developers

Three.js scene-graph parsing and export workflows: mesh baking, InstancedMesh expansion, part partitioning, per-link OBJ export, and URDF articulation.

2026-05-27
pymatgen
software-developers

Materials science toolkit. Crystal structures (CIF, POSCAR), phase diagrams, band structure, DOS, Materials Project integration, format conversion, for computational materials science.

2026-05-21
attitude-controller-planner
electrical-engineers

Use this skill when implementing the inner control loop for a quadrotor — attitude (roll/pitch/yaw) PID control and attitude planning (converting desired acceleration to desired Euler angles). Covers gain layout, integral reset pattern, and the attitude planner inverse kinematics.

2026-05-05
flight-plan-parser
aerospace-engineers-172011

Use this skill when converting natural language flight commands into waypoints and timing for a drone simulator. Covers parsing commands like "Take off to X m height in Y seconds", "Hover at X m height for Y seconds", "Fly from (x,y,z) to (x',y',z') in T seconds", and "Land from X m height in Y seconds" into structured (4×n) waypoint arrays and segment mode lists.

2026-05-05
motor-model-dynamics
aerospace-engineers-172011

Use this skill when simulating quadrotor physical dynamics — mapping desired thrust/moments to individual motor RPMs via a propeller allocation matrix, applying first-order motor lag, and integrating the nonlinear equations of motion (translational and rotational) using RK45.

2026-05-05
plot-quadrotor
aerospace-engineers-172011

Use this skill when visualising drone simulation results. Produces three matplotlib figures — desired vs actual trajectories, instantaneous error, and cumulative absolute error — for all 5 state groups (position, orientation, velocity, angular velocity, acceleration). Saves figures to a plots/ directory automatically.

2026-05-05
position-controller-trajectory-planner
aerospace-engineers-172011

Use this skill when implementing the outer control loop for a quadrotor — position PID control (position/velocity error → thrust and desired acceleration) and trajectory planning from flight-plan waypoints (takeoff, hover, fly, land segments → smooth 15-row state matrix).

2026-05-05
stepinfo-3d
aerospace-engineers-172011

Use this skill when computing 3D step-response performance metrics for point-to-point drone flight — rise time, settling time, percent overshoot, and steady-state error based on Euclidean distance to the final target. Use instead of 1D stepinfo for any flight where all three position axes move simultaneously.

2026-05-05
transaction-concurrency-control-foundations
database-administrators

Provides database transaction concurrency-control background. Use when reasoning about transactions, commits and aborts, serializability, isolation anomalies, conflicts, dependencies, and protocol families such as locking, timestamp ordering, OCC, MVCC, SSI, TicToc, hybrid protocols.

2026-05-05
transaction-protocol-reasoning
database-administrators

Teaches how to reason from a transaction protocol’s rules to a working model. Use when analyzing a concurrency-control paper, spec, or algorithm to identify state, metadata, invariants, operation rules, examples, counterexamples, guarantees, false aborts, unsafe commits, or tradeoffs before inspecting concrete traces.

2026-05-05
Showing top 40 of 208 collected skills in this repository.