Skip to main content
在 Manus 中运行任何 Skill
一键导入
$pwd:
harbor-framework
GitHub 创作者资料

harbor-framework

按仓库查看 5 个 GitHub 仓库中的 19 个已收集 skills,并展示近似职业覆盖。

已收集 skills
19
仓库
5
职业领域
3
更新
2026-05-30
仓库浏览

仓库与代表性 skills

#001
harbor
8 个 skills2.2k1.1k更新于 2026-05-30
占该创作者 42%
create-task
软件开发工程师

Create a new Harbor task for evaluating agents. Use when the user wants to scaffold, build, or design a new task, benchmark problem, or eval. Guides through instruction writing, environment setup, verifier design (pytest vs Reward Kit vs custom), and solution scripting.

2026-05-30
rewardkit
软件质量保证分析师与测试员

Write Harbor task verifiers using Reward Kit. Use when creating or editing a task's tests/ directory, adding grading criteria, setting up LLM/agent judges, or designing verifiers that produce a reward score.

2026-05-30
bundled-keep
项目管理专家

Existing task skill that should remain after job-level skill injection.

2026-05-18
runtime-proof
软件开发工程师

Write the proof file for the Harbor runtime skill injection example.

2026-05-18
publish
软件开发工程师

Publish a Harbor task or dataset to the registry. Use when the user wants to upload, publish, or share tasks or datasets/benchmarks on the Harbor registry.

2026-04-25
create-adapter
软件开发工程师

Scaffold a new Harbor benchmark adapter by running `harbor adapter init` and then guide implementation using the Adapters Agent Guide as the authoritative spec.

2026-04-19
upload-parity-experiments
软件开发工程师

Create or reuse Hugging Face dataset PRs for `harborframework/parity-experiments` and upload Harbor parity/oracle result folders efficiently with sparse checkout, raw git pushes, and Git LFS.

2026-04-10
generate-greeting
软件开发工程师计算机程序员

Generate a greeting message and write it to a file.

2026-03-01
#002
terminal-bench-3
5 个 skills208260更新于 2026-05-30
占该创作者 26%
#003
skills
3 个 skills91更新于 2026-03-17
占该创作者 16%
harbor-adapter-creator
软件开发工程师

Create Harbor benchmark adapters that convert external benchmark datasets into Harbor task format. Use when porting an existing benchmark to Harbor, running parity experiments, registering a dataset to the Harbor registry, or debugging adapter validation failures. Covers: adapter class interface (generate_task, make_local_task_id), directory layout including YAML job configs, oracle verification, parity planning and experiments, dataset registration, and the full post-implementation workflow.

2026-03-17
harbor-cli
软件开发工程师

Harbor CLI command reference and usage patterns. Covers harbor run, harbor jobs, harbor trials, harbor datasets, harbor adapters, harbor tasks, harbor view, harbor sweeps, harbor traces, harbor cache, and harbor admin commands. Use this skill whenever running Harbor evaluations, managing datasets, viewing results, debugging tasks, exporting traces, or working with any harbor CLI command. Also use when constructing harbor command lines, looking up flag names, or troubleshooting CLI errors.

2026-03-17
harbor-task-creator
软件开发工程师

Create Harbor evaluation tasks from scratch. Generates task.toml configuration, instruction.md for agents, environment/Dockerfile setup, tests/test.sh verification scripts, and solution/solve.sh reference solutions. Use this skill whenever creating, scaffolding, or authoring new Harbor benchmark tasks, evaluation environments, or agent challenges. Also use when fixing broken tasks, debugging reward file issues, or structuring multi-container evaluation environments.

2026-03-17
#004
terminal-bench-science
2 个 skills11755更新于 2026-05-20
占该创作者 11%
已展示 5 / 5 个仓库
已展示全部仓库