一键在 Manus 中运行任何 Skill

library-evaluation-integration

Create evaluation scripts and integration tests for Python scientific libraries in the digitalmodel package. Follows the established pattern from fluids, ht, meshio, sectionproperties, and pygmt evaluations.

在 Manus 中运行

概览

安装命令

npx skills add https://github.com/vamseeachanta/workspace-hub --skill library-evaluation-integration

复制此命令并粘贴到 Claude Code 中以安装该技能

来源

vamseeachanta/workspace-hub

星标10

分支6

更新时间2026年6月1日 22:37

SKILL.md

readonly

name	library-evaluation-integration
description	Create evaluation scripts and integration tests for Python scientific libraries in the digitalmodel package. Follows the established pattern from fluids, ht, meshio, sectionproperties, and pygmt evaluations.
tags	["digitalmodel","evaluation","integration-test","scientific-python"]
triggers	["evaluate a library in digitalmodel","create integration tests for a library","add a new library evaluation script"]

Library Evaluation + Integration Test Pattern

Context

The digitalmodel repo (a sibling checkout, e.g. $WORKROOT/digitalmodel where $WORKROOT holds your repo checkouts) has a standard pattern for evaluating and testing scientific Python library integrations:

Evaluation script: scripts/integrations/<lib>_evaluation.py
Integration tests: tests/test_<lib>_integration.py

Workflow (4 phases)

Phase 1: API Discovery (CRITICAL — don't skip)

Before writing ANY code, probe the actual installed API:

uv run python -c "import <lib>; print(<lib>.__version__)"
uv run python -c "import <lib>.<submodule>; print(dir(<lib>.<submodule>))"
uv run python -c "help(<lib>.<submodule>.<function>)"

Why: Library APIs change between versions. The ht.insulation module completely changed its API surface in v1.2.0 (no more R_value functions, now k_material/nearest_material). Always verify what's actually importable before writing imports.

Phase 2: Compute Reference Values

Run each function with representative inputs and record the actual output:

uv run python -c "
from <lib>.<mod> import <func>
result = <func>(arg1, arg2)
print(f'result = {result}')
"

Why: Setting test assertions from textbook expectations can fail. Example: a subsea pipeline U-value calculated to 0.29 W/m²/K which is realistic but was below the initial bound of 0.5. Always compute first, then set bounds around the computed value.

Phase 3: Test Edge Cases Interactively

Different functions handle edge cases differently — test before asserting:

# Test Re=0, Pr=0, T=T2, empty inputs, etc.
uv run python -c "
try:
    result = func(edge_case_args)
    print(f'Returns: {result}')
except Exception as e:
    print(f'Raises: {type(e).__name__}: {e}')
"

Pitfall discovered: In ht library, Nu_conv_internal(Re=0) raises ValueError but Nu_cylinder_Churchill_Bernstein(Re=0) returns 0.3. Can't assume uniform edge-case behavior across submodules.

Phase 4: Write Files

Evaluation Script Structure

#!/usr/bin/env python3
"""<Library Name> — Evaluation Script.

Demonstrates <lib> integration for offshore/engineering workflows.
Library: <url> (v<version>, <license>)
"""

import <lib>
from <lib>.<submod> import <func>

def demo_capability_1():
    """Description with engineering context."""
    print("=" * 65)
    print("1. CAPABILITY NAME")
    print("=" * 65)
    # Scenario description, calculations, formatted output
    print()

# ... more demo functions ...

def main():
    print("*" * 65)
    print(f"  <Library> — Evaluation Script")
    print(f"  Version: {<lib>.__version__}")
    print("*" * 65)
    demo_capability_1()
    # ...
    print("  Evaluation complete.")

if __name__ == "__main__":
    main()

Integration Test Structure

"""<Library> integration tests.

Library: <url> (v<version>, <license>)
Tests: import checks, known-value verification, edge cases, physics sanity.
All values in SI units.
"""
import math
import pytest

<lib> = pytest.importorskip("<lib>")
from <lib>.<submod> import <func>

class TestImportAndVersion:
    def test_import(self): ...
    def test_version(self): ...
    def test_submodules_importable(self): ...

class TestCapability1:
    def test_known_value(self):
        """Compare against pre-computed reference value."""
        result = func(args)
        assert result == pytest.approx(REFERENCE, rel=1e-2)

    def test_monotonicity(self):
        """Physical quantity increases/decreases with parameter."""

    def test_edge_case(self):
        """Re=0, T=0, empty input, etc."""

    def test_physics_sanity(self):
        """Nu > 0, 0 <= eff <= 1, R > 0, etc."""

Test Categories (aim for 15+ tests)

Import/version (2-3 tests): importorskip, version check, submodules
Known-value verification (1 per capability): pre-computed reference values
Monotonicity/physics (1-2 per capability): Nu increases with Re, etc.
Edge cases (2-3 total): zero inputs, extreme values, domain errors
Unit consistency (1-2): dimensional analysis checks
Integration/end-to-end (1-2): combine multiple functions into realistic workflow

Pitfalls

Always use uv run — never bare python3 (project policy)
pytest.approx with rel tolerance — use rel=1e-2 for engineering correlations, rel=1e-6 for analytical formulas, abs for zero-valued results
Don't guess assertion bounds — compute the value first, then verify it makes physical sense, THEN set the test bounds around it
API drift — when user says "check what's available", always probe with dir() and help() before writing imports
Randomized test ordering — the repo uses pytest-randomly; tests must be independent

同仓库更多 Skills

同仓库

flywheel-closeout

vamseeachanta/workspace-hub

Use this at the end of substantial repo or agent waves to convert evidence-backed lessons into proposed durable assets: skills, scripts, rules/checks, prompt templates, docs, or issues. Always use it when the user mentions flywheel, wave closeout, repo ecosystem learning, durable asset promotion, or learning-to-tools.

2026-06-0310

blender-worktree-test-hardening

vamseeachanta/workspace-hub

Recover and harden digitalmodel Blender automation work in isolated worktrees when uv/editable dependency paths break and local machines lack a Blender executable.

2026-06-0110

digitalmodel-orcawave-orcaflex-proof-workflows

vamseeachanta/workspace-hub

Class-level digitalmodel OrcaWave/OrcaFlex readiness, semantic-proof, fixture-proof, and closeout workflows.

2026-06-0110

digitalmodel-worktree-test-execution-with-shared-venv

vamseeachanta/workspace-hub

Run digitalmodel tests from isolated worktrees without uv editable-dependency failures by using the main repo's existing virtualenv and PYTHONPATH.

2026-06-0110

orcaflex-reporting-fixture-proof-pattern

vamseeachanta/workspace-hub

Build and extend fixture-backed OrcaFlex reporting proof paths in digitalmodel using stable metadata baselines, normalized HTML snapshots, and reusable reporting test helpers.

2026-06-0110

hermes-memory-bridge

vamseeachanta/workspace-hub

Architecture and scripts for syncing Hermes memory into git-tracked .claude/memory/ so all machines get context via git pull. Covers quality gate, drift detection, topic mirroring, and cron automation.

2026-06-0110

来源

vamseeachanta

vamseeachanta/workspace-hub

打开 GitHub 仓库查看创作者相关仓库

安装命令

下载

在 Manus 中运行

适用职业SOC

软件质量保证分析师与测试员计算机与数学类职业15-1253L4

name	library-evaluation-integration
description	Create evaluation scripts and integration tests for Python scientific libraries in the digitalmodel package. Follows the established pattern from fluids, ht, meshio, sectionproperties, and pygmt evaluations.
tags	["digitalmodel","evaluation","integration-test","scientific-python"]
triggers	["evaluate a library in digitalmodel","create integration tests for a library","add a new library evaluation script"]

Library Evaluation + Integration Test Pattern

Context

Evaluation script: scripts/integrations/<lib>_evaluation.py
Integration tests: tests/test_<lib>_integration.py

Workflow (4 phases)

Phase 1: API Discovery (CRITICAL — don't skip)

Before writing ANY code, probe the actual installed API:

uv run python -c "import <lib>; print(<lib>.__version__)"
uv run python -c "import <lib>.<submodule>; print(dir(<lib>.<submodule>))"
uv run python -c "help(<lib>.<submodule>.<function>)"

Phase 2: Compute Reference Values

Run each function with representative inputs and record the actual output:

uv run python -c "
from <lib>.<mod> import <func>
result = <func>(arg1, arg2)
print(f'result = {result}')
"

Phase 3: Test Edge Cases Interactively

Different functions handle edge cases differently — test before asserting:

# Test Re=0, Pr=0, T=T2, empty inputs, etc.
uv run python -c "
try:
    result = func(edge_case_args)
    print(f'Returns: {result}')
except Exception as e:
    print(f'Raises: {type(e).__name__}: {e}')
"

Pitfall discovered: In ht library, Nu_conv_internal(Re=0) raises ValueError but Nu_cylinder_Churchill_Bernstein(Re=0) returns 0.3. Can't assume uniform edge-case behavior across submodules.

Phase 4: Write Files

Evaluation Script Structure

#!/usr/bin/env python3
"""<Library Name> — Evaluation Script.

Demonstrates <lib> integration for offshore/engineering workflows.
Library: <url> (v<version>, <license>)
"""

import <lib>
from <lib>.<submod> import <func>

def demo_capability_1():
    """Description with engineering context."""
    print("=" * 65)
    print("1. CAPABILITY NAME")
    print("=" * 65)
    # Scenario description, calculations, formatted output
    print()

# ... more demo functions ...

def main():
    print("*" * 65)
    print(f"  <Library> — Evaluation Script")
    print(f"  Version: {<lib>.__version__}")
    print("*" * 65)
    demo_capability_1()
    # ...
    print("  Evaluation complete.")

if __name__ == "__main__":
    main()

Integration Test Structure

"""<Library> integration tests.

Library: <url> (v<version>, <license>)
Tests: import checks, known-value verification, edge cases, physics sanity.
All values in SI units.
"""
import math
import pytest

<lib> = pytest.importorskip("<lib>")
from <lib>.<submod> import <func>

class TestImportAndVersion:
    def test_import(self): ...
    def test_version(self): ...
    def test_submodules_importable(self): ...

class TestCapability1:
    def test_known_value(self):
        """Compare against pre-computed reference value."""
        result = func(args)
        assert result == pytest.approx(REFERENCE, rel=1e-2)

    def test_monotonicity(self):
        """Physical quantity increases/decreases with parameter."""

    def test_edge_case(self):
        """Re=0, T=0, empty input, etc."""

    def test_physics_sanity(self):
        """Nu > 0, 0 <= eff <= 1, R > 0, etc."""

Test Categories (aim for 15+ tests)

Import/version (2-3 tests): importorskip, version check, submodules
Known-value verification (1 per capability): pre-computed reference values
Monotonicity/physics (1-2 per capability): Nu increases with Re, etc.
Edge cases (2-3 total): zero inputs, extreme values, domain errors
Unit consistency (1-2): dimensional analysis checks
Integration/end-to-end (1-2): combine multiple functions into realistic workflow

Pitfalls

Always use uv run — never bare python3 (project policy)
pytest.approx with rel tolerance — use rel=1e-2 for engineering correlations, rel=1e-6 for analytical formulas, abs for zero-valued results
Don't guess assertion bounds — compute the value first, then verify it makes physical sense, THEN set the test bounds around it
API drift — when user says "check what's available", always probe with dir() and help() before writing imports
Randomized test ordering — the repo uses pytest-randomly; tests must be independent