تشغيل أي مهارة في Manus بنقرة واحدة

steering

النجوم٠

التفرعات٠

آخر تحديث١١ أبريل ٢٠٢٦ في ٢٣:٢٢

Run activation-steering feature discovery for a Hugging Face model. Use this skill whenever the user says /steering, wants to steer a model, extract a cognitive feature, generate steering vectors, run feature discovery, or produce steering artifacts. Specify a model (default: gpt2) and an optional feature name. If a feature is supplied, generate inputs, expected outputs, then run an extraction pass and output artifacts. If no feature is supplied, auto-pick one that hasn't already been extracted. If no test data is supplied, generate synthetic examples; otherwise use the user's data and fill in whatever's missing.

التثبيت

التثبيت باستخدام Codex أو Claude انسخ هذا Prompt والصقه في Codex أو Claude أو مساعد آخر ليراجع صفحة Skill ويثبّتها لك.

تشغيل في Manus

المصدر

Tyler-R-Kendrick

Tyler-R-Kendrick/agentic-metacognition

فتح مستودع GitHub عرض مستودعات المنشئ

تنزيل

تشغيل في Manus

المهن ذات الصلةSOC

استنادا إلى تصنيف SOC المهني

علماء البياناتمهن الحاسوب والرياضيات·SOC 15-2051

مستكشف الملفات

2 ملفات

SKILL.md

readonly

name

steering

description

/steering

Run activation-steering feature discovery end-to-end: specify a model and an optional feature, and this skill orchestrates data generation, vector extraction, and artifact output.

When to use this skill

The user types /steering or asks to "steer" a model
The user wants to extract a cognitive feature (chain_of_thought, few_shot_prompting, react, etc.)
The user wants to generate steering artifacts for a model
The user asks to run feature discovery or build a steering vector
The user mentions activation engineering, representation engineering, or contrastive extraction

How it works

Inputs

Parameter	Default	Description
`model_name`	`"gpt2"`	Any decoder-only Hugging Face model identifier
`feature_name`	auto-pick	A feature to extract (e.g. `chain_of_thought`). If omitted, the system picks the next undiscovered feature from the standard catalog.
`user_examples`	generate	Optional list of `FeatureExample` objects with `text` and `label` ("positive" / "negative"). If omitted, synthetic examples are generated. If only one label is present, the other is generated.
`layer_idx`	`5`	Transformer layer for hidden-state collection
`output_dir`	`None`	Directory to write plugin artifacts to

Pipeline

Resolve feature — look up in the standard catalog, or generate a new spec.
Ensure data — use user-supplied examples, fill missing labels with synthetic data, or generate all data.
Load model — load_model_and_tokenizer(model_name).
Discover vectors — discover_feature_vectors(feature_spec, layer_idx, model, tokenizer, device).
Write artifacts — write_artifact_plugin(...) to produce a distributable plugin bundle.

Output

A SteeringResult with:

feature_spec — the resolved FeatureSpec
discovered_vectors — list of DiscoveredFeatureVector (torch tensors + metadata)
artifact_dir — path to the written plugin directory (if output_dir was set)

Quick-start

from activation_steering.steering_command import SteeringRunConfig, run_steering

# Explicit feature
result = run_steering(SteeringRunConfig(
    model_name="gpt2",
    feature_name="chain_of_thought",
    output_dir="./artifacts",
))

# Auto-pick next undiscovered feature
result = run_steering(SteeringRunConfig(
    model_name="gpt2",
    output_dir="./artifacts",
))

# With user-supplied data
from activation_steering.features import FeatureExample
result = run_steering(SteeringRunConfig(
    model_name="gpt2",
    feature_name="my_custom_feature",
    user_examples=[
        FeatureExample(text="detailed reasoning before answer", label="positive"),
        FeatureExample(text="just the answer", label="negative"),
    ],
    output_dir="./artifacts",
))

Artifacts produced

<output_dir>/<model_name>/<feature_name>/
├── plugin.json          # Manifest
├── feature_specs.json   # The resolved FeatureSpec
└── controllers.json     # Discovered steering vectors

Standard catalog features (gpt2)

Feature	Category
`few_shot_prompting`	prompt_engineering
`retrieval_augmented_context`	context_engineering
`react`	cognitive_architecture
`chain_of_thought`	reasoning_strategy

Implementation

The command is implemented in activation_steering/steering_command.py and exported from activation_steering:

SteeringRunConfig — run configuration dataclass
SteeringResult — run output dataclass
run_steering(config) — main orchestrator
build_steering_feature_spec(...) — resolve or generate a FeatureSpec
pick_undiscovered_feature(...) — auto-select next feature
generate_synthetic_examples(...) — create training data

API reference

For implementation details, read activation_steering/steering_command.py.

المزيد من هذا المستودع

نفس المستودع

artifact-plugins

Tyler-R-Kendrick/agentic-metacognition

Create, load, merge, and distribute persistent artifact plugin bundles for activation steering. Use this skill when the user wants to manage steering artifacts, create distributable plugin packs, load model bundles, merge multiple plugins, write new artifact directories, or work with the plugin directory tree. Also trigger when the user mentions artifact plugins, plugin bundles, plugin manifests, controllers.json, activations.json, or the artifacts/ directory layout.

2026-04-110

feature-discovery

Tyler-R-Kendrick/agentic-metacognition

Define, discover, and manage cognitive feature specifications and steering vectors for LLM activation steering. Use this skill when the user wants to create feature specs, define extraction examples with positive/negative labels, set evaluation criteria, discover feature vectors from specs, persist discovered vectors, work with the standard feature catalog, or manage the feature lifecycle. Also trigger when the user mentions feature extraction, feature catalog, feature specs, cognitive features, or interaction features.

2026-04-110

gh-aw

Tyler-R-Kendrick/agentic-metacognition

Use when creating, compiling, validating, running, or debugging GitHub Agentic Workflows with the gh-aw CLI.

2026-04-110

graphrag

Tyler-R-Kendrick/agentic-metacognition

Build and query Neo4j-backed reasoning trajectory graphs for the hybrid meta-cognition agent. Use this skill when the user wants to persist task plans, subgoals, constraints, run states, and verifier outcomes in a Neo4j graph, retrieve evidence paths with PathRAG, or store drift-correction records. Also trigger when the user mentions Neo4j, GraphRAG, PathRAG, reasoning trajectories, graph store, evidence paths, or task plan graphs — even if they don't say "graphrag" explicitly.

2026-04-110

hybrid-agent

Tyler-R-Kendrick/agentic-metacognition

Build and run a hybrid planner/retriever/steered-executor/verifier meta-cognition agent loop with persistent steering memory. Use this skill when the user wants to create an agent that plans tasks, retrieves context, applies activation steering, verifies results, and writes back to memory. Also use when the user mentions HybridMetaCognitionAgent, SteeredExecutor, planner/verifier loops, steering controllers, agent runs, or activation traces — even if they don't say "hybrid agent" explicitly.

2026-04-110

skill-creator

Tyler-R-Kendrick/agentic-metacognition

Create new skills, modify and improve existing skills, and measure skill performance. Use when users want to create a skill from scratch, edit, or optimize an existing skill, run evals to test a skill, benchmark skill performance with variance analysis, or optimize a skill's description for better triggering accuracy.

2026-04-110

name

steering

description

/steering

Run activation-steering feature discovery end-to-end: specify a model and an optional feature, and this skill orchestrates data generation, vector extraction, and artifact output.

When to use this skill

The user types /steering or asks to "steer" a model
The user wants to extract a cognitive feature (chain_of_thought, few_shot_prompting, react, etc.)
The user wants to generate steering artifacts for a model
The user asks to run feature discovery or build a steering vector
The user mentions activation engineering, representation engineering, or contrastive extraction

How it works

Inputs

Parameter	Default	Description
`model_name`	`"gpt2"`	Any decoder-only Hugging Face model identifier
`feature_name`	auto-pick	A feature to extract (e.g. `chain_of_thought`). If omitted, the system picks the next undiscovered feature from the standard catalog.
`user_examples`	generate	Optional list of `FeatureExample` objects with `text` and `label` ("positive" / "negative"). If omitted, synthetic examples are generated. If only one label is present, the other is generated.
`layer_idx`	`5`	Transformer layer for hidden-state collection
`output_dir`	`None`	Directory to write plugin artifacts to

Pipeline

Resolve feature — look up in the standard catalog, or generate a new spec.
Ensure data — use user-supplied examples, fill missing labels with synthetic data, or generate all data.
Load model — load_model_and_tokenizer(model_name).
Discover vectors — discover_feature_vectors(feature_spec, layer_idx, model, tokenizer, device).
Write artifacts — write_artifact_plugin(...) to produce a distributable plugin bundle.

Output

A SteeringResult with:

feature_spec — the resolved FeatureSpec
discovered_vectors — list of DiscoveredFeatureVector (torch tensors + metadata)
artifact_dir — path to the written plugin directory (if output_dir was set)

Quick-start

from activation_steering.steering_command import SteeringRunConfig, run_steering

# Explicit feature
result = run_steering(SteeringRunConfig(
    model_name="gpt2",
    feature_name="chain_of_thought",
    output_dir="./artifacts",
))

# Auto-pick next undiscovered feature
result = run_steering(SteeringRunConfig(
    model_name="gpt2",
    output_dir="./artifacts",
))

# With user-supplied data
from activation_steering.features import FeatureExample
result = run_steering(SteeringRunConfig(
    model_name="gpt2",
    feature_name="my_custom_feature",
    user_examples=[
        FeatureExample(text="detailed reasoning before answer", label="positive"),
        FeatureExample(text="just the answer", label="negative"),
    ],
    output_dir="./artifacts",
))

Artifacts produced

<output_dir>/<model_name>/<feature_name>/
├── plugin.json          # Manifest
├── feature_specs.json   # The resolved FeatureSpec
└── controllers.json     # Discovered steering vectors

Standard catalog features (gpt2)

Feature	Category
`few_shot_prompting`	prompt_engineering
`retrieval_augmented_context`	context_engineering
`react`	cognitive_architecture
`chain_of_thought`	reasoning_strategy

Implementation

The command is implemented in activation_steering/steering_command.py and exported from activation_steering:

SteeringRunConfig — run configuration dataclass
SteeringResult — run output dataclass
run_steering(config) — main orchestrator
build_steering_feature_spec(...) — resolve or generate a FeatureSpec
pick_undiscovered_feature(...) — auto-select next feature
generate_synthetic_examples(...) — create training data

API reference

For implementation details, read activation_steering/steering_command.py.