Run any Skill in Manus with one click

feature-discovery

Stars0

Forks0

UpdatedApril 11, 2026 at 23:22

Define, discover, and manage cognitive feature specifications and steering vectors for LLM activation steering. Use this skill when the user wants to create feature specs, define extraction examples with positive/negative labels, set evaluation criteria, discover feature vectors from specs, persist discovered vectors, work with the standard feature catalog, or manage the feature lifecycle. Also trigger when the user mentions feature extraction, feature catalog, feature specs, cognitive features, or interaction features.

Installation

Install with Codex or Claude Copy this prompt, paste it into Codex, Claude, or another assistant, and let it review the skill page and install it for you.

Run Skill in Manus

Source

Tyler-R-Kendrick

Tyler-R-Kendrick/agentic-metacognition

View GitHub Repository View Creator Repositories

Download

Run Skill in Manus

Related occupationsSOC

Based on SOC occupation classification

Data ScientistsComputer and Mathematical Occupations·SOC 15-2051

File Explorer

2 files

SKILL.md

readonly

name

feature-discovery

description

Feature Discovery

Use this skill to define reusable feature specifications, discover steering vectors from those specs, and manage feature catalogs for the activation_steering package.

When to use this skill

Creating a new FeatureSpec with labeled extraction examples
Defining EvaluationCriterion rules for judging feature extraction
Discovering steering vectors from feature specs via discover_feature_vectors
Persisting discovered vectors with save_discovered_feature_vectors
Loading or extending the standard feature catalog
Discovering dynamic interaction features from prompt/output pairs

Feature specification model

A feature spec has four components:

Identity: name, category, summary
Extraction examples: Labeled text samples (FeatureExample with text, label, metadata)
Test cases: Additional labeled examples for validation
Evaluation criteria: EvaluationCriterion rules with name, description, threshold

Creating a feature spec

from activation_steering import FeatureSpec, FeatureExample, EvaluationCriterion, build_feature_spec

spec = build_feature_spec(
    name="truthfulness",
    category="reasoning",
    summary="Measures whether the model produces truthful, evidence-based statements",
    extraction_examples=[
        FeatureExample(text="The study found a strong correlation.", label="positive"),
        FeatureExample(text="Everyone knows this is obviously true.", label="negative"),
    ],
    test_cases=[
        FeatureExample(text="According to the data, the trend is upward.", label="positive"),
        FeatureExample(text="Trust me, this always works.", label="negative"),
    ],
    evaluation_criteria=[
        EvaluationCriterion(
            name="cosine_alignment",
            description="Positive examples should have higher cosine similarity to the vector",
            threshold=0.5,
        ),
    ],
)

Loading the standard catalog

from activation_steering import get_standard_feature_specs, get_standard_feature_catalog

# Get all specs for a model
specs = get_standard_feature_specs(model_name="gpt2")

# Get the full catalog object
catalog = get_standard_feature_catalog(model_name="gpt2")

Discovering feature vectors

The discovery pipeline builds one steering vector per feature spec from its labeled examples:

from activation_steering import (
    discover_feature_vectors,
    save_discovered_feature_vectors,
    discover_and_store_feature_vectors,
)

# Step by step
vectors = discover_feature_vectors(
    feature_specs=specs,
    model_name="gpt2",
    layer_idx=5,
    model=model,
    tokenizer=tokenizer,
    device=device,
)
save_discovered_feature_vectors(vectors, output_dir="./my_vectors")

# Or all-in-one
discover_and_store_feature_vectors(
    feature_specs=specs,
    model_name="gpt2",
    layer_idx=5,
    model=model,
    tokenizer=tokenizer,
    device=device,
    output_dir="./my_vectors",
)

Each DiscoveredFeatureVector contains:

name, model_name, category, summary
layer_idx, vector (the steering tensor)
positive_example_count, negative_example_count, test_case_count
evaluation_criteria (serialized list)

Interaction feature discovery

For dynamic features learned from prompt/output pairs during agent usage:

from activation_steering import discover_interaction_features

features = discover_interaction_features(
    model_name="gpt2",
    prompt="What is the capital of France?",
    output="The capital of France is Paris.",
    model=model,
    tokenizer=tokenizer,
    device=device,
)

Each ObservedInteractionFeature is identified as interaction::{prompt_shape}__{context_usage}__{output_shape}.

Standard catalog layout

Standard feature specs live at:

activation_steering/artifacts/gpt2/standard/feature_specs.json

Use get_standard_feature_models() to list available models and load_standard_feature_catalogs() to load all catalogs at once.

API reference

For full details, read activation_steering/features.py (specs, examples, criteria, catalogs) and activation_steering/discovery.py (vector discovery, interaction features, persistence).

More from this repository

same repository

artifact-plugins

Tyler-R-Kendrick/agentic-metacognition

Create, load, merge, and distribute persistent artifact plugin bundles for activation steering. Use this skill when the user wants to manage steering artifacts, create distributable plugin packs, load model bundles, merge multiple plugins, write new artifact directories, or work with the plugin directory tree. Also trigger when the user mentions artifact plugins, plugin bundles, plugin manifests, controllers.json, activations.json, or the artifacts/ directory layout.

2026-04-110

gh-aw

Tyler-R-Kendrick/agentic-metacognition

Use when creating, compiling, validating, running, or debugging GitHub Agentic Workflows with the gh-aw CLI.

2026-04-110

graphrag

Tyler-R-Kendrick/agentic-metacognition

Build and query Neo4j-backed reasoning trajectory graphs for the hybrid meta-cognition agent. Use this skill when the user wants to persist task plans, subgoals, constraints, run states, and verifier outcomes in a Neo4j graph, retrieve evidence paths with PathRAG, or store drift-correction records. Also trigger when the user mentions Neo4j, GraphRAG, PathRAG, reasoning trajectories, graph store, evidence paths, or task plan graphs — even if they don't say "graphrag" explicitly.

2026-04-110

hybrid-agent

Tyler-R-Kendrick/agentic-metacognition

Build and run a hybrid planner/retriever/steered-executor/verifier meta-cognition agent loop with persistent steering memory. Use this skill when the user wants to create an agent that plans tasks, retrieves context, applies activation steering, verifies results, and writes back to memory. Also use when the user mentions HybridMetaCognitionAgent, SteeredExecutor, planner/verifier loops, steering controllers, agent runs, or activation traces — even if they don't say "hybrid agent" explicitly.

2026-04-110

skill-creator

Tyler-R-Kendrick/agentic-metacognition

Create new skills, modify and improve existing skills, and measure skill performance. Use when users want to create a skill from scratch, edit, or optimize an existing skill, run evals to test a skill, benchmark skill performance with variance analysis, or optimize a skill's description for better triggering accuracy.

2026-04-110

steering

Tyler-R-Kendrick/agentic-metacognition

Run activation-steering feature discovery for a Hugging Face model. Use this skill whenever the user says /steering, wants to steer a model, extract a cognitive feature, generate steering vectors, run feature discovery, or produce steering artifacts. Specify a model (default: gpt2) and an optional feature name. If a feature is supplied, generate inputs, expected outputs, then run an extraction pass and output artifacts. If no feature is supplied, auto-pick one that hasn't already been extracted. If no test data is supplied, generate synthetic examples; otherwise use the user's data and fill in whatever's missing.

2026-04-110

name

feature-discovery

description

Feature Discovery

Use this skill to define reusable feature specifications, discover steering vectors from those specs, and manage feature catalogs for the activation_steering package.

When to use this skill

Creating a new FeatureSpec with labeled extraction examples
Defining EvaluationCriterion rules for judging feature extraction
Discovering steering vectors from feature specs via discover_feature_vectors
Persisting discovered vectors with save_discovered_feature_vectors
Loading or extending the standard feature catalog
Discovering dynamic interaction features from prompt/output pairs

Feature specification model

A feature spec has four components:

Identity: name, category, summary
Extraction examples: Labeled text samples (FeatureExample with text, label, metadata)
Test cases: Additional labeled examples for validation
Evaluation criteria: EvaluationCriterion rules with name, description, threshold

Creating a feature spec

from activation_steering import FeatureSpec, FeatureExample, EvaluationCriterion, build_feature_spec

spec = build_feature_spec(
    name="truthfulness",
    category="reasoning",
    summary="Measures whether the model produces truthful, evidence-based statements",
    extraction_examples=[
        FeatureExample(text="The study found a strong correlation.", label="positive"),
        FeatureExample(text="Everyone knows this is obviously true.", label="negative"),
    ],
    test_cases=[
        FeatureExample(text="According to the data, the trend is upward.", label="positive"),
        FeatureExample(text="Trust me, this always works.", label="negative"),
    ],
    evaluation_criteria=[
        EvaluationCriterion(
            name="cosine_alignment",
            description="Positive examples should have higher cosine similarity to the vector",
            threshold=0.5,
        ),
    ],
)

Loading the standard catalog

from activation_steering import get_standard_feature_specs, get_standard_feature_catalog

# Get all specs for a model
specs = get_standard_feature_specs(model_name="gpt2")

# Get the full catalog object
catalog = get_standard_feature_catalog(model_name="gpt2")

Discovering feature vectors

The discovery pipeline builds one steering vector per feature spec from its labeled examples:

from activation_steering import (
    discover_feature_vectors,
    save_discovered_feature_vectors,
    discover_and_store_feature_vectors,
)

# Step by step
vectors = discover_feature_vectors(
    feature_specs=specs,
    model_name="gpt2",
    layer_idx=5,
    model=model,
    tokenizer=tokenizer,
    device=device,
)
save_discovered_feature_vectors(vectors, output_dir="./my_vectors")

# Or all-in-one
discover_and_store_feature_vectors(
    feature_specs=specs,
    model_name="gpt2",
    layer_idx=5,
    model=model,
    tokenizer=tokenizer,
    device=device,
    output_dir="./my_vectors",
)

Each DiscoveredFeatureVector contains:

name, model_name, category, summary
layer_idx, vector (the steering tensor)
positive_example_count, negative_example_count, test_case_count
evaluation_criteria (serialized list)

Interaction feature discovery

For dynamic features learned from prompt/output pairs during agent usage:

from activation_steering import discover_interaction_features

features = discover_interaction_features(
    model_name="gpt2",
    prompt="What is the capital of France?",
    output="The capital of France is Paris.",
    model=model,
    tokenizer=tokenizer,
    device=device,
)

Each ObservedInteractionFeature is identified as interaction::{prompt_shape}__{context_usage}__{output_shape}.

Standard catalog layout

Standard feature specs live at:

activation_steering/artifacts/gpt2/standard/feature_specs.json

Use get_standard_feature_models() to list available models and load_standard_feature_catalogs() to load all catalogs at once.

API reference

For full details, read activation_steering/features.py (specs, examples, criteria, catalogs) and activation_steering/discovery.py (vector discovery, interaction features, persistence).