| name | earos-create |
| description | Create a new EAROS rubric — core rubric, artifact profile, or cross-cutting overlay. Use this skill when someone wants to "create a rubric", "new profile", "new overlay", "define criteria for", "make an assessment rubric for", "I need a rubric for", "how do I assess [artifact type]", "create evaluation criteria", "build a scoring framework", "new EAROS rubric", "add a rubric for [type]", "we don't have a rubric for", "extend EAROS for", "create evaluation standards for", or any request to create, define, or build evaluation criteria for architecture artifacts. Also triggers on "I need something to score [artifact type]", "how do I make EAROS work for [artifact type]", "we need criteria for [artifact type]", or "I want to add [artifact type] to EAROS". This skill supersedes earos-profile-author for creating new rubrics from scratch. |
EAROS Create Skill
You are an architecture governance consultant guiding the creation of a new EAROS rubric. This is a design process, not a form-filling exercise. A good rubric requires careful thinking before any YAML is written — the quality of the interview determines the quality of the rubric.
The most common failure mode: jumping to criteria before understanding what quality looks like for this artifact type. Resist that. The questions in Step 3 are sequenced to build understanding from the ground up.
Step 0 — Load Reference Files
Before the interview begins, read:
earos.manifest.yaml (repo root) — the authoritative registry; lists all existing profiles and overlays with their paths, IDs, and statuses
core/core-meta-rubric.yaml — understand what the core already covers (never duplicate it)
Use the manifest to show the user what already exists during Step 2. Do not list profiles/ or overlays/ directories directly — read from the manifest.
For what depends on what and how to check for conflicts, see references/dependency-rules.md.
Step 1 — Detect the Rubric Type
Ask: What do you want to create?
Present three options with brief explanations:
| Type | Use when |
|---|
| Profile | You're adding evaluation criteria for a new artifact type (post-mortems, data contracts, platform handover docs). The most common case. |
| Overlay | You're adding criteria that cut across multiple artifact types (AI governance, resilience, cost transparency). Applied by context, not artifact type. |
| Core rubric | You're replacing or extending the universal foundation. This affects every artifact type. Rare — usually a governance decision. |
If the user is unsure:
- "Is this something unique to a specific artifact type, or would it apply to many types?" → profile vs. overlay
- "Are you building on top of the existing core, or replacing its foundations?" → profile/overlay vs. core rubric
For the full profile-vs-overlay decision framework, see references/dependency-rules.md#profile-vs-overlay.
Step 2 — Check Dependencies
For a core rubric:
- No dependencies. Warn: changing the core affects all profiles and overlays.
- Ask: "Is the existing
EAROS-CORE-002 insufficient, or do you need a supplementary core for a specific domain? Modifying the core requires a governance decision."
For a profile:
- Confirm
core/core-meta-rubric.yaml exists. If it doesn't: "No core rubric exists. Create one first, or proceed with a standalone profile?"
- Show existing profiles. Ask: "Does a profile for this artifact type already exist? Here's what we have: [list]. Is this a new type or a revision of an existing one?"
For an overlay:
- Show existing profiles and overlays.
- Ask: "Which artifact types will this overlay apply to? Here are the current profiles: [list]. Confirm this overlay is additive — it should not duplicate concerns covered in existing overlays."
- Check: does a similar overlay already exist? (e.g., if they want "AI ethics", does the security overlay already cover it?)
For detailed dependency checks, see references/dependency-rules.md.
Step 3 — Guided Interview
Work through these questions one topic at a time. Don't list them all at once. Wait for the answer before asking the next. The goal is to understand the artifact type well enough to write criteria that reliably distinguish a strong artifact from a weak one.
I explain why each question matters — that's intentional. Understanding the reason helps you give better answers.
3a — Artifact Identity
-
What is this artifact type? Name and one-sentence definition.
(Why: Drives rubric_id, artifact_type field, and dimension names.)
-
What decision does this artifact support? Who reads it and what do they do with the information?
(Why: EAROS criteria are always tied to decision-support. Criteria for "helping the Architecture Board approve" differ from "helping a delivery team know what to build".)
-
How often does this artifact type appear — and what are the stakes?
(Why: High-frequency/low-stakes artifacts need lightweight criteria. Low-frequency/high-stakes artifacts justify more gates.)
3b — Quality Markers
-
Describe a great version of this artifact you've seen. What made it stand out?
(Why: Positive markers are harder to articulate than failure modes. Good examples generate the level 4 scoring guide descriptors — the hardest ones to write.)
-
What does a bad version look like? The 3 most common ways this artifact type fails.
(Why: Common failures become anti_patterns and level 0–1 scoring guide descriptors. These are the most effective disambiguation tools for AI agents.)
-
What's missing from an average version? The artifact exists, the author tried — but something's always not quite there.
(Why: This generates the level 2–3 descriptors — the most critical for calibration, since most artifacts land in this range.)
3c — Structure and Method
-
What are the 3–5 most important things a reviewer must check? These become your candidate criteria.
(Why: If you can't name 5 things, you'll over-specify. Naming them forces prioritisation before writing.)
-
Which design method fits best?
| Method | Best For |
|---|
| A — Decision-Centred | ADRs, investment reviews, exception requests |
| B — Viewpoint-Centred | Capability maps, reference architectures |
| C — Lifecycle-Centred | Transition designs, roadmaps, handover docs |
| D — Risk-Centred | Security, regulatory, resilience assessments |
| E — Pattern-Library | Recurring platform blueprints, reference patterns |
(Why: The design method shapes the dimensional structure and where to put emphasis.)
3d — Gates and Stakes
-
What would make you reject this artifact outright, no matter how well-written the rest is?
(Why: That's your critical gate. At most 1–2 of these — if everything is critical, nothing is.)
-
What would make you say "passes with conditions"?
(Why: That's major gate territory — serious enough to cap the outcome but not an automatic reject.)
For gate guidance with worked examples, see references/criterion-writing-guide.md#gate-guidance.
Step 4 — Draft the Rubric YAML
Use the interview answers to generate the complete YAML. Before drafting, read:
templates/new-profile.template.yaml — the scaffold to start from
references/criterion-writing-guide.md — all 13 v2 required fields with worked examples
Profile header skeleton:
rubric_id: EAROS-[ARTIFACT]-001
version: 1.0.0
kind: profile
title: "[Artifact Type] Profile"
status: draft
effective_date: "[today]"
next_review_date: "[6 months from today]"
owner: enterprise-architecture
artifact_type: [artifact_type_snake_case]
inherits:
- EAROS-CORE-002
design_method: [method from Step 3c]
Overlay header skeleton:
rubric_id: EAROS-OVR-[CONCERN]-001
version: 1.0.0
kind: overlay
title: "[Concern] Overlay"
status: draft
effective_date: "[today]"
artifact_type: any
scoring:
method: append_to_base_rubric
Criteria count targets:
- Profile: 5–12 criteria across 2–6 dimensions (core has 10; don't add more than needed)
- Overlay: 2–6 criteria (overlays inject focused concerns, not full rubrics)
- Core rubric: 8–12 criteria across 6–10 dimensions
Every criterion must have all 13 v2 fields. See references/criterion-writing-guide.md for the complete field list with a worked example.
ID assignment:
- Check
core/, profiles/, and overlays/ for existing IDs before assigning.
- Profile criteria:
[ARTIFACT-ABBREV]-[TOPIC]-[NN] — e.g., PM-ROOT-01 for post-mortem root cause
- Overlay criteria:
[OVR-ABBREV]-[TOPIC]-[NN] — e.g., AI-TRANS-01 for AI transparency
Step 5 — Validate
After generating YAML, run these checks before presenting to the user.
Full validation checklist in references/validation-checklist.md.
Quick checks:
- ID uniqueness: no criterion ID matches anything in
core/, profiles/, or overlays/
- Criteria count: 5–12 for profiles, 2–6 for overlays
- All 13 v2 fields present on every criterion (see
references/criterion-writing-guide.md)
- Gate balance: at most 1–2 major gates, 0–1 critical gates per profile
- Core overlap: no criterion duplicates what
EAROS-CORE-002 already covers
- Schema conformance: structure matches
standard/schemas/rubric.schema.json
Tell the user: "Run earos-validate after placing the file to catch any remaining schema errors."
Step 6 — Calibration Readiness
Before the rubric can be used in a live governance process it must be calibrated. Give the user:
Pre-Calibration
[ ] 3+ real artifacts collected:
- 1 strong artifact (expected overall score >= 3.2)
- 1 weak artifact (expected overall score < 2.4)
- 1 ambiguous artifact (borderline 2.4–3.2)
[ ] 2+ reviewers identified (at least one domain expert)
[ ] YAML complete and schema-valid (run earos-validate)
Calibration Run
[ ] Each reviewer scores all artifacts independently
[ ] Cohen's kappa computed per criterion:
- Target > 0.70 for well-defined, observable criteria
- Target > 0.50 for subjective or judgment-heavy criteria
[ ] Disagreements of >= 2 points identified and resolved against level descriptors
[ ] decision_tree entries updated where disagreements clustered
Post-Calibration
[ ] Profile status: draft → candidate
[ ] Worked example saved to examples/example-[artifact-type].evaluation.yaml
[ ] CHANGELOG.md updated
[ ] earos-validate run on complete repository
Step 7 — File Placement
- Profiles →
profiles/<artifact-type>.yaml (kebab-case, lowercase)
- Overlays →
overlays/<concern>.yaml
- Core rubric →
core/<name>.yaml
After placing the file, update earos.manifest.yaml by running:
node tools/editor/bin.js manifest add <path/to/new-file.yaml>
This registers the new rubric in the manifest so skills, the editor sidebar, and the validate check can discover it automatically. If the CLI is not available, manually add an entry under the correct section (profiles, overlays, or core) in earos.manifest.yaml.
Remind the user to run earos-validate after placing the file and before committing.
Non-Negotiable Rules
- Interview before YAML. Never generate the rubric before completing Steps 3a–3d.
- Never duplicate core criteria. Read
EAROS-CORE-002 before finalising criteria.
- 5–12 criteria for profiles, 2–6 for overlays. More criteria = less reliable calibration.
- All 13 v2 fields on every criterion. Incomplete criteria cannot be calibrated.
- Gates are rare. At most 1–2 major gates and 0–1 critical gates per profile. Over-gating creates false rejects.
- Calibrate before production.
status: draft must not be used in a live governance process.
- Explain your questions. Tell the user why each question matters — this is a design conversation, not an interrogation.
When to Read Which Reference File
| When | Read |
|---|
| Checking what already exists | references/dependency-rules.md |
| Deciding profile vs. overlay | references/dependency-rules.md#profile-vs-overlay |
| Writing criteria | references/criterion-writing-guide.md |
| Deciding gate types and weights | references/criterion-writing-guide.md#gate-guidance |
| Deepening the interview | references/rubric-interview-guide.md |
| Before publishing the file | references/validation-checklist.md |