---
name: ds-critique
description: Use this skill to run the Splunk Dashboard Slop Test against an existing dashboard.json or dashboard.xml and write critique.md with a blunt verdict (PASS / MIXED / SLOP) and a per-criterion scorecard. Narrower and more opinionated than ds-review — it only checks the design-principles Slop Test, reflex defaults, and absolute bans. Use when the user asks "is this dashboard AI slop?", "does this pass the Slop Test?", or wants a design-principles-only audit before ds-polish.
---
# ds-critique — Run the Slop Test against a dashboard
## When to use

- The user asks: "is this dashboard AI slop?", "does this pass the Slop Test?", "critique this dashboard".
- Before `ds-polish` — to get a verdict on how much work polish will have to do.
- After `ds-create` on a dashboard you suspect hit a template default — to confirm the suspicion before changing anything.
## The stance

This skill does not review accessibility, performance, or schema correctness. It answers one question:

> If someone said "an AI made this" — would an SRE, SOC analyst, or VP believe them immediately?

If yes → the dashboard is slop, and the report says so in those words. No euphemism, no diplomatic hedging. The design principles in `ds-ref-design-principles` are the scoring rubric.
## How ds-critique differs from ds-review and ds-polish

Three skills operate on an existing dashboard. They are not interchangeable:

| Skill | Lens | Mutates? | Output |
|---|---|---|---|
| `ds-review` | Broad audit — panel count, viz appropriateness, drilldowns, tokens, accessibility, SPL perf | No | `review.md` (findings, fixes suggested) |
| `ds-critique` | Narrow — Slop Test, reflex defaults, absolute bans | No | `critique.md` (verdict + scorecard) |
| `ds-polish` | Narrow — same rubric as critique, but applies fixes | Yes | mutated dashboard + `polish-report.md` |
Typical routing:

- User asks "is this good?" → `ds-review` (general audit).
- User asks "is this AI slop?" → `ds-critique` (Slop Test only).
- User says "fix it" or "apply design principles" → `ds-polish`.

`ds-critique` and `ds-polish` share the same rubric — both are derived from `ds-ref-design-principles`. Critique is the read-only variant; polish is the write variant.
## Input / output contract

Input (one of):

- A workspace path containing `build/dashboard.json`.
- A direct file path to `dashboard.json` or `dashboard.xml`.

Output:

- `critique.md` written alongside the dashboard (or in the workspace root if inside a workspace).
- No mutation of the dashboard.
- A one-line verdict echoed to the user: `Verdict: SLOP (4/13 passed)`.
## Required context

Critique is interpretive. Before scoring, confirm:

- **Archetype** — a dashboard criticized for "no hero KPI" must be one where a hero is expected. Exec summaries expect heroes; dense NOC grids do not.
- **Theme** — several criteria depend on whether the dashboard is dark, dark-NOC, or light.

If either is unknown, ask the user before running the scorecard. Do not guess — guessing produces a dishonest verdict.
## What ds-critique scores

Thirteen checks, detailed below. Each check states what to DETECT, what EVIDENCE to record, and how to call the VERDICT.
## The scorecard

Each criterion produces one of three outcomes:

- PASS — evidence is present and correct.
- FAIL — evidence is absent or wrong. Cite what's broken.
- N/A — criterion does not apply to this archetype (e.g., the hero-KPI criterion on a dense NOC grid).
### Check 1: Archetype committed

- DETECT: Layout matches one of the four archetypes in `ds-ref-design-principles` (executive summary, operational, analytical, SOC). A dashboard that mixes KPI-heavy exec layout with a detail table AND a geographic SOC map AND a correlation scatter is uncommitted.
- EVIDENCE: Panel count, viz-type mix, panel proportions. Match against the archetype fingerprint.
- VERDICT: PASS if the panel mix matches a single archetype. FAIL if it reads as two archetypes fighting.
### Check 2: Theme derived from audience

- DETECT: `layout.options.backgroundColor` matches the theme expected from the audience (dark for ops/SOC/analytical, light for executive print/PDF, dark-NOC for a 24/7 wall).
- EVIDENCE: Background hex value. Cross-reference with the archetype declared in Check 1.
- VERDICT: PASS if theme and archetype align. FAIL if a light canvas sits on a NOC dashboard, or a dark canvas on an exec PDF.
### Check 3: Canvas backgroundColor set

- DETECT: `layout.options.backgroundColor` is present AND is not the Splunk default grey (`#EDEDF4`, `#F4F4F4`, or absent).
- EVIDENCE: The exact hex value, or "absent".
- VERDICT: PASS if set to `#0b0c0e`, `#000000`, `#FAFAF7`, or a deliberate brand color. FAIL if absent, default, or generic.
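A minimal sketch of this detection, assuming the usual Dashboard Studio JSON shape (`layout.options.backgroundColor` at the top level of `dashboard.json`); the hex values are the ones named in this check:

```python
# Check 3 sketch: flag an absent or default-grey canvas background.
DEFAULT_GREYS = {"#EDEDF4", "#F4F4F4"}  # Splunk defaults named by the check

def check_canvas(dashboard: dict) -> tuple:
    """Return (verdict, evidence) for the canvas backgroundColor check."""
    bg = dashboard.get("layout", {}).get("options", {}).get("backgroundColor")
    if bg is None:
        return ("FAIL", "absent")
    if bg.upper() in DEFAULT_GREYS:
        return ("FAIL", bg)
    # Anything else is treated as deliberate (#0b0c0e, #000000, #FAFAF7,
    # or a brand color); whether it is actually deliberate stays a human call.
    return ("PASS", bg)
```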
### Check 4: KPI row has semantic polarity

- DETECT: For each `splunk.singlevalue` in a KPI row, `majorColor` is either (a) threshold-colored via DOS, or (b) a semantic-palette static hex chosen from the polarity rules in `ds-ref-design-principles`.
- EVIDENCE: Per-KPI `majorColor` values. List them.
- VERDICT: PASS if polarity is applied to status metrics. FAIL if every KPI shares the same static color (classic `#006D9C` tell).
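The uniform-color tell can be scanned for mechanically. This sketch assumes DOS-bound `majorColor` values serialize as strings starting with `>` while static colors are plain hexes; verify that against the actual JSON before trusting the split:

```python
# Check 4 sketch: collect per-KPI majorColor values and flag the
# classic uniform-static-color tell.
def kpi_polarity(visualizations: dict) -> tuple:
    """Return ("PASS" | "FAIL", list of majorColor values) for Check 4."""
    colors = [
        viz.get("options", {}).get("majorColor")
        for viz in visualizations.values()
        if viz.get("type") == "splunk.singlevalue"
    ]
    # Assumption: dynamic (DOS) colors start with ">", static ones do not.
    static = [c for c in colors if isinstance(c, str) and not c.lstrip().startswith(">")]
    uniform = len(colors) > 1 and len(static) == len(colors) and len(set(static)) == 1
    return ("FAIL" if uniform else "PASS", colors)
```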
### Check 5: KPI row has visual hierarchy

- DETECT: Not every KPI in a row has identical `w × h`. One anchor KPI is visibly larger (≥ 1.5×), OR the row uses deliberately uniform sizing because the archetype demands it (NOC status grid).
- EVIDENCE: List panel dimensions. Note whether the archetype justifies uniformity.
- VERDICT: PASS if an anchor exists, or if uniform sizing fits the archetype (N/A applies here). FAIL if uniform-by-default on an archetype that expects hierarchy.
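Assuming an absolute layout where each `layout.structure` item carries a `position` block with `w`/`h`, the evidence for this check can be listed mechanically; whether uniformity is justified stays a human judgment:

```python
def kpi_areas(dashboard: dict) -> list:
    """Check 5 sketch: (viz_id, w*h) per single value, largest first,
    so the >= 1.5x anchor rule can be read off the top two entries."""
    vizzes = dashboard.get("visualizations", {})
    areas = [
        (item["item"],
         item.get("position", {}).get("w", 0) * item.get("position", {}).get("h", 0))
        for item in dashboard.get("layout", {}).get("structure", [])
        if vizzes.get(item.get("item"), {}).get("type") == "splunk.singlevalue"
    ]
    return sorted(areas, key=lambda pair: -pair[1])
```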
### Check 6: Every table has a drilldown

- DETECT: Every `splunk.table` has either `drilldown.link`, `drilldown.setToken`, or a custom action.
- EVIDENCE: Per-table drilldown status.
- VERDICT: PASS if all tables link out. FAIL on any dead-end table.
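A sketch of the dead-end scan, assuming drilldown actions serialize into each visualization's `eventHandlers` array (the usual Dashboard Studio JSON shape for `drilldown.*` handlers):

```python
def dead_end_tables(visualizations: dict) -> list:
    """Check 6 sketch: ids of splunk.table panels with no drilldown.
    Any eventHandlers entry (a setToken, a link handler, or a custom
    action) counts as an exit; an empty or absent list is a dead end."""
    return [
        viz_id
        for viz_id, viz in visualizations.items()
        if viz.get("type") == "splunk.table" and not viz.get("eventHandlers")
    ]
```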
### Check 7: Every input has a default value

- DETECT: Every input of type dropdown, multiselect, or text has `options.defaultValue`, OR the bound token has a default.
- EVIDENCE: Per-input default status.
- VERDICT: PASS if every input resolves on first load. FAIL on any input that would render the dashboard empty.
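A sketch of the first-load scan, assuming the `input.*` type naming convention; it only covers the `options.defaultValue` arm of the check, so token-level defaults would still need a second pass:

```python
def inputs_without_defaults(inputs: dict) -> list:
    """Check 7 sketch: ids of inputs that would not resolve on first load."""
    needs_default = {"input.dropdown", "input.multiselect", "input.text"}
    return [
        input_id
        for input_id, inp in inputs.items()
        if inp.get("type") in needs_default
        and inp.get("options", {}).get("defaultValue") in (None, "")
    ]
```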
### Check 8: Every search is time-bounded

- DETECT: Every `ds.search` has `earliest=` AND `latest=` in its query, OR binds `options.queryParameters.earliest` / `latest` to the global time token.
- EVIDENCE: Per-search time-binding status.
- VERDICT: PASS if every search is bounded. FAIL on any unbounded query — this is an absolute ban.
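Both arms of the detection can be sketched over the `dataSources` map; the regexes are a heuristic, so SPL that sets its time bounds via a macro would still need a manual look:

```python
import re

def unbounded_searches(data_sources: dict) -> list:
    """Check 8 sketch: ids of ds.search data sources with no time bound.
    A search passes with inline earliest=/latest= in the query, or with
    both keys bound in options.queryParameters (e.g. to a time token)."""
    flagged = []
    for ds_id, ds in data_sources.items():
        if ds.get("type") != "ds.search":
            continue
        opts = ds.get("options", {})
        query = opts.get("query", "")
        inline = bool(re.search(r"\bearliest\s*=", query)) and \
                 bool(re.search(r"\blatest\s*=", query))
        bound = {"earliest", "latest"} <= set(opts.get("queryParameters", {}))
        if not (inline or bound):
            flagged.append(ds_id)
    return flagged
```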
### Check 9: Series colors come from a categorical palette

- DETECT: Chart `seriesColors` or `chart.seriesColorsByField` values do NOT include any of the reserved semantic hexes (`#DC4E41`, `#F1813F`, `#F8BE34`, `#53A051`, `#006D9C`) when used in a non-status chart context.
- EVIDENCE: Per-chart series color list. Highlight reserved hexes that appear.
- VERDICT: PASS if semantic colors are reserved for status. FAIL on any leak — a green data series in a non-status chart is the classic tell.
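A sketch of the leak scan, assuming palettes live in `options.seriesColors` as a list of hex strings; a `seriesColorsByField` map would need the same scan over its values:

```python
# Reserved semantic hexes named by this check.
RESERVED_SEMANTIC = {"#DC4E41", "#F1813F", "#F8BE34", "#53A051", "#006D9C"}

def semantic_leaks(visualizations: dict) -> dict:
    """Check 9 sketch: chart id -> reserved hexes leaking into its palette.
    Whether a given chart is a legitimate status context stays a human call."""
    leaks = {}
    for viz_id, viz in visualizations.items():
        colors = viz.get("options", {}).get("seriesColors") or []
        hits = sorted(c for c in colors if c.upper() in RESERVED_SEMANTIC)
        if hits:
            leaks[viz_id] = hits
    return leaks
```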
### Check 10: Color is paired with icon / label / shape

- DETECT: For every status-colored element (red/green KPIs, severity-colored rows), there is a paired icon (`splunk.singlevalueicon`), text label column, or shape change.
- EVIDENCE: Per-status-element pairing status.
- VERDICT: PASS if every status signal has a non-color backup. FAIL on any color-only status differentiator — this is an absolute ban.
### Check 11: Pie charts have ≤ 6 slices

- DETECT: Every `splunk.pie` is bound to SPL that produces ≤ 6 rows, OR the SPL contains a `| head 6` / Top-N aggregation, OR the chart has a deliberate slice-count limit.
- EVIDENCE: Per-pie row count.
- VERDICT: PASS if all pies ≤ 6 slices. FAIL on any pie exceeding — absolute ban.
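The SPL arm of the detection can be sketched with two regexes; it only catches explicit `| head N` / `top N` limits, so an SPL-shaped guarantee of few rows (a lookup, a fixed `stats ... by` cardinality) still needs a manual row-count check:

```python
import re

def pie_has_slice_limit(query: str, limit: int = 6) -> bool:
    """Check 11 sketch: does the SPL behind a pie bound its row count?"""
    heads = re.findall(r"\|\s*head\s+(\d+)", query)
    tops = re.findall(r"\btop\s+(?:limit=)?(\d+)", query)
    return any(int(n) <= limit for n in heads + tops)
```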
### Check 12: Panel titles are ≤ 40 characters, title case

- DETECT: Every panel's `title` is ≤ 40 chars AND uses title case (not snake_case_field_names, not sentence-long descriptions).
- EVIDENCE: Per-panel title length + case check.
- VERDICT: PASS if all titles are concise and title-case. FAIL on any paragraph-title or raw field-name title.
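A sketch of the mechanical half of this check. True title-case detection is fuzzy (articles and prepositions stay lowercase), so this only catches the two unambiguous tells: over-length titles and raw snake_case field names:

```python
def title_failures(visualizations: dict, max_len: int = 40) -> list:
    """Check 12 sketch: (viz_id, title) pairs that fail the length or
    raw-field-name tell; case style still needs a human read."""
    return [
        (viz_id, viz.get("title", ""))
        for viz_id, viz in visualizations.items()
        if len(viz.get("title", "")) > max_len or "_" in viz.get("title", "")
    ]
```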
### Check 13: Depth comes from layered rectangles

- DETECT: If the archetype calls for KPI card backgrounds or zone backgrounds (exec summary, operational, SOC), at least one `splunk.rectangle` is present in `layout.structure` to provide the layering.
- EVIDENCE: Rectangle count, placement (before or after KPI panels in the array).
- VERDICT: PASS if depth is present where the archetype expects it. N/A for dense-grid archetypes that do not use cards. FAIL if exec/ops/SOC dashboard has zero rectangles and KPIs float on the canvas.
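The evidence for this check can be sketched as follows. It assumes later entries in `layout.structure` render on top, so a rectangle must appear earlier in the array than the KPI it backs; verify that z-order assumption against the rendered dashboard:

```python
def rectangle_evidence(dashboard: dict) -> tuple:
    """Check 13 sketch: (rectangle count, whether any rectangle precedes
    the first single value in layout.structure)."""
    structure = dashboard.get("layout", {}).get("structure", [])
    vizzes = dashboard.get("visualizations", {})
    types = [vizzes.get(item.get("item"), {}).get("type") for item in structure]
    rect_count = types.count("splunk.rectangle")
    first_kpi = types.index("splunk.singlevalue") if "splunk.singlevalue" in types else len(types)
    layered = "splunk.rectangle" in types[:first_kpi]
    return (rect_count, layered)
```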
## Verdict system

Checks 8, 9, 10, and 11 map to absolute bans in `ds-ref-design-principles`. A single FAIL on any of them is disqualifying regardless of the total score — the dashboard is SLOP.

All other checks are weighted equally. Compute:

- Effective checks = 13 − (count of N/A).
- Passed = count of PASS across effective checks.
- Score = Passed / Effective.
Verdict:

| Verdict | Condition |
|---|---|
| PASS | Score ≥ 11/13 (≈ 85%) AND zero absolute-ban failures |
| MIXED | Score 7/13–10/13 AND zero absolute-ban failures |
| SLOP | Score < 7/13 OR ≥ 1 absolute-ban failure |
A dashboard with 12/13 passes but an unbounded search is still SLOP — the ban overrides the score. Say so in the verdict sentence and cite the ban.
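The whole verdict system reduces to a few lines. In this sketch, `results` maps check number to outcome, and the ban list defaults to checks 8–11 as defined in this section:

```python
def verdict(results: dict, absolute_bans=(8, 9, 10, 11)) -> str:
    """Map per-check outcomes ("PASS" | "FAIL" | "N/A", keyed by check
    number 1..13) to the overall verdict. A FAIL on any absolute-ban
    check short-circuits to SLOP regardless of the score."""
    if any(results.get(n) == "FAIL" for n in absolute_bans):
        return "SLOP"
    effective = sum(1 for r in results.values() if r != "N/A")
    passed = sum(1 for r in results.values() if r == "PASS")
    score = passed / effective if effective else 0.0
    if score >= 11 / 13:  # PASS threshold (~85%)
        return "PASS"
    if score >= 7 / 13:   # MIXED band: 7/13 through 10/13
        return "MIXED"
    return "SLOP"
```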
## critique.md format

```markdown
# Critique — <dashboard title>

**Verdict**: SLOP | MIXED | PASS
**Score**: 7/13 (1 absolute-ban failure)
**Generated**: <ISO-timestamp>
**Source**: <path to dashboard.json>
**Archetype**: <executive | operational | analytical | soc>
**Theme**: <dark | dark-noc | light>

## Summary

One paragraph, direct. Example for a SLOP verdict:

> This dashboard reads as AI-generated. Four KPIs share `#006D9C` with no
> semantic polarity, the canvas is unset (default Splunk grey), and the
> pie chart of alert severity has 11 slices. An operator would flag it
> in under ten seconds. Most issues are auto-fixable via `ds-polish`.

## Scorecard

### FAIL — Check 1: Archetype committed

Evidence: Panel mix reads as exec + SOC fusion (4 KPIs + geo map + scatter + analyst table).
Impact: No single audience is well-served. Recommend committing to one archetype via `ds-update`.

### PASS — Check 2: Theme derived from audience

Evidence: Dark theme (`#0b0c0e`) aligns with declared operational archetype.

### FAIL — Check 3: Canvas backgroundColor set

Evidence: `layout.options.backgroundColor` is absent. Renders default grey.
Impact: Auto-fixable — `ds-polish` Fix 1 handles this.

(... continue for all 13 checks ...)

## Absolute ban failures (if any)

- **Check 8 — Unbounded search**: data source `ds_top_hosts` has no `earliest=` / `latest=` binding. This triggers full-index scans. Must be fixed before deploy.

## Next action

Route to one of:

- **`ds-polish`** — most failures are in the APPLIED or SUGGESTED polish catalog. Run polish, then re-critique.
- **`ds-update`** — failures are architectural (wrong archetype, panel mix fighting itself). Polish can't fix structural slop.
- **Accept** — if MIXED with deliberate gaps (e.g., dense-grid archetype explicitly opted out of depth layering), mark the skipped checks in the workspace notes and move on.
```
## Integration with other skills

### When to run ds-critique vs ds-review

- `ds-review` when you want the broad audit — schema issues, perf risks, accessibility gaps, drilldown coverage.
- `ds-critique` when you want the opinionated design-principles verdict — and only that. Narrower, blunter, scored.

They are complementary — on a legacy dashboard, run both: `ds-review` for the "what's technically wrong" and `ds-critique` for the "is this slop" read.
### Handoff routing after critique

| Verdict | Recommended next skill |
|---|---|
| PASS | `ds-validate` → `ds-deploy`. Critique is a sign-off. |
| MIXED | `ds-polish` for the fixable subset; revisit critique after. |
| SLOP | `ds-polish` if failures are catalog-covered; `ds-update` if architectural; in severe cases, re-scope via `ds-init` and regenerate. |
### Cross-check against ds-ref-pitfalls

The 13 design-principles checks above don't cover configuration traps (CSV `seriesColors`, bubble layer with wrong SPL shape, `linkToDashboard.tokens` map vs array, etc.). Those traps fail the dashboard at runtime regardless of how clean the design is.

After running the critique scorecard, scan `ds-ref-pitfalls` for any of these symptoms in the dashboard:

- Schema-rejected shapes (linter / editor refuses to save).
- Silently-empty panels (renders, no console error).
- Drilldowns that don't fire or don't forward tokens.
- Visibility conditions that always evaluate true / false.

These get appended to the critique report under "Configuration risks beyond the Slop Test".
## Pipeline position

```
ds-create → [ds-critique?] → [ds-polish?] → ds-validate → ds-deploy
            ^^^^^^^^^^^^^^   ^^^^^^^^^^^^
            read-only verdict apply the fixes
```

Both `ds-critique` and `ds-polish` are optional. `ds-critique` is the diagnostic pass; `ds-polish` is the fix pass. Running critique BEFORE polish is useful when you want to know how bad it is before starting — especially on legacy dashboards you did not create yourself.
## When NOT to use ds-critique

- During active design iteration: critique is a finishing diagnostic. Running it mid-design produces noise.
- When the user wants a general review: route to `ds-review` instead.
- When you already plan to run `ds-polish`: polish includes the same diagnostic pass as its Phase 3 (Propose). A separate critique is redundant — just read the `polish-report.md`.