一键在 Manus 中运行任何 Skill

architecture

Microarchitecture exploration, PPA estimation, risk assessment, and architecture sign-off for digital chip design. Use when evaluating design candidates, estimating power/area/performance, assessing technical risk, or producing a microarchitecture document for handoff to RTL design.

在 Manus 中运行

概览

安装命令

npx skills add https://github.com/chuanseng-ng/digital-chip-design-agents --skill architecture

复制此命令并粘贴到 Claude Code 中以安装该技能

来源

chuanseng-ng/digital-chip-design-agents

星标140

分支36

更新时间2026年5月31日 00:31

SKILL.md

readonly

同仓库更多 Skills

同仓库

pipeline-orchestration

chuanseng-ng/digital-chip-design-agents

Cross-domain loop orchestration for the chip design pipeline. Provides the fix_request protocol, iteration-cap logic, escalation templates, and dispatch patterns for routing verification/formal failures to the RTL orchestrator and back. Use when driving the closed-loop verification↔RTL feedback cycle.

2026-05-31140

memory-keeper

chuanseng-ng/digital-chip-design-agents

Distil accumulated experience records (experiences.jsonl) into updated domain knowledge summaries (knowledge.md) for any chip-design domain. Run after every 10 orchestrator sessions, or on demand when a domain has collected new issue/fix patterns.

2026-05-31140

hls

chuanseng-ng/digital-chip-design-agents

High-Level Synthesis — C/C++ algorithm analysis, HLS directive optimisation, synthesis execution, and co-simulation verification. Use when converting C/C++ to synthesisable RTL, optimising for latency/throughput/area targets using pragmas, or verifying that generated RTL matches the golden C model.

2026-05-31140

sta

chuanseng-ng/digital-chip-design-agents

Static timing analysis — multi-corner constraint validation, setup and hold analysis, timing exception review, and ECO guidance for closure. Use when running timing analysis on a design, reviewing timing violations, guiding ECO fixes, or performing timing sign-off for tape-out.

2026-05-31140

dft

chuanseng-ng/digital-chip-design-agents

Design for Test — scan architecture planning, scan insertion, ATPG pattern generation, MBIST for embedded memories, and JTAG boundary scan. Use when planning a DFT strategy, inserting scan, generating test patterns, or verifying that a chip will be testable in manufacturing.

2026-05-31140

formal-verification

chuanseng-ng/digital-chip-design-agents

Formal property verification (FPV) and logical equivalence checking (LEC). Use when proving design properties exhaustively, checking RTL vs gate-level netlist equivalence, verifying CDC crossings formally, or closing verification coverage gaps that simulation cannot efficiently reach.

2026-05-31140

来源

chuanseng-ng

chuanseng-ng/digital-chip-design-agents

打开 GitHub 仓库查看创作者相关仓库

安装命令

下载

在 Manus 中运行

适用职业SOC

软件开发工程师计算机与数学类职业15-1252L4

name	architecture
description	Microarchitecture exploration, PPA estimation, risk assessment, and architecture sign-off for digital chip design. Use when evaluating design candidates, estimating power/area/performance, assessing technical risk, or producing a microarchitecture document for handoff to RTL design.
version	1.0.0
author	chuanseng-ng
license	MIT
allowed-tools	Read, Write, Bash

Skill: Architecture Evaluation

Invocation

If invoked by a user presenting a design task: immediately spawn the digital-chip-design-agents:architecture-orchestrator agent and pass the full user request and any available context. Do not execute stages directly.
If invoked by the architecture-orchestrator mid-flow: do not spawn a new agent. Treat this file as read-only — return the requested stage rules, sign-off criteria, or loop-back guidance to the calling orchestrator.

Spawning the orchestrator from within an active orchestrator run causes recursive delegation and must never happen.

Pre-run Context

Before executing or advising on any stage, read the following files if they exist:

memory/architecture/knowledge.md — known failure patterns, successful tool flags, PDK/tool quirks. Incorporate its guidance into every stage decision. If absent, proceed without it.
memory/architecture/run_state.md — current run identity (run_id, design_name, tool, last_stage). Use this to resume correctly after interruption. If absent, a new run is starting; the orchestrator will create this file before the first stage.

This pre-run read applies whether this skill is loaded by a user or called by the orchestrator mid-flow. It ensures the fix database is consulted before any diagnosis step.

Purpose

Guide the full microarchitecture evaluation process from product specification through to a signed-off microarchitecture document ready for RTL handoff. Covers specification decomposition, candidate architecture exploration, performance and PPA modelling, risk assessment, and sign-off.

Supported EDA Tools

Open-Source

gem5 (gem5) — full-system micro-architectural simulator for performance modelling
McPAT (mcpat) — processor power, area, and timing estimator
CACTI (cacti) — SRAM/cache power and area estimator
Python estimation scripts (python3 estimate.py) — custom PPA models

Proprietary

Synopsys Platform Architect — IP-level performance and power exploration
ARM Performance Models — cycle-accurate ARM subsystem models
Cadence Virtual System Platform (VSP) — SoC-level virtual prototyping

Stage: spec_analysis

Domain Rules

Classify every requirement: functional, performance, power, area, interface, safety/security
Identify under-specified areas and flag as open questions for the product team
Map each use case to required hardware blocks (datapath, control, memory, IO)
Extract all interface requirements with protocols (AXI, PCIe, USB, Ethernet, etc.)
Identify safety/security requirements (ISO 26262, FIPS, CC) if applicable
Assign priority: Must-Have / Should-Have / Nice-to-Have
Produce a structured requirements document before any architecture work begins

QoR Metrics to Evaluate

Requirements coverage: 100% of spec sections mapped to at least one requirement
Ambiguity count: all unresolved items captured in open questions list
Interface completeness: all external interfaces named with protocol and bandwidth

Common Issues & Fixes

Issue	Fix
Spec section not mapped	Add to open questions; do not assume
Interface bandwidth unspecified	Request from product team before proceeding
Conflicting requirements	Flag as blocker; request resolution

Output Required

Structured requirements document (JSON or Markdown)
Interface list with protocols and bandwidths
Open questions list

Stage: arch_exploration

Domain Rules

Generate minimum 3 candidate architectures: conservative, balanced, aggressive
Evaluate pipeline depth trade-offs (deeper = higher frequency, more area/power)
Evaluate parallelism: SIMD, superscalar, spatial unrolling — with area/power cost
Cache/memory hierarchy: size, associativity, latency vs area trade-off per use case
Interconnect topology: bus, crossbar, NoC — evaluate bandwidth vs complexity
Consider IP reuse: identify hard macros or licensed IPs before designing custom
Document all assumptions for each candidate explicitly
Produce a trade-off matrix comparing all candidates

Trade-off Matrix Template

Candidate	Freq Target	Area Est.	Power Est.	Risk	Notes
Option A	1GHz	3mm²	300mW	Low	...
Option B	2GHz	6mm²	700mW	High	...

QoR Metrics to Evaluate

Minimum 3 candidates explored with distinct trade-off profiles
Each candidate: performance estimate within 20% of target
Single recommended candidate with clear quantitative justification

Output Required

Trade-off matrix with all candidates
Recommended candidate with quantitative justification
Assumptions and risk summary per candidate

Stage: perf_modelling

Domain Rules

Use analytical models (Amdahl, Roofline) for initial estimates
Build TLM/SystemC or Python models for complex pipelines
Model all bottlenecks: compute, memory bandwidth, IO throughput
Sweep key parameters: clock frequency, parallelism, cache size
Validate with representative workloads from the use-case list
Include best/typical/worst-case scenarios
Flag any model assumption that has not been validated

QoR Metrics to Evaluate

Throughput: meets or exceeds target by ≥ 10% margin
Latency: meets target at worst-case workload
Memory bandwidth: does not exceed DRAM/SRAM ceiling
Model confidence: HIGH / MEDIUM / LOW

Output Required

Performance model (script or spreadsheet)
Throughput/latency results per use case
Sensitivity analysis
Comparison table: modelled vs target

Stage: power_area_estimation

Domain Rules

Area: use technology library scaling data (gates/mm² at target node)
Dynamic power: P = α × C × V² × f (get activity factor from use cases)
Leakage: estimate from library characterisation at target Vt mix
Memory area: use SRAM compiler estimates for given depth × width
IO pad area: per pad ring design rules
Apply 15–20% margin — RTL is never minimal
Flag immediately if any estimate exceeds 80% of budget

Clock Gating Opportunity Analysis

Perform this analysis using the activity factors already collected for dynamic power:

For each identified clock domain, record its activity factor α derived from the use-case workload sweep (gem5 simulation or analytical model).
Classify each domain using thresholds from design_state.constraints.power.activity_factors (defaults: {"default": 0.15, "high": 0.40}):
- α < activity_factors.default (default: 0.15) — high gating opportunity: clock gating will save > 30% dynamic power for that domain; flag as a must-have RTL requirement.
- activity_factors.default ≤ α < activity_factors.high (defaults: 0.15–0.40) — moderate gating opportunity: clock gating recommended; flag as should-have RTL requirement.
- α ≥ activity_factors.high (default: 0.40) — always-active: no gating benefit; document as always-on.
Produce a clock_power_budget table (one row per domain):

Domain Frequency α (activity) Est. Clock Power (mW) Gating Class
core 1 GHz 0.08 45 high
dsp 500 MHz 0.55 30 always-on
McPAT clocking component already models clock network power — ensure the frequency-sweep input reflects per-domain frequencies, not a single global clock.
Include the clock_power_budget table in the hand-off package to RTL design. The RTL agent will use it to target ICG (Integrated Clock Gate) insertion.

Supported Tools for Clock/Power Analysis

Tool	Type	Use
McPAT	Open-source	Clock network + dynamic/leakage power (already in flow)
gem5	Open-source	Workload activity factor extraction (already in flow)
CACTI	Open-source	Memory clock power estimate (already in flow)
Yosys + ABC	Open-source	Post-synth switching activity cross-check (optional)
Synopsys PrimePower	Proprietary	RTL-level power sign-off (optional)
Cadence Joules RTL	Proprietary	RTL power analysis (optional)

QoR Metrics to Evaluate

Area estimate: < 80% of design_state.constraints.area.area_um2 budget (required constraint — see architecture-orchestrator Behaviour Rule 9)
Dynamic power: < 80% of design_state.constraints.power.power_mw budget (required constraint)
Leakage: < design_state.constraints.power.leakage_pct_max% of total estimated power (default: 15%)
Clock-gating coverage: ≥ design_state.constraints.power.gating_coverage_pct_min% of register-bank bits in high-opportunity domains (default: 60%) (measured using planned register-map estimates from the microarchitecture specification; mark estimate confidence as HIGH if register counts are frozen, MEDIUM if approximate, LOW if based on scaling from similar designs)
Confidence: HIGH / MEDIUM / LOW

Output Required

Area breakdown by block
Power breakdown: dynamic, leakage, per domain
Margin analysis vs targets
clock_power_budget table (domain → frequency, activity factor, estimated clock power mW, gating class)

Stage: risk_assessment

Domain Rules

Risk categories: schedule, technical feasibility, IP availability, tool support, verification complexity, power closure, manufacturing yield
Score every risk: Probability (1–5) × Impact (1–5) = Risk Score
Risk score ≥ 15: classified HIGH — must have mitigation plan before sign-off
IP risks: verify availability, licensing timeline, silicon-proven status
Tool risks: verify EDA tool certification for chosen technology node
Verification risks: flag if testbench complexity > 6 months estimated effort
Every risk must have an assigned owner

QoR Metrics to Evaluate

No unmitigated HIGH risks at sign-off
All risks: assigned owner and mitigation plan
Schedule risk assessed vs team capacity

Output Required

Risk register (ID, description, score, mitigation, owner)
Top 5 risks for management review

Stage: arch_signoff

Sign-off Checklist

Output Required

Signed-off microarchitecture document
Final trade-off decision record
RTL design guidelines
Hand-off package

Constraint Validation

See plugins/meta/skills/pipeline-orchestration/SKILL.md §Constraints Schema for the authoritative schema and stage-entry validation rule.

Required at entry (spec_analysis) — hard-fail if missing:

constraints.clock.clk_mhz — target frequency
constraints.area.area_um2 — area budget
constraints.power.power_mw — power budget

Optional (schema defaults apply when absent):

constraints.power.leakage_pct_max (default: 15%) — leakage threshold
constraints.power.gating_coverage_pct_min (default: 60%) — ICG coverage target
constraints.power.activity_factors (defaults: {default: 0.15, high: 0.40}) — domain classification thresholds

Memory

Write on stage completion

After each stage completes (regardless of whether an orchestrator session is active), write or overwrite one JSON record in memory/architecture/experiences.jsonl keyed by run_id. This ensures data is persisted even if the flow is interrupted or called without full orchestrator context.

Use run_id = architecture_<YYYYMMDD>_<HHMMSS> (set once at flow start; reuse on each stage update). Set signoff_achieved: false until the final sign-off stage completes.

Run state (write before first stage, update after each stage)

Write memory/architecture/run_state.md as the first action before launching any tool:

run_id:      architecture_<YYYYMMDD>_<HHMMSS>
design_name: <design>
tool:        <primary tool>
start_time:  <ISO-8601>
last_stage:  null

Update last_stage to the completed stage name only after each stage finishes successfully. This file lets wakeup-loop prompts and resumed sessions identify the correct run without relying on in-memory state. Create the file and parent directories if they do not exist.

Optional: claude-mem index

If mcp__plugin_ecc_memory__add_observations is available in this session, emit each applied fix as an observation to entity chip-design-architecture-fixes after writing to experiences.jsonl. Skip silently if the tool is absent — JSONL is the canonical record.

Domain	Frequency	α (activity)	Est. Clock Power (mW)	Gating Class
core	1 GHz	0.08	45	high
dsp	500 MHz	0.55	30	always-on

Domain	Frequency	α (activity)	Est. Clock Power (mW)	Gating Class
core	1 GHz	0.08	45	high
dsp	500 MHz	0.55	30	always-on