Run any Skill in Manus with one click

dsm-llm-modularization

LLM-based Design Structure Matrix (DSM) modularization methodology. Use when partitioning complex systems into cohesive modules, optimizing system architecture, or applying LLMs to combinatorial engineering problems. Activation triggers: DSM, design structure matrix, system modularization, architecture decomposition, LLM combinatorial optimization, semantic alignment hypothesis, engineering design optimization.

Run Skill in Manus

Stars1

Forks0

UpdatedJune 3, 2026 at 15:39

Source

hiyenwong

hiyenwong/ai_collection

View GitHub Repository View Creator Repositories

Install command

Download

Run Skill in Manus

Useful forSOC

Software DevelopersComputer and Mathematical Occupations15-1252L4

SKILL.md

readonly

name

dsm-llm-modularization

description

DSM Modularization with Large Language Models

LLM-based approach to Design Structure Matrix modularization achieving near-reference quality within 30 iterations without specialized optimization code. Introduces the semantic-alignment hypothesis governing when domain knowledge helps or hurts LLM-based optimization.

Metadata

Source: arXiv:2604.28018
Authors: Shuo Jiang, Jianxi Luo
Published: 2026-04-30
Subjects: cs.CE (Computational Engineering); cs.AI

Core Methodology

Key Innovation

DSM modularization partitions system elements into cohesive modules — a fundamental combinatorial challenge in engineering design. This paper shows that LLMs can solve DSM modularization through iterative prompting, without requiring specialized optimization code.

Technical Framework

Three-stage LLM-based modularization:

Input Representation
- Convert DSM (N×N dependency matrix) into natural language description
- Include element names, dependency strengths, and constraints
- Best practice: Include raw matrix structure + semantic labels
Objective Formulation
- Frame modularization as minimizing inter-module dependencies
- Use standard metrics: Minimum Description Length (MDL), clustering coefficient
- Best practice: Provide explicit objective function in natural language
Solution Pool Design
- Maintain a pool of candidate solutions across iterations
- LLM generates new candidates, evaluates against pool
- Best practice: Pool size 5-10, retain top-k by objective value

Critical Finding: Semantic-Alignment Hypothesis

Counterintuitive result: Domain knowledge, beneficial in DSM sequencing, consistently impairs performance on DSM modularization for complex DSMs.

Hypothesis: LLM effectiveness with domain knowledge depends on semantic alignment between:

The LLM's functional priors (what it "knows" about the domain)
The optimization objective (structural vs. functional)

When knowledge helps: The domain semantics align with the optimization goal When knowledge hurts: The LLM's functional priors conflict with structural optimization objectives

Testable condition: Before adding domain knowledge to LLM prompts, verify whether the knowledge supports or contradicts the mathematical objective.

Implementation Guide

Step 1: DSM Construction

Build N×N matrix of element dependencies
Weight edges by dependency strength
Validate matrix completeness

Step 2: LLM Prompt Design

System: You are an expert in system architecture modularization.
Task: Partition these {N} system elements into cohesive modules.
Input: {DSM_description_with_element_names_and_dependencies}
Objective: Minimize inter-module dependencies (total coupling between modules).
Constraints: {max_modules, min_module_size, etc.}
Format: Return module assignments as JSON.

Step 3: Iterative Optimization

Run 30 iterations of LLM prompting
Each iteration: present current best + ask for improvement
Track objective value across iterations
Convergence typically within 20-30 iterations

Step 4: Validation

Compare against reference solutions (if available)
Compute modularity metrics (MDL, clustering coefficient)
Evaluate semantic coherence of modules

Pitfalls

Semantic misalignment: Adding domain knowledge to prompts can degrade results if the knowledge conflicts with structural optimization. Test both with and without domain context.
LLM variability: Different backbone LLMs produce different quality results. Test across 2-3 models.
Prompt sensitivity: Input representation format significantly impacts results. Use ablation studies to find optimal format.
Scale limits: Method validated on DSMs up to moderate size. Very large DSMs (N > 200) may need hierarchical decomposition.

Applications

System architecture design
Software modularization
Product family planning
Organization design
Supply chain clustering

Related Skills

modern-systems-engineering-patterns
emergent-systems-design
agent-first-bootstrap

More from this repository

same repository

qldpc-breakeven-demonstration

hiyenwong/ai_collection

Breakeven demonstration of quantum low-density parity-check (qLDPC) codes — first experimental evidence that qLDPC codes can achieve fault-tolerance breakeven on trapped-ion quantum hardware. Critical milestone for scalable quantum error correction. Activation: qLDPC, quantum error correction, breakeven, trapped-ion, fault tolerance, quantum coding, logical qubit, error suppression.

2026-06-081

amm-fairness-impossibility

hiyenwong/ai_collection

Arrovian impossibility theorem for Automated Market Maker (AMM) design. Proves no aggregation rule for weighted-product AMMs can be simultaneously fair and strategy-proof when n>2 liquidity providers. Key result: fairness forces mean-type aggregation (weighted Aitchison centroid) while strategy-proofness forces median-type; only single-provider dictatorship satisfies both. Obstruction vanishes at n=2. Applies to DeFi protocol design, mechanism design, and prediction markets. (arXiv: 2606.04959)

2026-06-081

bbqram-state-preparation-finance

hiyenwong/ai_collection

Architecture-aware quantum state preparation using Bucket Brigade QRAM (BBQRAM) with segment tree for polylogarithmic query time. Covers complex-valued matrix encoding, classical precomputation of rotation angles, and magnitude-then-phase procedures. Enables efficient data loading for quantum finance applications. Based on arXiv:2604.25644. Use when: designing QRAM-based quantum data loaders, optimizing state preparation for quantum finance, loading complex-valued financial data into quantum circuits, implementing efficient amplitude encoding with BBQRAM.

2026-06-081

distributional-portfolio-optimization

hiyenwong/ai_collection

Distributional Portfolio Optimization (DPO) unified framework — organizing Bayesian, robust, chance-constrained, stochastic-allocation, and distributional RL portfolio methods through joint coupling Gamma_theta(dw,dr). Includes Wasserstein-CVaR duality, credible-radius calibration, and distributional Bellman contraction. Activation: distributional portfolio optimization, DPO, Wasserstein DRO, Bayesian portfolio, CVaR, credible radius, distributional reinforcement learning.

2026-06-081

inverse-born-rule-fallacy

hiyenwong/ai_collection

Critical analysis methodology for quantum data encoding — identifies how naive amplitude encoding (psi=sqrt(P)) abelianizes the Hilbert space and fails to achieve genuine quantum advantage in QML/finance. Advocates for Dynamical Hamiltonian Encoding (DHE) where data generates non-commutative evolution.

2026-06-081

portfolio-optimization-mean-variance-spectrum

hiyenwong/ai_collection

Portfolio Optimization with Mean-Variance-Spectrum Preferences

2026-06-081

name

dsm-llm-modularization

description

DSM Modularization with Large Language Models

LLM-based approach to Design Structure Matrix modularization achieving near-reference quality within 30 iterations without specialized optimization code. Introduces the semantic-alignment hypothesis governing when domain knowledge helps or hurts LLM-based optimization.

Metadata

Source: arXiv:2604.28018
Authors: Shuo Jiang, Jianxi Luo
Published: 2026-04-30
Subjects: cs.CE (Computational Engineering); cs.AI

Core Methodology

Key Innovation

Technical Framework

Three-stage LLM-based modularization:

Input Representation
- Convert DSM (N×N dependency matrix) into natural language description
- Include element names, dependency strengths, and constraints
- Best practice: Include raw matrix structure + semantic labels
Objective Formulation
- Frame modularization as minimizing inter-module dependencies
- Use standard metrics: Minimum Description Length (MDL), clustering coefficient
- Best practice: Provide explicit objective function in natural language
Solution Pool Design
- Maintain a pool of candidate solutions across iterations
- LLM generates new candidates, evaluates against pool
- Best practice: Pool size 5-10, retain top-k by objective value

Critical Finding: Semantic-Alignment Hypothesis

Counterintuitive result: Domain knowledge, beneficial in DSM sequencing, consistently impairs performance on DSM modularization for complex DSMs.

Hypothesis: LLM effectiveness with domain knowledge depends on semantic alignment between:

The LLM's functional priors (what it "knows" about the domain)
The optimization objective (structural vs. functional)

When knowledge helps: The domain semantics align with the optimization goal When knowledge hurts: The LLM's functional priors conflict with structural optimization objectives

Testable condition: Before adding domain knowledge to LLM prompts, verify whether the knowledge supports or contradicts the mathematical objective.

Implementation Guide

Step 1: DSM Construction

Build N×N matrix of element dependencies
Weight edges by dependency strength
Validate matrix completeness

Step 2: LLM Prompt Design

System: You are an expert in system architecture modularization.
Task: Partition these {N} system elements into cohesive modules.
Input: {DSM_description_with_element_names_and_dependencies}
Objective: Minimize inter-module dependencies (total coupling between modules).
Constraints: {max_modules, min_module_size, etc.}
Format: Return module assignments as JSON.

Step 3: Iterative Optimization

Run 30 iterations of LLM prompting
Each iteration: present current best + ask for improvement
Track objective value across iterations
Convergence typically within 20-30 iterations

Step 4: Validation

Compare against reference solutions (if available)
Compute modularity metrics (MDL, clustering coefficient)
Evaluate semantic coherence of modules

Pitfalls

Semantic misalignment: Adding domain knowledge to prompts can degrade results if the knowledge conflicts with structural optimization. Test both with and without domain context.
LLM variability: Different backbone LLMs produce different quality results. Test across 2-3 models.
Prompt sensitivity: Input representation format significantly impacts results. Use ablation studies to find optimal format.
Scale limits: Method validated on DSMs up to moderate size. Very large DSMs (N > 200) may need hierarchical decomposition.

Applications

System architecture design
Software modularization
Product family planning
Organization design
Supply chain clustering

Related Skills

modern-systems-engineering-patterns
emergent-systems-design
agent-first-bootstrap