一键在 Manus 中运行任何 Skill

adaptive-lora-personalized-ranks

Dynamically allocate LoRA ranks per-layer during fine-tuning instead of using fixed uniform ranks. Learn optimal rank for each layer and subject via variational framework with discretized exponential distribution, reducing memory footprint while maintaining fidelity and text-alignment.

在 Manus 中运行

概览

安装命令

npx skills add https://github.com/ADu2021/skillXiv --skill adaptive-lora-personalized-ranks

复制此命令并粘贴到 Claude Code 中以安装该技能

来源

ADu2021/skillXiv

星标3

分支0

更新时间2026年3月26日 15:00

SKILL.md

readonly

name	adaptive-lora-personalized-ranks
title	Not All Layers Are Created Equal: Adaptive LoRA Rank Allocation for Personalized Image Generation
version	0.0.3
engine	skillxiv-v0.0.3-claude-opus-4.6
license	MIT
url	https://arxiv.org/abs/2603.21884
keywords	["Low-Rank Adaptation","Adaptive Ranks","Diffusion Models","Parameter Efficiency","Personalization"]
description	Dynamically allocate LoRA ranks per-layer during fine-tuning instead of using fixed uniform ranks. Learn optimal rank for each layer and subject via variational framework with discretized exponential distribution, reducing memory footprint while maintaining fidelity and text-alignment.

Adaptive LoRA: Per-Layer Rank Allocation

Problem Statement

Current LoRA practice applies identical ranks uniformly across all layers, which is suboptimal because layer importance varies and subjects have different complexity. Simple subjects waste capacity with high-rank components, while complex subjects suffer from insufficient expressiveness with uniform low ranks.

Component Innovation: Adaptive Rank Learning

The Modification: Replace fixed-rank LoRA with learnable per-layer rank parameters (νℓ) that are optimized during fine-tuning.

Technical Mechanism:

The method introduces learnable parameters νℓ for each LoRA component that control effective rank. A discretized exponential distribution imposes importance ordering on rank indices, preventing all ranks from collapsing identically.

# Variational rank framework: learnable importance weights per layer
# For each layer l, learn importance parameters nu_l that gate
# which rank dimensions activate during training
# Rank is dynamically added/removed via gated forward passes

# Forward pass with adaptive rank masking:
# out = (A @ diag(Lambda_l) @ B @ x)
# where Lambda_l is dynamically scaled based on learned nu_l

Weight rescaling through diagonal matrices (Λℓ) normalizes magnitudes during forward passes. The training loss combines three objectives:

Reconstruction loss (MSE between fine-tuned and target outputs)
Rank regularization (pushing toward target rank)
Cross-attention entropy minimization (preserving attention patterns)

Ablation Results

Memory Footprint: LoRA2 achieves comparable visual quality with 0.40 GB vs. 2.80 GB for fixed rank-512 LoRA—a 7× reduction.

Quality Metrics across 29 subjects:

Subject fidelity (DINO, CLIP-I): Competitive with fixed-rank baselines
Text alignment (CLIP-T): Better than many fixed-rank configurations
Rank analysis: Optimal ranks vary significantly across subjects (range 32-256) and layers, confirming heterogeneous requirements

Drop-In Checklist

Initialization: Start with fixed rank estimate (e.g., 64) across all layers
Enable Adaptation: Introduce learnable νℓ parameters with exponential prior
Training: Use three-component loss; tune regularization strength via validation
Rank Monitoring: Track effective ranks per layer; verify they diverge (not collapse to identical values)
Memory Validation: Compare footprint; target 5-10× reduction over fixed high-rank baseline
Quality Gate: Ensure DINO/CLIP-I fidelity ≥ baseline; accept CLIP-T variance if overall efficiency gain is 5×+

Conditions for Effectiveness

Subject Complexity Diversity: Method shines when fine-tuning multiple subjects with varying detail (e.g., 29 subjects spanning simple logos to complex faces). Single subject may not benefit.
Backbone Model: Tested on SDXL and KOALA-700m; effectiveness depends on having learnable LoRA components across all target layers.
Training Budget: Requires sufficient data to learn stable rank preferences; very low-shot scenarios may revert to fixed ranks.
Compute-Memory Tradeoff: Accept ~2× slower training (rank optimization overhead) for 7× memory savings.

Practical Implications

Eliminates Hyperparameter Search: Single training run replaces grid search over rank configurations.
Scalability: Enables personal model development at scale where uniform high ranks become prohibitive.

同仓库更多 Skills

同仓库

meaningful-kebab-case-name

ADu2021/skillXiv

Convert arXiv papers into ready-to-use agent skills using category-aware extraction. First classifies the paper into one or more of 11 research categories, then applies a specialized extraction pipeline for each category — because different types of papers produce different types of usable knowledge. A single paper can yield multiple skills if it spans categories. Use this skill whenever the user wants to turn a paper into a skill, extract practical techniques from research, build a skill library from papers, convert arXiv papers into reusable agent instructions, or batch-process multiple papers into skills. Also trigger when someone asks about extracting actionable knowledge from papers, making research practical for LLM agents, or systematically converting academic contributions into structured agent capabilities.

2026-03-263

action-quantization-behavior-cloning

ADu2021/skillXiv

Establish regret bounds for behavior cloning with discretized actions combining statistical error and quantization error terms. Prove smoothness requirements for safe quantizer design, show that learning-based quantizers fail these requirements, and propose model-based augmentation to reduce error dependence from H² to H.

2026-03-263

additivellm2-domain-adaptation

ADu2021/skillXiv

Adapt general LLMs to specialized manufacturing domains via domain-adaptive pretraining on open-access journals and visual instruction tuning. Extract 50M tokens and 24K images from peer-reviewed papers, achieve >90% accuracy on domain knowledge tasks, and enable real-time defect identification from manufacturing images.

2026-03-263

agentic-ai-intelligence-explosion

ADu2021/skillXiv

Future intelligence explosions will be plural, social, and entangled with humanity through distributed collaborative systems rather than singular superintelligence. Intelligence is inherently social, demanding infrastructure matching agent development; integrate governance, institutional frameworks, and constitutional checks across hierarchies of autonomous agents and human-AI centaurs in shifting configurations.

2026-03-263

animalclap-taxonomy-aware-pretraining

ADu2021/skillXiv

Build taxonomy-aware audio-text pretraining systems for species recognition from animal vocalizations. Train contrastive models that augment text prompts with hierarchical taxonomic structure (scientific/common names, phylogenetic sequences), evaluate on unseen species via rare-species test sets, and predict ecological traits directly from audio.

2026-03-263

bubblerag-evidence-driven-graphs

ADu2021/skillXiv

Address hallucinations in LLM QA over black-box knowledge graphs using evidence-driven retrieval. Formalize Optimal Informative Subgraph Retrieval and employ bubble expansion to discover candidate evidence graphs, achieving state-of-the-art multi-hop QA performance.

2026-03-263

来源

ADu2021

ADu2021/skillXiv

打开 GitHub 仓库查看创作者相关仓库

安装命令

下载

在 Manus 中运行

适用职业SOC

数据科学家计算机与数学类职业15-2051L4

name	adaptive-lora-personalized-ranks
title	Not All Layers Are Created Equal: Adaptive LoRA Rank Allocation for Personalized Image Generation
version	0.0.3
engine	skillxiv-v0.0.3-claude-opus-4.6
license	MIT
url	https://arxiv.org/abs/2603.21884
keywords	["Low-Rank Adaptation","Adaptive Ranks","Diffusion Models","Parameter Efficiency","Personalization"]
description	Dynamically allocate LoRA ranks per-layer during fine-tuning instead of using fixed uniform ranks. Learn optimal rank for each layer and subject via variational framework with discretized exponential distribution, reducing memory footprint while maintaining fidelity and text-alignment.

Adaptive LoRA: Per-Layer Rank Allocation

Problem Statement

Component Innovation: Adaptive Rank Learning

The Modification: Replace fixed-rank LoRA with learnable per-layer rank parameters (νℓ) that are optimized during fine-tuning.

Technical Mechanism:

# Variational rank framework: learnable importance weights per layer
# For each layer l, learn importance parameters nu_l that gate
# which rank dimensions activate during training
# Rank is dynamically added/removed via gated forward passes

# Forward pass with adaptive rank masking:
# out = (A @ diag(Lambda_l) @ B @ x)
# where Lambda_l is dynamically scaled based on learned nu_l

Weight rescaling through diagonal matrices (Λℓ) normalizes magnitudes during forward passes. The training loss combines three objectives:

Reconstruction loss (MSE between fine-tuned and target outputs)
Rank regularization (pushing toward target rank)
Cross-attention entropy minimization (preserving attention patterns)

Ablation Results

Memory Footprint: LoRA2 achieves comparable visual quality with 0.40 GB vs. 2.80 GB for fixed rank-512 LoRA—a 7× reduction.

Quality Metrics across 29 subjects:

Subject fidelity (DINO, CLIP-I): Competitive with fixed-rank baselines
Text alignment (CLIP-T): Better than many fixed-rank configurations
Rank analysis: Optimal ranks vary significantly across subjects (range 32-256) and layers, confirming heterogeneous requirements

Drop-In Checklist

Initialization: Start with fixed rank estimate (e.g., 64) across all layers
Enable Adaptation: Introduce learnable νℓ parameters with exponential prior
Training: Use three-component loss; tune regularization strength via validation
Rank Monitoring: Track effective ranks per layer; verify they diverge (not collapse to identical values)
Memory Validation: Compare footprint; target 5-10× reduction over fixed high-rank baseline
Quality Gate: Ensure DINO/CLIP-I fidelity ≥ baseline; accept CLIP-T variance if overall efficiency gain is 5×+

Conditions for Effectiveness

Subject Complexity Diversity: Method shines when fine-tuning multiple subjects with varying detail (e.g., 29 subjects spanning simple logos to complex faces). Single subject may not benefit.
Backbone Model: Tested on SDXL and KOALA-700m; effectiveness depends on having learnable LoRA components across all target layers.
Training Budget: Requires sufficient data to learn stable rank preferences; very low-shot scenarios may revert to fixed ranks.
Compute-Memory Tradeoff: Accept ~2× slower training (rank optimization overhead) for 7× memory savings.

Practical Implications

Eliminates Hyperparameter Search: Single training run replaces grid search over rank configurations.
Scalability: Enables personal model development at scale where uniform high ranks become prohibitive.