一键在 Manus 中运行任何 Skill

additivellm2-domain-adaptation

Adapt general LLMs to specialized manufacturing domains via domain-adaptive pretraining on open-access journals and visual instruction tuning. Extract 50M tokens and 24K images from peer-reviewed papers, achieve >90% accuracy on domain knowledge tasks, and enable real-time defect identification from manufacturing images.

在 Manus 中运行

概览

安装命令

npx skills add https://github.com/ADu2021/skillXiv --skill additivellm2-domain-adaptation

复制此命令并粘贴到 Claude Code 中以安装该技能

来源

ADu2021/skillXiv

星标3

分支0

更新时间2026年3月26日 15:00

SKILL.md

readonly

name	additivellm2-domain-adaptation
title	AdditiveLLM2: Multi-Modal Language Models for Additive Manufacturing
version	0.0.3
engine	skillxiv-v0.0.3-claude-opus-4.6
license	MIT
url	https://arxiv.org/abs/2603.22017
keywords	["Domain Adaptation","Language Models","Additive Manufacturing","Knowledge Extraction","Visual Understanding"]
description	Adapt general LLMs to specialized manufacturing domains via domain-adaptive pretraining on open-access journals and visual instruction tuning. Extract 50M tokens and 24K images from peer-reviewed papers, achieve >90% accuracy on domain knowledge tasks, and enable real-time defect identification from manufacturing images.

AdditiveLLM2: Domain Adaptation for Manufacturing

Domain Problem: Knowledge Gap in General LLMs

Additive manufacturing (AM) involves specialized terminology (LPBF, FDM, melt pool dynamics, material properties) and visual patterns (layer defects, porosity) that general LLMs rarely encounter. Manufacturing practitioners need models that understand equipment specifications, material behavior, and quality control—not general knowledge about 3D printing.

Gap Analysis: General Gemma-3-12B achieves <50% accuracy on manufacturing knowledge tasks; domain-specific terminology is treated as out-of-vocabulary or incorrectly classified.

Source Method & Adaptation Recipe

Foundation Model: Gemma-3-12B (open-weights baseline)

Domain Data Curation: Extract text and images from 1,704 peer-reviewed papers across four open-access journals:

Journal of Additive Manufacturing
Rapid Prototyping Journal
Specialized AM conferences (via arXiv)

Dataset Composition:

29 million text tokens (50M target after tokenization)
24,000 images with captions
Focus on: process parameters, material properties, defect modes, quality metrics

Three-Stage Training Pipeline:

Text Domain-Adaptive Pretraining (DAPT): Unsupervised MLM on AM corpus; trains terminology and concept associations specific to manufacturing.

# Domain-adaptive pretraining: continuous pretraining on domain data
# Train masked language modeling on AM corpus
# This builds internal representations for:
# - Equipment types (LPBF, FDM, SLM, DMLS)
# - Material properties (viscosity, thermal conductivity, porosity)
# - Defect modes (layer adhesion, warping, spatter)

Image Domain-Adaptive Pretraining: Vision encoder fine-tuning on AM images; learns visual patterns specific to manufacturing artifacts and defects.
Visual Instruction Tuning: Supervised fine-tuning on (image, question, answer) triples extracted from papers; teaches model to answer AM-specific questions about images.

# Instruction tuning examples:
# Q: "What defects are visible in this LPBF part cross-section?"
# A: "Lack-of-fusion porosity in layers 5-7, surface roughness >5μm"
# Extracted from figure captions and supplementary materials

Training Details: LoRA rank-16 for efficiency; epochs tuned on validation set; separate loss weights for text and vision modalities.

Deployment Lessons

Lesson 1: Data Quality Over Quantity

29M tokens from curated peer-reviewed sources outperforms 100M tokens from web-scraped manufacturing forums
Academic papers provide accurate causal explanations; forums often contain myths about equipment behavior

Lesson 2: Image Diversity Matters

24K images sufficient for defect recognition if sourced from multiple equipment types, materials, and process parameters
Overfitting risk when training data heavily skewed toward single process (e.g., 80% LPBF); require balanced sampling

Lesson 3: Evaluation Must Be Domain-Aware

Standard NLU benchmarks (GLUE) irrelevant; create domain-specific benchmark:
- General AM knowledge (multiple-choice): 20 questions from textbooks
- Process parameter prediction: "Given part geometry and material, what laser power range?"
- Defect identification: Classification from images

Lesson 4: User Acceptance Requires Transparency

Manufacturing teams need explanations: "Layer 5 shows porosity because melt pool temperature likely dropped below X°C (detected via image features)."
Black-box accuracy isn't enough; trace predictions to training data examples

Practical Impact

Accessible Specialization: 29M tokens is feasible for domain teams to curate; previous domain-specific models required billions of tokens
Real-Time Defect Detection: Vision instruction tuning enables edge deployment for factory-floor QC
Continuous Improvement: Model can be re-trained quarterly as new process innovations appear in literature
Cost Reduction: Reduces reliance on expert technicians for initial defect triage

Generalization to Other Domains

Recipe applies to any technical domain with:

Established peer-reviewed literature (>1K papers)
Visual patterns (images, diagrams, schematics)
Specialized terminology and causal reasoning

Tested concept on biomedical imaging, materials science, semiconductor manufacturing.

同仓库更多 Skills

同仓库

meaningful-kebab-case-name

ADu2021/skillXiv

Convert arXiv papers into ready-to-use agent skills using category-aware extraction. First classifies the paper into one or more of 11 research categories, then applies a specialized extraction pipeline for each category — because different types of papers produce different types of usable knowledge. A single paper can yield multiple skills if it spans categories. Use this skill whenever the user wants to turn a paper into a skill, extract practical techniques from research, build a skill library from papers, convert arXiv papers into reusable agent instructions, or batch-process multiple papers into skills. Also trigger when someone asks about extracting actionable knowledge from papers, making research practical for LLM agents, or systematically converting academic contributions into structured agent capabilities.

2026-03-263

action-quantization-behavior-cloning

ADu2021/skillXiv

Establish regret bounds for behavior cloning with discretized actions combining statistical error and quantization error terms. Prove smoothness requirements for safe quantizer design, show that learning-based quantizers fail these requirements, and propose model-based augmentation to reduce error dependence from H² to H.

2026-03-263

adaptive-lora-personalized-ranks

ADu2021/skillXiv

Dynamically allocate LoRA ranks per-layer during fine-tuning instead of using fixed uniform ranks. Learn optimal rank for each layer and subject via variational framework with discretized exponential distribution, reducing memory footprint while maintaining fidelity and text-alignment.

2026-03-263

agentic-ai-intelligence-explosion

ADu2021/skillXiv

Future intelligence explosions will be plural, social, and entangled with humanity through distributed collaborative systems rather than singular superintelligence. Intelligence is inherently social, demanding infrastructure matching agent development; integrate governance, institutional frameworks, and constitutional checks across hierarchies of autonomous agents and human-AI centaurs in shifting configurations.

2026-03-263

animalclap-taxonomy-aware-pretraining

ADu2021/skillXiv

Build taxonomy-aware audio-text pretraining systems for species recognition from animal vocalizations. Train contrastive models that augment text prompts with hierarchical taxonomic structure (scientific/common names, phylogenetic sequences), evaluate on unseen species via rare-species test sets, and predict ecological traits directly from audio.

2026-03-263

bubblerag-evidence-driven-graphs

ADu2021/skillXiv

Address hallucinations in LLM QA over black-box knowledge graphs using evidence-driven retrieval. Formalize Optimal Informative Subgraph Retrieval and employ bubble expansion to discover candidate evidence graphs, achieving state-of-the-art multi-hop QA performance.

2026-03-263

来源

ADu2021

ADu2021/skillXiv

打开 GitHub 仓库查看创作者相关仓库

安装命令

下载

在 Manus 中运行

适用职业SOC

数据科学家计算机与数学类职业15-2051L4

name	additivellm2-domain-adaptation
title	AdditiveLLM2: Multi-Modal Language Models for Additive Manufacturing
version	0.0.3
engine	skillxiv-v0.0.3-claude-opus-4.6
license	MIT
url	https://arxiv.org/abs/2603.22017
keywords	["Domain Adaptation","Language Models","Additive Manufacturing","Knowledge Extraction","Visual Understanding"]
description	Adapt general LLMs to specialized manufacturing domains via domain-adaptive pretraining on open-access journals and visual instruction tuning. Extract 50M tokens and 24K images from peer-reviewed papers, achieve >90% accuracy on domain knowledge tasks, and enable real-time defect identification from manufacturing images.

AdditiveLLM2: Domain Adaptation for Manufacturing

Domain Problem: Knowledge Gap in General LLMs

Gap Analysis: General Gemma-3-12B achieves <50% accuracy on manufacturing knowledge tasks; domain-specific terminology is treated as out-of-vocabulary or incorrectly classified.

Source Method & Adaptation Recipe

Foundation Model: Gemma-3-12B (open-weights baseline)

Domain Data Curation: Extract text and images from 1,704 peer-reviewed papers across four open-access journals:

Journal of Additive Manufacturing
Rapid Prototyping Journal
Specialized AM conferences (via arXiv)

Dataset Composition:

29 million text tokens (50M target after tokenization)
24,000 images with captions
Focus on: process parameters, material properties, defect modes, quality metrics

Three-Stage Training Pipeline:

Text Domain-Adaptive Pretraining (DAPT): Unsupervised MLM on AM corpus; trains terminology and concept associations specific to manufacturing.

# Domain-adaptive pretraining: continuous pretraining on domain data
# Train masked language modeling on AM corpus
# This builds internal representations for:
# - Equipment types (LPBF, FDM, SLM, DMLS)
# - Material properties (viscosity, thermal conductivity, porosity)
# - Defect modes (layer adhesion, warping, spatter)

Image Domain-Adaptive Pretraining: Vision encoder fine-tuning on AM images; learns visual patterns specific to manufacturing artifacts and defects.
Visual Instruction Tuning: Supervised fine-tuning on (image, question, answer) triples extracted from papers; teaches model to answer AM-specific questions about images.

# Instruction tuning examples:
# Q: "What defects are visible in this LPBF part cross-section?"
# A: "Lack-of-fusion porosity in layers 5-7, surface roughness >5μm"
# Extracted from figure captions and supplementary materials

Training Details: LoRA rank-16 for efficiency; epochs tuned on validation set; separate loss weights for text and vision modalities.

Deployment Lessons

Lesson 1: Data Quality Over Quantity

29M tokens from curated peer-reviewed sources outperforms 100M tokens from web-scraped manufacturing forums
Academic papers provide accurate causal explanations; forums often contain myths about equipment behavior

Lesson 2: Image Diversity Matters

24K images sufficient for defect recognition if sourced from multiple equipment types, materials, and process parameters
Overfitting risk when training data heavily skewed toward single process (e.g., 80% LPBF); require balanced sampling

Lesson 3: Evaluation Must Be Domain-Aware

Standard NLU benchmarks (GLUE) irrelevant; create domain-specific benchmark:
- General AM knowledge (multiple-choice): 20 questions from textbooks
- Process parameter prediction: "Given part geometry and material, what laser power range?"
- Defect identification: Classification from images

Lesson 4: User Acceptance Requires Transparency

Manufacturing teams need explanations: "Layer 5 shows porosity because melt pool temperature likely dropped below X°C (detected via image features)."
Black-box accuracy isn't enough; trace predictions to training data examples

Practical Impact

Accessible Specialization: 29M tokens is feasible for domain teams to curate; previous domain-specific models required billions of tokens
Real-Time Defect Detection: Vision instruction tuning enables edge deployment for factory-floor QC
Continuous Improvement: Model can be re-trained quarterly as new process innovations appear in literature
Cost Reduction: Reduces reliance on expert technicians for initial defect triage

Generalization to Other Domains

Recipe applies to any technical domain with:

Established peer-reviewed literature (>1K papers)
Visual patterns (images, diagrams, schematics)
Specialized terminology and causal reasoning

Tested concept on biomedical imaging, materials science, semiconductor manufacturing.