implementing-new-modules

Name: Implementing New Modules
Author: MultiQC

Create a new MultiQC module from scratch. Parse a bioinformatics tool's output, register search patterns and entry points, add general stats columns, build plots and sections, write tests, open a PR. Use when implementing a `module: new` GitHub issue, when the user asks to add support for a new tool, or when adding a parser for a new tool output format.

Stars: 1,455

Forks: 667

Updated: May 11, 2026 at 22:44

SKILL.md

Implement New MultiQC Module

Workflow

1. Research the tool: read its docs, get example output files (check MultiQC/test-data/data/modules/ first), find similar existing modules for reference. Note version- and flag-dependent output variations.
2. Pick architecture: single-tool or multi-subtool (see below).
3. Build the module: parser, general stats, sections, plots. Full step-by-step in implementation-checklist.md.
4. Register: add to search_patterns.yaml and pyproject.toml entry points.
5. Test: unit tests for parsers, integration test via pytest tests/test_modules_run.py -k "toolname" -v, and multiqc … --strict on test data.
6. Quality gate: prek run, ruff check, python .github/workflows/code_checks.py, mypy.
7. PR: brief summary, <details> for the full write-up, Closes #XXXX.
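
The "unit tests for parsers" step tends to be easiest when the parser is a pure function from file contents to a dict. The following is a minimal sketch under stated assumptions: the two-column report format, the function name parse_toolname_report, and the test names are all invented for illustration and are not part of MultiQC. It uses only the standard library, so it runs without MultiQC installed.

```python
from typing import Dict


def parse_toolname_report(contents: str) -> Dict[str, float]:
    """Parse a hypothetical two-column 'key value' report into a dict.

    The format is invented for illustration. Columns may be separated by
    tabs or by runs of spaces.
    """
    parsed: Dict[str, float] = {}
    for line_number, line in enumerate(contents.splitlines(), start=1):
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split()  # no argument: splits on any run of whitespace
        if len(fields) != 2:
            raise ValueError(
                f"line {line_number}: expected 2 columns, got {len(fields)}"
            )
        key, value = fields
        parsed[key] = float(value)
    return parsed


def test_parses_tab_and_space_separated_input() -> None:
    tabbed = "# toolname report\ntotal_counts\t1000\npct_dup\t12.5\n"
    spaced = "# toolname report\ntotal_counts   1000\npct_dup   12.5\n"
    expected = {"total_counts": 1000.0, "pct_dup": 12.5}
    assert parse_toolname_report(tabbed) == expected
    assert parse_toolname_report(spaced) == expected


def test_malformed_line_raises() -> None:
    try:
        parse_toolname_report("total_counts 1000 extra\n")
    except ValueError:
        return
    raise AssertionError("expected ValueError for a three-column line")
```

In a real module the parser would receive the file contents that MultiQC hands over for each matched log file. Keeping the parsing logic free of module state is a design choice that lets tests like these run on small inline strings.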

Architecture decision

Single-tool (FastQC, Qualimap): one output format, one parser.

multiqc/modules/toolname/
├── __init__.py
└── toolname.py

Multi-subtool (samtools, seqkit, picard): distinct subcommands with different output formats, or more subcommands likely to be added.

multiqc/modules/toolname/
├── __init__.py
├── toolname.py        # Orchestrator
├── subtool1.py        # parse_toolname_subtool1() function
├── subtool2.py        # parse_toolname_subtool2() function
└── tests/

Class skeletons and full file templates: module-structure.md. Patterns for code inside a module (parsing, plots, alerts, etc.): code-patterns.md.
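
As a rough illustration of the multi-subtool layout, the sketch below puts the orchestrator and two per-subtool parse functions in one file so that it runs on its own. It is an assumption-laden outline, not the templates from module-structure.md: the log formats, the Logs and SampleData aliases, SUBTOOL_PARSERS, and run_toolname are hypothetical, and ModuleNoSamplesFound is a local stand-in for the MultiQC exception of the same name.

```python
from typing import Callable, Dict

Logs = Dict[str, str]                      # sample name -> file contents
SampleData = Dict[str, Dict[str, float]]   # sample name -> parsed metrics


class ModuleNoSamplesFound(Exception):
    """Local stand-in for the MultiQC exception of the same name."""


def parse_toolname_subtool1(logs: Logs) -> SampleData:
    """Would live in subtool1.py. Invented format: one 'reads=<number>' line."""
    return {s: {"reads": float(text.split("=")[1])} for s, text in logs.items()}


def parse_toolname_subtool2(logs: Logs) -> SampleData:
    """Would live in subtool2.py. Invented format: a single percentage."""
    return {
        s: {"pct_dup": float(text.strip().rstrip("%"))}
        for s, text in logs.items()
    }


# Adding a subcommand later means one new file plus one new entry here.
SUBTOOL_PARSERS: Dict[str, Callable[[Logs], SampleData]] = {
    "subtool1": parse_toolname_subtool1,
    "subtool2": parse_toolname_subtool2,
}


def run_toolname(found_logs: Dict[str, Logs]) -> Dict[str, SampleData]:
    """Orchestrator (toolname.py): run every subtool parser, collect results."""
    results: Dict[str, SampleData] = {}
    for subtool, parser in SUBTOOL_PARSERS.items():
        # A subtool having no logs at all is genuinely optional.
        data = parser(found_logs.get(subtool, {}))
        if data:
            results[subtool] = data
    if not results:
        raise ModuleNoSamplesFound
    return results
```

For example, run_toolname({"subtool1": {"s1": "reads=42\n"}}) returns results for subtool1 only, while an empty input raises the stand-in exception. In the real layout the orchestrator would be the module class and each parse function would also add its own sections and plots.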

Common Pitfalls

Forgetting add_software_version() — required by linting, even if version is None.
Calling write_data_file() too early — must be at end, after all sections.
Raising UserWarning instead of ModuleNoSamplesFound.
Not handling both tab- and space-separated output when both are valid.
Hardcoding values instead of using f["s_name"] and other dynamic variables.
Manually cleaning sample names instead of self.clean_s_name().
Inappropriate colour scales — e.g. RdYlGn for GC% (which is not "higher is better").
Silently defaulting on known fields — parsed.get(key, 0) for fields the tool always emits (or try/except: return {} over the whole parse) hides real format breakage behind a fake-looking report. Access documented keys directly; reserve .get(default) for genuinely optional fields. Catching a parse error to raise a friendlier message is fine; silently producing zeros is not.
Trivial single-statement helpers — a helper that wraps one or two lines, or just renames a one-liner, adds indirection without aiding readability. Per-section / per-parser helpers (_add_adapter_section, _parse_log) are fine and often clearer; one-liner wrappers (_add_filtered_section calling add_section with no real logic) are not.
Using raw parsed dict keys in user-facing text — total_counts and pct_dup belong in code, never in plot/column titles, axis labels, or section names. Convert to "Total Counts", "% Duplicates".
Dropping the whole section when all samples are zero — keep the section, pass plot=None, and add a SectionAlert via the alerts= parameter on add_section() listing affected samples. Don't append raw <div class="alert ..."> HTML to description — use the alerts API.
Pre-filtering samples at parse time — keep every sample in the main data dict so write_data_file is complete; filter at plot-render time.
Em-dashes (—) in any user-facing text — descriptions, docstrings, alerts, PR text. AI tell. Use commas or split sentences.
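
Three of the pitfalls above (silent defaults, raw dict keys in titles, and pre-filtering samples) can be sketched together in plain Python. This is an illustrative outline only: REQUIRED_KEYS, OPTIONAL_KEYS, TITLES, the adapter_pct field, and the three helper functions are hypothetical names, and no MultiQC API is called.

```python
from typing import Dict

REQUIRED_KEYS = ("total_counts", "pct_dup")   # fields the tool always emits
OPTIONAL_KEYS = ("adapter_pct",)              # hypothetical optional field

# Human-readable titles live apart from the raw keys used in code.
TITLES = {
    "total_counts": "Total Counts",
    "pct_dup": "% Duplicates",
    "adapter_pct": "% Adapter",
}


def build_row(parsed: Dict[str, float]) -> Dict[str, float]:
    """Copy known fields; a missing required key raises KeyError loudly."""
    row = {key: parsed[key] for key in REQUIRED_KEYS}  # no .get(key, 0) here
    for key in OPTIONAL_KEYS:
        if key in parsed:
            row[key] = parsed[key]
    return row


def column_titles(row: Dict[str, float]) -> Dict[str, str]:
    """Map each raw key in a row to the title shown to the user."""
    return {key: TITLES[key] for key in row}


def samples_to_plot(data: Dict[str, Dict[str, float]]) -> Dict[str, Dict[str, float]]:
    """Filter at plot time only; the full data dict stays complete."""
    return {s: row for s, row in data.items() if row["total_counts"] > 0}


all_data = {
    "sample_a": build_row({"total_counts": 1000.0, "pct_dup": 12.5}),
    "sample_b": build_row({"total_counts": 0.0, "pct_dup": 0.0}),
}
plot_data = samples_to_plot(all_data)  # sample_b is hidden from the plot only
```

Here all_data still contains both samples and is what would be written out, while plot_data holds only sample_a. Calling build_row({"pct_dup": 1.0}) raises KeyError rather than reporting a fabricated zero.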

related-skills.json

same repository

reviewing-prs.md

from "MultiQC/MultiQC"

Review MultiQC pull requests or local branch diffs. Scan changes for anti-patterns against the project's rules, check PR hygiene (issue link, test data, screenshots), and produce a structured comment with severity-tagged findings. Use when the user asks to review a MultiQC PR, asks for feedback on a branch/diff, runs `gh pr view`/`gh pr diff`, or wants a code review of MultiQC changes (new modules, plot tweaks, core refactors, docs, anything).

2026-05-11 · 1.5k

triaging-module-requests.md

from "MultiQC/MultiQC"

Triage MultiQC `module: new` GitHub issues: calculate 0-100 priority scores, apply priority labels, post analysis comments with score breakdowns, and give contributors actionable feedback to improve their request. Use when a new `module: new` issue is opened, when a user comments `@claude analyze-module` on a request, during weekly bulk triage, or when manually re-evaluating a module request.

2026-05-11 · 1.5k

package.json

"author": "MultiQC"

"repository": "MultiQC/MultiQC"
