Run any Skill in Manus with one click

$pwd:

nextflow

Name: Nextflow
Author: K-Dense-AI

// Build, run, and debug Nextflow data pipelines and nf-core workflows end to end. Use whenever the user mentions Nextflow, nf-core, .nf files, nextflow.config, DSL2, processes/channels/operators, samplesheets, or wants to run a community pipeline (e.g. nf-core/rnaseq, nf-core/sarek), write or test a module/subworkflow with nf-test, configure executors/containers (Docker, Singularity/Apptainer, Conda, Wave), scale a workflow to HPC/SLURM or cloud (AWS Batch, Google Batch, Azure, Kubernetes), or debug a failed/-resume run. Make sure to use this skill for any reproducible scientific/bioinformatics workflow work even if the user does not say the word "Nextflow", and for authoring nf-core-compliant pipelines, modules, configs, and linting.

Run Skill in Manus

$ git log --oneline --stat

stars:26,663

forks:2,760

updated:May 28, 2026 at 19:46

File Explorer

8 files

SKILL.md

readonly

related-skills.json

same repository

scvi-tools.md

from "K-Dense-AI/scientific-agent-skills"

Deep generative models for single-cell omics. Use when you need probabilistic batch correction (scVI), transfer learning, differential expression with uncertainty, or multi-modal integration (TOTALVI, MultiVI). Best for advanced modeling, batch effects, multimodal data. For standard analysis pipelines use scanpy.

2026-05-2826.7k

scikit-bio.md

from "K-Dense-AI/scientific-agent-skills"

Biological data toolkit. Sequence analysis, alignments, phylogenetic trees, diversity metrics (alpha/beta, UniFrac), ordination (PCoA), PERMANOVA, FASTA/Newick I/O, for microbiome analysis.

2026-05-2826.7k

modal.md

from "K-Dense-AI/scientific-agent-skills"

Modal is a serverless cloud platform for running Python on demand, including on-demand GPUs. Use when deploying or serving AI/ML models, running GPU-accelerated workloads (training, fine-tuning, inference), serving web endpoints, scheduling batch jobs, or scaling Python code to cloud containers with the Modal SDK.

2026-05-2826.7k

bulk-rnaseq.md

from "K-Dense-AI/scientific-agent-skills"

End-to-end bulk RNA-seq orchestrator — takes raw FASTQ reads through QC and trimming (FastQC, fastp/Trim Galore), alignment and quantification (STAR, Salmon, featureCounts), assembles a gene-level counts matrix, then hands off to differential expression (pydeseq2), pathway/GSEA enrichment (pathway-enrichment), and publication figures (scientific-visualization). Use whenever the user has bulk RNA-seq reads or quant output and wants a complete, reproducible differential-expression workflow — e.g. "analyze my RNA-seq", "FASTQ to DESeq2", "run nf-core/rnaseq", "STAR/Salmon quantification", "build a counts matrix for DESeq2", or "go from reads to differentially expressed genes and enriched pathways". Routes between an nf-core/rnaseq (Nextflow) path and a standalone STAR/Salmon path, and covers experimental design, strandedness, and QC gates. For single-cell RNA-seq use the scanpy skill instead.

2026-05-2826.7k

pathway-enrichment.md

from "K-Dense-AI/scientific-agent-skills"

Run pathway and gene-set enrichment analysis on gene lists or ranked gene data, then interpret the results. Use whenever the user has a set of genes (differentially expressed genes from PyDESeq2/Scanpy, CRISPR-screen hits, cluster marker genes, proteomics hits) and wants to know which biological pathways, GO terms, or gene sets are over-represented or enriched. Covers over-representation analysis (ORA / Enrichr / Fisher / hypergeometric), ranked Gene Set Enrichment Analysis (GSEA / preranked), single-sample scoring (ssGSEA/GSVA), and functional profiling via gseapy, g:Profiler, Enrichr libraries, MSigDB, GO, KEGG, Reactome, and WikiPathways — plus gene-ID mapping, choosing the right background universe, multiple-testing correction, redundancy reduction, dotplots/enrichment maps, and publication-ready tables. Use this for "pathway analysis", "enrichment analysis", "GO enrichment", "KEGG/Reactome pathways", "GSEA", "over-representation", "functional annotation", or "what pathways are my genes in".

2026-05-2826.7k

dask.md

from "K-Dense-AI/scientific-agent-skills"

Distributed computing for larger-than-RAM pandas/NumPy workflows. Use when you need to scale existing pandas/NumPy code beyond memory or across clusters. Best for parallel file processing, distributed ML, integration with existing pandas code. For out-of-core analytics on single machine use vaex; for in-memory speed use polars.

2026-05-2726.7k

package.json

"author": "K-Dense-AI"

"repository": "K-Dense-AI/scientific-agent-skills"

View GitHub Repository View Creator Repositories

$ install --global

$ download --local

Run Skill in Manus

$ useful --forSOC

Software DevelopersComputer and Mathematical Occupations15-1252L4

name	nextflow
description	Build, run, and debug Nextflow data pipelines and nf-core workflows end to end. Use whenever the user mentions Nextflow, nf-core, .nf files, nextflow.config, DSL2, processes/channels/operators, samplesheets, or wants to run a community pipeline (e.g. nf-core/rnaseq, nf-core/sarek), write or test a module/subworkflow with nf-test, configure executors/containers (Docker, Singularity/Apptainer, Conda, Wave), scale a workflow to HPC/SLURM or cloud (AWS Batch, Google Batch, Azure, Kubernetes), or debug a failed/-resume run. Make sure to use this skill for any reproducible scientific/bioinformatics workflow work even if the user does not say the word "Nextflow", and for authoring nf-core-compliant pipelines, modules, configs, and linting.
license	Apache-2.0
metadata	{"version":"1.0","skill-author":"K-Dense Inc."}

Nextflow

Overview

Nextflow is a workflow language and runtime for building reproducible, portable, scalable data pipelines. It is dominant in bioinformatics but works for any data-heavy computation. nf-core is a community curating production-grade Nextflow pipelines, reusable modules, and the nf-core tooling on top of Nextflow.

Key ideas:

Dataflow programming: pipelines are process tasks connected by channels. Nextflow infers execution order and parallelism from data dependencies — there is no explicit scheduler to write.
Write once, run anywhere: the same pipeline runs locally, on HPC (SLURM, SGE, LSF, PBS), and on cloud (AWS Batch, Google Batch, Azure Batch, Kubernetes) by changing config/profiles, not code.
Reproducibility: per-task containers (Docker/Singularity/Apptainer/Conda/Wave) + -resume caching + pinned pipeline revisions.
DSL2 is the modern, required syntax: modular process/workflow/include definitions.

This skill covers both running existing pipelines and developing your own (Nextflow language + nf-core conventions, testing with nf-test, configuration, and deployment).

When to Use This Skill

Use this skill when the user wants to:

Run an nf-core or custom Nextflow pipeline, or debug a failing/resuming run.
Write or modify .nf scripts, nextflow.config, profiles, or nextflow_schema.json.
Author or test nf-core-style modules/subworkflows (main.nf, meta.yml, tests/, nf-test).
Configure executors, containers, or resources; scale to HPC or cloud.
Build a reproducible scientific/bioinformatics workflow (even if "Nextflow" is not named).
Understand processes, channels, operators, take/emit, publishDir, ext.args, meta maps.

Setup

Nextflow needs Bash and Java 17 or newer (17–25 supported). Verify with java -version.

# Install Nextflow (self-contained launcher)
curl -s https://get.nextflow.io | bash      # creates ./nextflow
sudo mv nextflow /usr/local/bin/             # put on PATH
nextflow info                                # verify

# Or via conda/bioconda (also gets a managed Java)
conda create -n nf -c bioconda -c conda-forge nextflow nf-core

# nf-core tools (Python) for creating/linting/running nf-core assets
pip install nf-core            # or: conda install -c bioconda nf-core
nf-core --version

Pin the engine for reproducibility: export NXF_VER=24.10.0 (use an [edge] release only if needed). For air-gapped/HPC, see references/running-pipelines.md (offline mode) and references/configuration.md.

Two Modes of Work

Decide which path the user is on — it changes everything:

Goal	Start here
Run an existing pipeline (nf-core or a `.nf` you were given)	`references/running-pipelines.md`
Develop a new pipeline / module / subworkflow	`references/language.md` + `references/developing.md`
Configure / scale (HPC, cloud, containers, resources)	`references/configuration.md` + `references/containers.md`
Test modules/pipelines	`references/testing.md`

Quick Start

Run an nf-core pipeline

Always smoke-test with the bundled test profile first; it uses tiny data and proves your environment works.

# 1. Confirm setup works (downloads pipeline + tiny test data)
nextflow run nf-core/rnaseq -profile test,docker --outdir results

# 2. Real run: pin a revision (-r), pick a container engine, pass inputs
nextflow run nf-core/rnaseq -r 3.14.0 \
  -profile docker \
  --input samplesheet.csv \
  --genome GRCh38 \
  --outdir results \
  -resume

-profile (single dash) selects bundled config profiles; combine them comma-separated, e.g. test,docker. Container/infra profiles (docker, singularity, conda) are mutually exclusive — pick one.
--input, --genome, --outdir (double dash) are pipeline parameters. nf-core pipelines take a samplesheet CSV, not loose files.
-resume reuses cached results from the last run. -r <version> pins a release for reproducibility.

Use nf-core pipelines launch <name> for an interactive, schema-validated way to build the command and a -params-file. See references/running-pipelines.md.

Write a minimal pipeline

#!/usr/bin/env nextflow

process SAYHELLO {
    tag "$greeting"
    publishDir "results", mode: 'copy'

    input:
    val greeting

    output:
    path "${greeting}.txt"

    script:
    """
    echo '$greeting world' > ${greeting}.txt
    """
}

workflow {
    channel.of('hello', 'bonjour', 'hola') | SAYHELLO
}

nextflow run main.nf            # add -resume on reruns

The full language (processes, channels, operators, DSL2 workflows with take/main/emit, modules) is in references/language.md.

Core Concepts at a Glance

Process: a unit of work that runs a script (Bash by default). Declares input:, output:, optional directives (resources, container, publishDir, tag, errorStrategy), and a script:/shell:/exec: block. Each task runs in its own isolated work directory (work/xx/yy…).
Channel: the async queues that connect processes. Queue channels are consumable streams; value channels hold a single reusable value. Created with factories like channel.of, channel.fromPath, channel.fromFilePairs, channel.value.
Operator: transforms/combines channels — map, filter, collect, groupTuple, join, combine, mix, flatten, branch, multiMap, splitCsv, view, set.
Workflow: composes processes. DSL2 workflows can declare take: (inputs), main: (logic), emit: (named outputs) and be included as subworkflows. The unnamed workflow {} is the entry point.
Module: a .nf file exposing processes/workflows via include { NAME } from './path' (supports as aliasing).
Configuration: nextflow.config sets params, process directives, executor, container engines, and named profiles. Selectors withName:/withLabel: target specific processes. See references/configuration.md.
meta map (nf-core): the convention of carrying a metadata map ([ id:'sample1', single_end:false ]) alongside files in input/output tuples so samples stay labeled through the pipeline. See references/developing.md.

nf-core tools CLI

nf-core tools (v3+) group subcommands under pipelines, modules, and subworkflows. (Bare forms like nf-core lint still work but warn — prefer the grouped form.)

Command	Purpose
`nf-core pipelines list`	List/search nf-core pipelines (`--json`, keywords)
`nf-core pipelines create`	Scaffold a new pipeline from the nf-core template
`nf-core pipelines launch <name>`	Interactive, schema-driven run command + params file
`nf-core pipelines download <name>`	Download pipeline + containers for offline/HPC use
`nf-core pipelines lint`	Lint a pipeline against nf-core standards (run in repo root)
`nf-core pipelines schema build`	Build/edit `nextflow_schema.json` via web GUI
`nf-core pipelines create-params-file <name>`	Generate a documented YAML params file
`nf-core pipelines bump-version` / `sync`	Bump version / sync with template updates
`nf-core modules list/info/install/update/remove`	Manage modules from nf-core/modules
`nf-core modules create` / `lint` / `test`	Author, lint, and nf-test a module
`nf-core modules patch` / `bump-versions`	Patch an installed module / bump tool versions
`nf-core subworkflows install/create/lint/test`	Same lifecycle for subworkflows

Full command reference, flags, and examples: references/nf-core-tools.md.

Essential `nextflow` CLI

Command	Purpose
`nextflow run <pipeline> -profile <p> --outdir <dir>`	Run a pipeline (path, `.nf`, or `user/repo`)
`-resume`	Reuse cached results from prior run
`-r <rev>`	Run a specific git revision/tag/branch
`-params-file params.yml`	Supply parameters from YAML/JSON
`-c custom.config`	Layer in an extra config file
`-with-report -with-trace -with-timeline -with-dag flow.html`	Execution report, trace, timeline, DAG
`-stub-run`	Run `stub:` blocks only (dry-run plumbing)
`nextflow log`	Inspect past runs
`nextflow clean -f -before <run>`	Delete old `work/` data
`nextflow pull / drop / list / info <repo>`	Manage cached remote pipelines

Config, executors, caching internals, and tracing details: references/configuration.md.

Best Practices (high-value habits)

Always test first: -profile test,docker (or singularity/conda) before real data — fast and catches environment problems.
Pin everything: pipeline revision (-r), NXF_VER, and tool versions (containers). Don't run latest for science you'll publish.
Use -resume and understand caching: a task re-runs if its inputs, script, or container change. See cache-debugging in references/configuration.md.
Parameterize via config/params-file, not hardcoded paths. Keep params and profiles in nextflow.config.
One container/conda env per process; never rely on tools installed on the host.
For nf-core dev: reuse existing modules (nf-core modules install) before writing new ones; pass tool flags through ext.args (not hardcoded in the script); always include a stub: block and nf-test tests; run nf-core pipelines lint and prettier before committing.
Right-size resources with process_low/medium/high labels and errorStrategy 'retry' with dynamic task.attempt scaling instead of one giant request.
Write forward-compatible syntax: the strict-syntax parser becomes the default in Nextflow 26.04. Prefer lowercase channel.of(...), explicit closure params ({ v -> ... }), def for all variables, and emit:-named outputs. Check with nextflow lint.

Reference Files

Read the relevant file when you need depth — each is self-contained:

references/language.md — DSL2 language: processes, directives, channels, operators, workflows (take/emit), modules, dynamic resources, error handling.
references/configuration.md — nextflow.config, scopes, profiles, withName/withLabel selectors, executors (local/SLURM/cloud), caching/-resume internals, tracing/reports, the nextflow CLI.
references/containers.md — Docker, Singularity/Apptainer, Podman, Conda, Wave containers; choosing and enabling engines; common gotchas.
references/running-pipelines.md — finding/running nf-core pipelines, samplesheets, params files, reference genomes (iGenomes), offline runs, institutional configs, Seqera Platform.
references/nf-core-tools.md — complete nf-core CLI reference (pipelines/modules/subworkflows), flags, and workflows.
references/developing.md — authoring nf-core pipelines & modules: template layout, module main.nf/meta.yml, meta maps, ext.args/modules.config, subworkflows, resource labels, linting & Harshil alignment style.
references/testing.md — nf-test for modules/subworkflows/pipelines: test structure, assertions, snapshots, tags, running tests, CI.

Official docs: Nextflow https://www.nextflow.io/docs/latest/ · nf-core https://nf-co.re/docs/ · Training https://training.nextflow.io/

nextflow

More from this repository

More from this repository

Nextflow

Overview

When to Use This Skill

Setup

Two Modes of Work

Quick Start

Run an nf-core pipeline

Write a minimal pipeline

Core Concepts at a Glance

nf-core tools CLI

Essential nextflow CLI

Best Practices (high-value habits)

Reference Files

Nextflow

Overview

When to Use This Skill

Setup

Two Modes of Work

Quick Start

Run an nf-core pipeline

Write a minimal pipeline

Core Concepts at a Glance

nf-core tools CLI

Essential nextflow CLI

Best Practices (high-value habits)

Reference Files

Essential `nextflow` CLI

Essential `nextflow` CLI