genomics-analysis

Orchestrates a genomics analysis workflow from gene query through expression analysis to pathway enrichment. Use when investigating gene function, analyzing expression data, or performing pathway-level interpretation. NOT for pure protein structure modeling or drug-target interaction analysis.

Stars: 850

Forks: 98

Updated: March 12, 2026 at 04:53

Source

beita6969

beita6969/ScienceClaw

Related occupations (based on SOC occupation classification)

Biochemists and Biophysicists · Life, Physical, and Social Science Occupations · SOC 19-1021

SKILL.md

name	genomics-analysis
description	Orchestrates a genomics analysis workflow from gene query through expression analysis to pathway enrichment. Use when investigating gene function, analyzing expression data, or performing pathway-level interpretation. NOT for pure protein structure modeling or drug-target interaction analysis.
metadata	{"openclaw":{"emoji":"🧬"}}

Genomics Analysis (Meta Skill)

This meta-skill coordinates a complete genomics analysis pipeline by integrating gene database queries, sequence analysis, expression profiling, and pathway enrichment into a single workflow. It combines three specialized skills (ncbi-entrez, biopython-bio, and scanpy-singlecell) to connect gene-level findings with systems-level interpretation.

Workflow

Step 1: Gene Information Retrieval

Query NCBI Entrez for comprehensive gene details including official nomenclature, genomic coordinates, transcript variants, and functional annotations. Retrieve orthologs across model organisms for evolutionary context. Pull known variants from ClinVar and dbSNP, noting pathogenic or pharmacogenomic associations. Collect linked references from PubMed for recent literature context.
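As a rough illustration of the retrieval step, the sketch below builds request URLs for the public NCBI E-utilities endpoints (`esearch.fcgi` and `esummary.fcgi`) using only the standard library. The helper names `build_gene_search_url` and `build_gene_summary_url` are hypothetical and are not part of the ncbi-entrez skill; the sketch only constructs URLs and does not send any request.

```python
from urllib.parse import urlencode

# Base address of the NCBI E-utilities service.
EUTILS_BASE = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils"


def build_gene_search_url(symbol: str, organism: str = "Homo sapiens", retmax: int = 5) -> str:
    """Build an ESearch URL that looks up a gene symbol in the NCBI Gene database.

    Hypothetical helper for illustration; the field tags [Gene Name] and
    [Organism] restrict the match to the symbol and the species.
    """
    term = f"{symbol}[Gene Name] AND {organism}[Organism]"
    params = {"db": "gene", "term": term, "retmode": "json", "retmax": retmax}
    return f"{EUTILS_BASE}/esearch.fcgi?{urlencode(params)}"


def build_gene_summary_url(gene_ids: list[str]) -> str:
    """Build an ESummary URL for one or more NCBI Gene IDs (hypothetical helper)."""
    params = {"db": "gene", "id": ",".join(gene_ids), "retmode": "json"}
    return f"{EUTILS_BASE}/esummary.fcgi?{urlencode(params)}"


if __name__ == "__main__":
    # TP53 is used purely as an example symbol.
    print(build_gene_search_url("TP53"))
    print(build_gene_summary_url(["7157"]))
```

Keeping URL construction separate from the network call makes the query easy to inspect and log before anything is fetched, which also helps with the reproducibility practices listed later in this document.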

Step 2: Sequence Analysis

Use BioPython to perform sequence-level analyses on retrieved gene and protein sequences:

Multiple sequence alignment of orthologs to identify conserved regions
Motif discovery in promoter regions or protein domains
Domain architecture mapping against Pfam/InterPro signatures
Codon usage analysis for expression optimization studies
Variant impact prediction based on conservation scores
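To make the idea of conservation scoring concrete, here is a minimal plain-Python sketch that computes Shannon entropy per column of an already-aligned set of sequences and reports the fully conserved, gap-free columns. It deliberately does not use BioPython, so it should be read as an illustration of the concept rather than the skill's actual implementation; the function names and the toy alignment are assumptions made for this example.

```python
import math
from collections import Counter


def column_entropy(aligned: list[str]) -> list[float]:
    """Return the Shannon entropy (in bits) of each alignment column.

    An entropy of 0 means every sequence has the same symbol in that column.
    Gap characters are counted as ordinary symbols.
    """
    if len({len(seq) for seq in aligned}) != 1:
        raise ValueError("all aligned sequences must have the same length")
    n = len(aligned)
    entropies = []
    for column in zip(*aligned):
        counts = Counter(column)
        entropies.append(sum(-(c / n) * math.log2(c / n) for c in counts.values()))
    return entropies


def conserved_positions(aligned: list[str], max_entropy: float = 0.0) -> list[int]:
    """Return 0-based indices of gap-free columns with entropy <= max_entropy."""
    entropies = column_entropy(aligned)
    return [
        i
        for i, column in enumerate(zip(*aligned))
        if "-" not in column and entropies[i] <= max_entropy
    ]


if __name__ == "__main__":
    # Toy alignment invented for illustration; not real ortholog data.
    toy = ["MKTAY", "MKSAY", "MRTAY", "MKTA-"]
    print(conserved_positions(toy))
```

A variant that falls in a low-entropy column is a natural candidate for closer inspection, which is the intuition behind conservation-based impact prediction.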

Step 3: Expression Analysis

Apply scanpy for expression data analysis, supporting both single-cell and bulk RNA-seq workflows:

For single-cell: quality control, normalization, clustering, marker gene identification, cell type annotation
For bulk: differential expression analysis, volcano plots, heatmaps
Cross-dataset comparison when multiple conditions are available
Identification of co-expressed gene modules
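The bulk branch above rests on two simple quantities: library-size normalization and fold change. The sketch below shows both in plain Python, assuming raw counts are held in dictionaries keyed by gene. It is not scanpy code and performs no statistical testing; a real differential expression analysis needs replicates and a proper model. The function names, the pseudocount of 1, and the example counts are all assumptions for illustration.

```python
import math


def cpm(counts: dict[str, int]) -> dict[str, float]:
    """Convert raw read counts for one sample to counts per million (CPM)."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sample has no reads")
    return {gene: count * 1_000_000 / total for gene, count in counts.items()}


def log2_fold_change(
    treated: dict[str, float],
    control: dict[str, float],
    pseudocount: float = 1.0,
) -> dict[str, float]:
    """Return log2(treated / control) per gene present in both samples.

    The pseudocount avoids division by zero for unexpressed genes.
    """
    return {
        gene: math.log2((treated[gene] + pseudocount) / (control[gene] + pseudocount))
        for gene in control
        if gene in treated
    }


if __name__ == "__main__":
    # Invented counts for three placeholder genes.
    control_counts = {"GENE_A": 100, "GENE_B": 300, "GENE_C": 600}
    treated_counts = {"GENE_A": 400, "GENE_B": 300, "GENE_C": 300}
    lfc = log2_fold_change(cpm(treated_counts), cpm(control_counts))
    for gene, value in lfc.items():
        print(gene, round(value, 2))
```

Values like these form the x-axis of a volcano plot; the y-axis comes from the adjusted p-values of whichever statistical test is applied.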

Step 4: Pathway Enrichment and Functional Annotation

Map differentially expressed or co-expressed genes to biological pathways:

KEGG pathway mapping for metabolic and signaling context
Gene Ontology enrichment (biological process, molecular function, cellular component)
Reactome pathway analysis for detailed mechanistic understanding
Network-based enrichment to identify hub genes and regulatory modules
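One common way to test a gene list against a pathway is over-representation analysis with a hypergeometric test. The source does not say which enrichment statistic the workflow uses, so the following is only a sketch of that one technique, written with the standard library. The function name and parameter names are hypothetical.

```python
from math import comb


def hypergeom_enrichment_p(
    population: int,
    pathway_size: int,
    sample_size: int,
    overlap: int,
) -> float:
    """Return P(X >= overlap) under the hypergeometric distribution.

    population   -- number of background genes
    pathway_size -- background genes annotated to the pathway
    sample_size  -- genes in the query list (for example, DE genes)
    overlap      -- query genes that are annotated to the pathway
    """
    upper = min(pathway_size, sample_size)
    total = comb(population, sample_size)
    # Sum the probability mass for every overlap at least as large as observed.
    tail = sum(
        comb(pathway_size, k) * comb(population - pathway_size, sample_size - k)
        for k in range(overlap, upper + 1)
    )
    return tail / total


if __name__ == "__main__":
    # Small worked case: 10 background genes, 4 in the pathway,
    # 3 query genes, 2 of which are in the pathway.
    print(hypergeom_enrichment_p(10, 4, 3, 2))
```

In the small case above, the tail is (6·6 + 4·1) / 120 = 1/3. With many pathways tested at once, these raw p-values still need multiple testing correction before they are ranked.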

Step 5: Integrated Report Generation

Compile findings into a structured report with:

Gene summary card with key identifiers and annotations
Sequence conservation highlights and domain maps
Expression analysis results with statistical summaries
Enriched pathways ranked by significance
Key findings synthesis connecting sequence, expression, and pathway data
Publication-ready figures and supplementary tables

Integration Points

ncbi-entrez -- Gene records, variant data, orthologs, literature links
biopython-bio -- Sequence alignment, motif search, domain analysis, format conversion
scanpy-singlecell -- Expression quantification, clustering, differential expression, visualization

Output Formats

Gene card: Symbol, aliases, genomic location, function summary, disease associations
Alignment view: Conserved regions highlighted across orthologs
Expression summary: DE gene lists with fold change, p-values, FDR
Pathway table: Enriched pathways with gene counts, p-values, leading-edge genes
Figures: Heatmaps, volcano plots, UMAP embeddings, pathway diagrams
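The pathway table could be represented as a small record type and serialized to tab-separated text, sorted so the most significant pathways come first. This is a sketch only: the `PathwayRow` fields are an assumed subset of the columns listed above (leading-edge genes are omitted), and the pathway names in the example are placeholders.

```python
import csv
import io
from dataclasses import astuple, dataclass, fields


@dataclass
class PathwayRow:
    """One row of a pathway table (hypothetical layout)."""
    pathway: str
    gene_count: int
    p_value: float
    fdr: float


def pathway_table_tsv(rows: list[PathwayRow]) -> str:
    """Serialize rows as TSV with a header, ranked by ascending FDR."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter="\t", lineterminator="\n")
    writer.writerow([f.name for f in fields(PathwayRow)])
    for row in sorted(rows, key=lambda r: r.fdr):
        writer.writerow(astuple(row))
    return buffer.getvalue()


if __name__ == "__main__":
    # Placeholder rows; the values are invented for illustration.
    rows = [
        PathwayRow("pathway_B", 5, 0.01, 0.04),
        PathwayRow("pathway_A", 8, 0.001, 0.01),
    ]
    print(pathway_table_tsv(rows), end="")
```

A plain TSV like this is easy to archive alongside intermediate results and to load again for downstream re-analysis.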

Best Practices

Start with gene identifiers from a reliable source (NCBI Gene ID or HGNC symbol)
Verify gene nomenclature across databases to avoid confusion from aliases
Use normalization appropriate to the expression data type (for example, TPM or CPM for bulk RNA-seq, SCTransform for single-cell data)
Apply multiple testing correction (Benjamini-Hochberg) for all enrichment analyses
Set biologically meaningful fold-change thresholds alongside statistical cutoffs
Include both up- and down-regulated gene sets in pathway analysis
Cross-reference pathway results with known biology to filter spurious enrichments
Report effect sizes and confidence intervals, not just p-values
Note species differences when translating findings from model organisms
Archive intermediate results for reproducibility and downstream re-analysis
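Two of the practices above, Benjamini-Hochberg correction and pairing a fold-change threshold with a statistical cutoff, can be sketched in a few lines of plain Python. The function names are hypothetical, and the default thresholds (absolute log2 fold change of 1.0, adjusted p-value of 0.05) are illustrative assumptions, not values prescribed by the source.

```python
def benjamini_hochberg(p_values: list[float]) -> list[float]:
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value to the smallest so that each adjusted
    # value is never larger than the one ranked after it.
    for rank in range(m, 0, -1):
        index = order[rank - 1]
        running_min = min(running_min, p_values[index] * m / rank)
        adjusted[index] = running_min
    return adjusted


def passes_thresholds(
    log2fc: float,
    q_value: float,
    min_abs_log2fc: float = 1.0,
    alpha: float = 0.05,
) -> bool:
    """Require both a meaningful effect size and statistical significance."""
    return abs(log2fc) >= min_abs_log2fc and q_value <= alpha


if __name__ == "__main__":
    raw = [0.01, 0.04, 0.03, 0.005]
    print(benjamini_hochberg(raw))
```

For the four example p-values, the adjusted values work out to approximately 0.02, 0.04, 0.04, and 0.02 in input order, since each p-value is scaled by m / rank and then capped by the adjusted value of the next-ranked test.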
