一键导入
comparative-genomics-agent
Compare a gene across multiple species — find orthologs, retrieve sequences, compute alignments, and summarize conservation
用 Codex 或 Claude 帮你安装 复制这段 Prompt,粘贴到 Codex、Claude 或其他助手里,让它检查 Skill 页面并帮你完成安装。
菜单
Compare a gene across multiple species — find orthologs, retrieve sequences, compute alignments, and summarize conservation
用 Codex 或 Claude 帮你安装 复制这段 Prompt,粘贴到 Codex、Claude 或其他助手里,让它检查 Skill 页面并帮你完成安装。
基于 SOC 职业分类
Full clinical variant workup — gnomAD population frequency, ClinVar significance, protein domain impact, AlphaFold structure context, and PubMed literature
Druggability assessment — protein function, 3D structures, ligand-bound PDBs, binding sites, interaction network, disease associations, and literature
Functional analysis of a gene list — batch summaries, pathway mapping, protein interactions, tissue expression, and phenotype associations
Generate a comprehensive gene report by combining data from NCBI, Ensembl, UniProt, ClinVar, PDB, InterPro, STRING, and KEGG
Research session lab notebook — start, annotate, update, report, or check status
Search PubMed for relevant biomedical literature on a gene, variant, disease, or topic and produce a structured literature summary
| name | comparative-genomics-agent |
| description | Compare a gene across multiple species — find orthologs, retrieve sequences, compute alignments, and summarize conservation |
Perform a multi-species comparative genomics analysis for: $ARGUMENTS
Use the MCP tools available to you to find orthologs, retrieve sequences, align them, and report on conservation. Follow the steps below in order. If a step fails for a particular species, note the gap and continue with the remaining species.
Parse the user input to identify:
TP53), Ensembl gene ID (e.g., ENSG00000141510), or NCBI Gene IDIf the user says "across vertebrates" or similar, use: human, mouse, zebrafish, chicken (4 representative species). If no species are specified, default to: human, mouse, zebrafish.
ensembl_lookup_gene with the symbol and species homo_sapiens (or the first species listed) to get the Ensembl gene ID.datasets_summary_gene with the gene symbol (taxon: human) for NCBI gene metadata (full name, summary).ensembl_get_homologs with the reference Ensembl gene ID, homology_type: "orthologues".For the reference gene and each ortholog found:
ensembl_get_sequence with the Ensembl gene ID, seq_type: "protein", format: "json".If a protein sequence is not available for a gene ID, try the canonical transcript ID instead.
For each ortholog protein sequence, align it against the reference (human) protein:
sequence_align with the two protein sequences, sequence_type: "protein", mode: "global".If there are 3+ species, also consider one key pairwise comparison between distant species (e.g., mouse vs zebrafish) to show the range of divergence.
For the reference protein:
sequence_stats with the protein sequence to get molecular weight, amino acid composition.interpro_get_domains with the UniProt accession (if known) or look up via uniprot_search for the reference gene.Present the analysis as a structured comparative genomics report:
# Comparative Genomics Report: [GENE SYMBOL]
## Gene Overview
- Full name, function summary (from NCBI/Ensembl)
- Reference species and Ensembl gene ID
- Number of species analyzed
## Ortholog Summary
| Species | Ensembl Gene ID | Protein Length | % Identity to [Reference] | % Positives |
|---------|-----------------|----------------|---------------------------|-------------|
| Human (reference) | ENSG... | 393 aa | — | — |
| Mouse | ENSMUSG... | 390 aa | 77.8% | 85.2% |
| Zebrafish | ENSDARG... | 373 aa | 52.1% | 66.3% |
## Pairwise Alignments
For each species pair aligned:
- **[Reference] vs [Species]**: X% identity, Y gaps, alignment length Z
- Key observations: conserved regions, notable insertions/deletions
## Conservation Analysis
Summarize the overall conservation pattern:
- Which regions are most conserved (relate to known domains if domain data was retrieved)
- Which regions show the most divergence
- Overall trend: is this gene highly conserved, moderately conserved, or rapidly evolving?
- Note any species-specific insertions or deletions
## Functional Domain Context
If domain data was retrieved:
- List key domains with positions
- Note whether these domains span the conserved regions
## Evolutionary Insights
Brief interpretation:
- What does the conservation pattern suggest about functional constraints?
- Are there species-specific adaptations visible in the sequence differences?
- How does the conservation level compare to expectations for this gene family?
## Data Sources
List which databases were queried and whether each returned data successfully for each species.
Keep the report factual — only include data returned by the tools. Do not hallucinate sequences, identity scores, or ortholog relationships. If alignment data is unavailable for a species, note "Alignment not performed — sequence unavailable."