تشغيل أي مهارة في Manus بنقرة واحدة

$pwd:

bulk-fastq-quantification

Name: Bulk Fastq Quantification
Author: omicverse

// End-to-end bulk RNA-seq quantification with omicverse's alignment module — SRA download, fastp QC, two interchangeable quantification paths (STAR + featureCount, OR alignment-free kb-python with technology='BULK'), and wiring into `ov.bulk.pyDEG` DESeq2. Single-cell kb-python (10XV2/10XV3) is out of scope — use the `single-cell-kb-alignment` skill instead.

تشغيل في Manus

$ git log --oneline --stat

stars:٤

forks:١

updated:١١ مايو ٢٠٢٦ في ١٠:٥٥

مستكشف الملفات

2 ملفات

SKILL.md

readonly

related-skills.json

نفس المستودع

omicverse-bulk-metabol-untargeted-lipidomics.md

from "omicverse/omicverse-skills"

Two adjacent LC-MS workflows on AnnData — (1) untargeted metabolomics with m/z-based peak annotation, mummichog pathway inference and adduct-ppm matching, and (2) lipidomics with LIPID MAPS shorthand parsing, lipid-class aggregation, and LION term enrichment. Use when converting `t_metabol_04_untargeted` or `t_metabol_05_lipidomics` into a reusable skill, when the input feature IDs encode `m/z`/`RT`, or when the var_names look like `PC 34:1` / `Cer d18:1/24:0` / `TAG 54:3`.

2026-05-164

bulk-rna-seq-differential-expression-with-omicverse.md

from "omicverse/omicverse-skills"

Bulk RNA-seq DEG pipeline: gene ID mapping, DESeq2 normalization, statistical testing, volcano plots, and pathway enrichment in OmicVerse.

2026-05-114

omicverse-single-cell-cellrank-fate.md

from "omicverse/omicverse-skills"

CellRank fate maps from RNA velocity. Combine VelocityKernel + ConnectivityKernel into a transition matrix, fit a GPCCA estimator, predict terminal states, and produce per-cell fate probabilities. Visualise with `ov.pl.branch_streamplot` and feed branch-resolved gene-trends into `ov.single.dynamic_features` / `ov.pl.dynamic_trends` / `ov.pl.dynamic_heatmap`. Use after RNA velocity is computed (scvelo / dynamo / latentvelo / graphvelo) and before reporting fate probabilities or marker dynamics.

2026-05-114

omicverse-single-cell-cnmf-program-discovery.md

from "omicverse/omicverse-skills"

Run OmicVerse single-cell NMF program discovery as a reusable, triggerable skill — both the classical Python `ov.single.cNMF` (consensus NMF with CPU/GPU factorization, K-selection, RFC labelling) and the Rust-backed `ov.single.NMF` (fast `nmf-rs` backend: dnmf default, Brunet-style K-selection with stability-drop auto-K, cNMF-style consensus heatmap, RFC labels). Use when fitting consensus NMF gene programs on single-cell AnnData, choosing K, building consensus, or converting normalized usage programs into hard cluster labels.

2026-05-114

omicverse-single-cell-monocle2-trajectory.md

from "omicverse/omicverse-skills"

Monocle2-style single-cell trajectory analysis on AnnData via the `ov.single.Monocle` class - DDRTree pseudotime + branch detection + per-gene differential test + BEAM branch-dependent gene discovery, plus the unified `ov.pl.trajectory` / `ov.pl.trajectory_overlay` / `ov.pl.trajectory_tree` plotters and the shared pseudotime visualisations (`branch_streamplot`, `dynamic_heatmap`, `dynamic_trends`). Use when fitting a Monocle2 trajectory on an annotated AnnData, when deriving branch-aware gene trends with `dynamic_features`, or when reproducing `t_traj_monocle2`.

2026-05-114

omicverse-single-cell-sctour-trajectory.md

from "omicverse/omicverse-skills"

Run the OmicVerse sctour trajectory branch on raw-count single-cell AnnData. Use when adapting the scTour part of an OmicVerse trajectory notebook, or when you need sctour pseudotime, latent space, or vector-field outputs instead of the diffusion_map, slingshot, or palantir branches.

2026-05-114

package.json

"author": "omicverse"

"repository": "omicverse/omicverse-skills"

فتح مستودع GitHub عرض مستودعات المنشئ

$ install --global

$ download --local

تشغيل في Manus

$ useful --forSOC

علماء البياناتمهن الحاسوب والرياضيات15-2051L4

import omicverse as ov # Variant 1 — SRR accession via prefetch + fqdump pre = ov.alignment.prefetch(['SRR1234567', 'SRR1234568'], output_dir='prefetch', jobs=4) fq = ov.alignment.fqdump(['SRR1234567', 'SRR1234568'], output_dir='fastq', sra_dir='prefetch', gzip=True, threads=8, jobs=4) # Variant 2 — direct .lite.1 link + parallel_fastq_dump (paired-end) ov.datasets.download_data( 'https://sra-downloadb.be-md.ncbi.nlm.nih.gov/.../SRR12544419.lite.1', dir='./data', ) ov.alignment.parallel_fastq_dump( sra_id='./data/SRR12544419.lite.1', outdir='./data/SRR12544419', tmpdir='./tmp', threads=12, split_files=True, gzip=True, )

samples = [ ('S1', 'fastq/SRR1234567/SRR1234567_1.fastq.gz', 'fastq/SRR1234567/SRR1234567_2.fastq.gz'), ('S2', 'fastq/SRR1234568/SRR1234568_1.fastq.gz', 'fastq/SRR1234568/SRR1234568_2.fastq.gz'), ] clean = ov.alignment.fastp(samples, output_dir='fastp', threads=8, jobs=2)

star_samples = [ ('S1', 'fastp/S1/S1_clean_1.fastq.gz', 'fastp/S1/S1_clean_2.fastq.gz'), ('S2', 'fastp/S2/S2_clean_1.fastq.gz', 'fastp/S2/S2_clean_2.fastq.gz'), ] bams = ov.alignment.STAR( star_samples, genome_dir='star_index', output_dir='star_out', gtf='genes.gtf', genome_fasta_files=['genome.fa'], threads=8, memory='50G', )

bam_items = [ ('S1', 'star_out/S1/Aligned.sortedByCoord.out.bam'), ('S2', 'star_out/S2/Aligned.sortedByCoord.out.bam'), ] counts = ov.alignment.featureCount( bam_items, gtf='genes.gtf', output_dir='counts', gene_mapping=True, merge_matrix=True, threads=8, ) # counts is a pandas DataFrame (gene_id × samples) — feed directly to pyDEG

ref_result = ov.alignment.single.ref( fasta_paths='genomes/Homo_sapiens.GRCh38.dna.primary_assembly.fa.gz', gtf_paths='genomes/Homo_sapiens.GRCh38.108.gtf.gz', index_path='kb_ref/index.idx', t2g_path='kb_ref/t2g.txt', cdna_path='kb_ref/cdna.fa', temp_dir='tmp', overwrite=False, )

for sra in ['SRR12544419', 'SRR12544421', 'SRR12544433', 'SRR12544435']: ov.alignment.count( fastq_paths=[ f'./data/{sra}/{sra}.lite.1_1.fastq.gz', f'./data/{sra}/{sra}.lite.1_2.fastq.gz', ], index_path='kb_ref/index.idx', t2g_path='kb_ref/t2g.txt', technology='BULK', output_path=f'results/{sra}/', h5ad=True, filter_barcodes=False, parity='paired', strand='unstranded', threads=12, )

ad_dict = {} for sra in ['SRR12544419', 'SRR12544421', 'SRR12544433', 'SRR12544435']: ad = ov.read(f'./results/{sra}/counts_unfiltered/adata.h5ad') gene_name = ov.pd.read_csv( f'./results/{sra}/counts_unfiltered/cells_x_genes.genes.names.txt', header=None, ) ad.var['gene_name'] = gene_name[0].tolist() ad.var['gene_id'] = ad.var.index ad.var.index = ad.var['gene_name'] ad.var_names_make_unique() ad.obs['sra'] = sra ad_dict[sra] = ad adata = ov.concat(ad_dict) adata.obs_names_make_unique() adata.obs['Group'] = ['no', 'no', 'yes', 'yes'] # phenotype labels counts = adata.to_df().T # gene × sample dds = ov.bulk.pyDEG(counts)

dds.drop_duplicates_index() result = dds.deg_analysis( treatment_groups=[...], # sample IDs in the treatment arm (matrix columns) control_groups=[...], # sample IDs in the control arm method='DEseq2', ) # Optional: filter low-expression genes after DE result = result.loc[result['log2(BaseMean)'] > 1] dds.foldchange_set(fc_threshold=-1, pval_threshold=0.05, logp_max=10) dds.plot_volcano(title='DEG Analysis', figsize=(4, 4), plot_genes_num=8, plot_genes_fontsize=12)

# All functions support these parameters: auto_install=True # Auto-install missing tools via conda/mamba overwrite=False # Skip if outputs already exist threads=8 # Per-tool thread count jobs=None # Concurrent job count (auto-detected from CPU count)

bulk-fastq-quantification

Overview

Instructions

Path A — STAR + featureCounts (alignment-based)

Path B — kb-python `technology='BULK'` (alignment-free)

Handoff to differential expression

Critical API Reference

Sample format convention (Path A)

Auto-installation

Examples

Related skills

References

Overview

Instructions

Path A — STAR + featureCounts (alignment-based)

Path B — kb-python `technology='BULK'` (alignment-free)

Handoff to differential expression

Critical API Reference

Sample format convention (Path A)

Auto-installation

Examples

Related skills

References

name	bulk-fastq-quantification
title	Bulk RNA-seq FASTQ → count matrix with omicverse
description	End-to-end bulk RNA-seq quantification with omicverse's alignment module — SRA download, fastp QC, two interchangeable quantification paths (STAR + featureCount, OR alignment-free kb-python with technology='BULK'), and wiring into `ov.bulk.pyDEG` DESeq2. Single-cell kb-python (10XV2/10XV3) is out of scope — use the `single-cell-kb-alignment` skill instead.

bulk-fastq-quantification

المزيد من هذا المستودع

المزيد من هذا المستودع

Overview

Instructions

Path A — STAR + featureCounts (alignment-based)

Path B — kb-python technology='BULK' (alignment-free)

Handoff to differential expression

Critical API Reference

Sample format convention (Path A)

Auto-installation

Examples

Related skills

References

Overview

Instructions

Path A — STAR + featureCounts (alignment-based)

Path B — kb-python technology='BULK' (alignment-free)

Handoff to differential expression

Critical API Reference

Sample format convention (Path A)

Auto-installation

Examples

Related skills

References

Path B — kb-python `technology='BULK'` (alignment-free)

Path B — kb-python `technology='BULK'` (alignment-free)