datamol

Datamol for molecular manipulation, SMILES processing, and cheminformatics

Stars: 19

Forks: 2

Last updated: February 20, 2026 at 00:44

Source

omar-A-hassan

omar-A-hassan/medsci-agent

Useful for SOC

Data Scientists · Computer and Mathematical Occupations · 15-2051 · L4

SKILL.md

name	datamol
description	Datamol for molecular manipulation, SMILES processing, and cheminformatics

Datamol

Overview

Datamol is a lightweight Python library built on top of RDKit that simplifies molecular manipulation. It provides a clean API for SMILES parsing, standardization, fingerprints, scaffolds, and visualization.

Core Operations

import datamol as dm

# Parse and standardize SMILES
mol = dm.to_mol("CC(=O)Oc1ccccc1C(=O)O")
std_mol = dm.standardize_mol(mol)
smiles = dm.to_smiles(std_mol, canonical=True)

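# Fix and sanitize
bad = dm.to_mol("bad_smiles")       # returns None if the SMILES is invalid
raw = dm.to_mol("CC(=O)Oc1ccccc1C(=O)O", sanitize=False)  # parse without sanitizing
fixed = dm.fix_mol(raw)             # greedy repair of common errors
sanitized = dm.sanitize_mol(fixed)  # returns None if sanitization fails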
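
As an illustration of the parse, standardize, and write-back steps above, the following sketch removes duplicate molecules from a list by comparing canonical SMILES. The helper names unique_canonical and datamol_canonicalize are hypothetical and not part of datamol; only dm.to_mol, dm.standardize_mol, and dm.to_smiles come from the library. datamol is imported inside the helper so that the de-duplication logic can be exercised with any canonicalizer.

```python
from typing import Callable, Iterable, List, Optional


def unique_canonical(
    smiles_list: Iterable[str],
    canonicalize: Callable[[str], Optional[str]],
) -> List[str]:
    """Canonicalize each SMILES; drop failures and duplicates, keep first-seen order."""
    seen = set()
    unique = []
    for smi in smiles_list:
        canon = canonicalize(smi)
        if canon is None or canon in seen:
            continue  # unparseable input or a molecule already recorded
        seen.add(canon)
        unique.append(canon)
    return unique


def datamol_canonicalize(smiles: str) -> Optional[str]:
    """Hypothetical helper: parse, standardize, and re-serialize one SMILES."""
    import datamol as dm  # lazy import: only needed when this helper is called

    mol = dm.to_mol(smiles)
    if mol is None:  # dm.to_mol returns None for invalid SMILES
        return None
    return dm.to_smiles(dm.standardize_mol(mol))
```

Calling unique_canonical(["OCC", "CCO", "bad_smiles"], datamol_canonicalize) is expected to collapse the two spellings of ethanol into a single entry and skip the invalid string.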

Descriptors and Fingerprints

# Molecular properties
dm.descriptors.mw(mol)       # molecular weight
dm.descriptors.clogp(mol)    # cLogP
dm.descriptors.tpsa(mol)     # topological polar surface area
dm.descriptors.n_hba(mol)    # H-bond acceptors
dm.descriptors.n_hbd(mol)    # H-bond donors

# Fingerprints
fp = dm.to_fp(mol, fp_type="ecfp")  # numpy array by default (as_array=True); default size assumed to be 2048 bits
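
One common use of these descriptors is a quick drug-likeness screen. The sketch below counts Lipinski rule-of-five violations (molecular weight over 500, cLogP over 5, more than 10 H-bond acceptors, more than 5 H-bond donors). The Properties class and both function names are hypothetical, and the cLogP descriptor is assumed to be exposed as dm.descriptors.clogp.

```python
from dataclasses import dataclass


@dataclass
class Properties:
    mw: float    # molecular weight
    logp: float  # calculated logP
    n_hba: int   # H-bond acceptors
    n_hbd: int   # H-bond donors


def lipinski_violations(p: Properties) -> int:
    """Count how many of the four rule-of-five limits are exceeded."""
    return sum([p.mw > 500, p.logp > 5, p.n_hba > 10, p.n_hbd > 5])


def properties_from_smiles(smiles: str) -> Properties:
    """Hypothetical helper: compute the four descriptors with datamol."""
    import datamol as dm  # lazy import: only needed when this helper is called

    mol = dm.to_mol(smiles)
    if mol is None:  # descriptors need a valid molecule
        raise ValueError(f"could not parse SMILES: {smiles!r}")
    return Properties(
        mw=dm.descriptors.mw(mol),
        logp=dm.descriptors.clogp(mol),  # assumed name of the cLogP descriptor
        n_hba=dm.descriptors.n_hba(mol),
        n_hbd=dm.descriptors.n_hbd(mol),
    )
```

With datamol installed, lipinski_violations(properties_from_smiles("CC(=O)Oc1ccccc1C(=O)O")) scores the aspirin molecule used earlier.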

Key Details

Scaffolds: dm.to_scaffold_murcko(mol), dm.fragment.brics(mol).
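Invalid input: dm.to_mol returns None for SMILES it cannot parse, and dm.sanitize_mol returns None when sanitization fails, so check for None before passing a molecule to other functions.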
dm.to_smiles returns canonical SMILES by default.
Batch: dm.to_mol parses one SMILES string at a time; for a list, use a comprehension such as [dm.to_mol(s) for s in smiles_list].
Clustering: dm.cluster.cluster_mols(mols, cutoff=0.7).
Install: pip install datamol.
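
The scaffold function listed above can be combined with SMILES output to bucket a compound list by shared core. This is a minimal sketch: group_by_scaffold and murcko_scaffold_smiles are hypothetical names, and only dm.to_mol, dm.to_scaffold_murcko, and dm.to_smiles are datamol calls.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, List, Optional


def group_by_scaffold(
    smiles_list: Iterable[str],
    scaffold_of: Callable[[str], Optional[str]],
) -> Dict[str, List[str]]:
    """Group input SMILES by scaffold key; inputs whose key is None are skipped."""
    groups = defaultdict(list)
    for smi in smiles_list:
        key = scaffold_of(smi)
        if key is not None:
            groups[key].append(smi)
    return dict(groups)


def murcko_scaffold_smiles(smiles: str) -> Optional[str]:
    """Hypothetical helper: Murcko scaffold of one SMILES, written back as SMILES."""
    import datamol as dm  # lazy import: only needed when this helper is called

    mol = dm.to_mol(smiles)
    if mol is None:  # invalid SMILES has no scaffold
        return None
    # Acyclic molecules have an empty scaffold, so their key is an empty string.
    return dm.to_smiles(dm.to_scaffold_murcko(mol))
```

For example, group_by_scaffold(smiles_list, murcko_scaffold_smiles) returns a dictionary keyed by scaffold SMILES, which can be inspected before running a heavier step such as clustering.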

More from this repository

Same repository

operational-guardrails

omar-A-hassan/medsci-agent

Shared operational contract for all MedSci agents: sequential execution, planning phase, retry limits, evidence standards.

2026-03-08 · 19 stars

sandbox-execution

omar-A-hassan/medsci-agent

Isolated exploratory code execution with medsci-sandbox tools. Use when analysis requires custom code beyond existing domain MCP tools.

2026-03-02 · 19 stars

alphafold

omar-A-hassan/medsci-agent

AlphaFold DB for predicted protein structures and pLDDT confidence scores

2026-02-20 · 19 stars

biopython

omar-A-hassan/medsci-agent

Molecular biology toolkit. Use for FASTA parsing, sequence analysis, and translation.

2026-02-20 · 19 stars

chembl

omar-A-hassan/medsci-agent

ChEMBL database access for bioactivity data and target search

2026-02-20 · 19 stars

deepchem

omar-A-hassan/medsci-agent

Molecular ML with DeepChem - featurizers, models, and molecular property prediction

2026-02-20 · 19 stars
