Run any Skill in Manus with one click

text-based-molecule-editing

Modify molecules based on natural language descriptions using MolT5/BioT5 models. Use this skill when: (1) User wants to modify a molecule to improve specific properties (solubility, potency, etc.), (2) User provides a molecule and asks to "make it more X" or "improve Y", (3) User wants to generate molecule variants guided by text descriptions. Triggers on phrases like "modify this molecule", "edit the molecule", "make it more soluble", "improve drug-likeness", "change the molecule to", "optimize this compound".

Run Skill in Manus

Overview

Install command

npx skills add https://github.com/PharMolix/OpenBioMed --skill text-based-molecule-editing

Copy and paste this command into Claude Code to install the skill

Source

PharMolix/OpenBioMed

Stars1,071

Forks129

UpdatedMarch 19, 2026 at 13:31

File Explorer

4 files

SKILL.md

readonly

name	text-based-molecule-editing
description	Modify molecules based on natural language descriptions using MolT5/BioT5 models. Use this skill when: (1) User wants to modify a molecule to improve specific properties (solubility, potency, etc.), (2) User provides a molecule and asks to "make it more X" or "improve Y", (3) User wants to generate molecule variants guided by text descriptions. Triggers on phrases like "modify this molecule", "edit the molecule", "make it more soluble", "improve drug-likeness", "change the molecule to", "optimize this compound".
license	MIT
category	drug-discovery
tags	["molecule-editing","text-guided","molecular-optimization","de-novo-design"]

Text-Based Molecule Editing

Modify molecular structures guided by natural language property descriptions.

When to Use

User wants to optimize a molecule for specific properties (solubility, binding, drug-likeness)
User provides a molecule and requests property-based modifications
User wants to explore structural variants guided by text descriptions

Workflow

Step 1: Prepare Input Molecule

from open_biomed.data import Molecule
from open_biomed.tools.tool_registry import TOOLS

# Option A: From molecule name (queries PubChem)
tool = TOOLS["molecule_name_request"]
result, _ = tool.run(accession="aspirin")
molecule = result[0]

# Option B: From SMILES directly
molecule = Molecule.from_smiles("CC(=O)Oc1ccccc1C(=O)O")

Step 2: Calculate Baseline Properties (Optional)

qed_tool = TOOLS["molecule_qed"]
logp_tool = TOOLS["molecule_logp"]
sa_tool = TOOLS["molecule_sa"]

qed, _ = qed_tool.run(molecule=molecule)
logp, _ = logp_tool.run(molecule=molecule)
sa, _ = sa_tool.run(molecule=molecule)

Step 3: Run Text-Based Editing

from open_biomed.core.pipeline import InferencePipeline
from open_biomed.data import Text

pipeline = InferencePipeline(
    task="text_based_molecule_editing",
    model="molt5",
    model_ckpt="./checkpoints/server/text_based_molecule_editing_biot5.ckpt",
    device="cuda:0"
)

outputs = pipeline.run(
    molecule=molecule,
    text=Text.from_str("This molecule should be more soluble in water"),
)
edited_molecule = outputs[0][0]

Step 4: Compare Properties

qed_new, _ = qed_tool.run(molecule=edited_molecule)
logp_new, _ = logp_tool.run(molecule=edited_molecule)

print(f"Original SMILES: {molecule.smiles}")
print(f"Edited SMILES: {edited_molecule.smiles}")
print(f"LogP change: {logp[0]:.2f} → {logp_new[0]:.2f}")

Expected Outputs

Step	Output	Description
Step 1	`Molecule` object	Input molecule with SMILES
Step 2	`float` values	QED (0-1), LogP, SA scores
Step 3	`Molecule` object	Edited molecule with new structure
Step 4	Comparison	Before/after property summary

Interpretation Guide

LogP (Lipophilicity)

Value	Solubility	Interpretation
< 0	High water solubility	Very hydrophilic
0-2	Moderate	Good balance for oral drugs
2-5	Low water solubility	May need formulation help
> 5	Very lipophilic	Poor absorption likely

QED (Quantitative Estimate of Drug-likeness)

Value	Quality	Interpretation
> 0.7	Excellent	Highly drug-like
0.5-0.7	Good	Acceptable drug-likeness
0.3-0.5	Moderate	May need optimization
< 0.3	Poor	Significant liabilities

SA (Synthetic Accessibility)

Value	Difficulty	Interpretation
1-3	Easy	Straightforward synthesis
3-5	Moderate	Some challenges
5-7	Difficult	Complex synthesis needed
> 7	Very difficult	Likely impractical

Error Handling

Model Checkpoint Not Found

Symptom: FileNotFoundError for checkpoint file

Solution: Ensure checkpoint exists at ./checkpoints/server/text_based_molecule_editing_biot5.ckpt

import os
ckpt_path = "./checkpoints/server/text_based_molecule_editing_biot5.ckpt"
if not os.path.exists(ckpt_path):
    raise FileNotFoundError(f"Download checkpoint to: {ckpt_path}")

Invalid SMILES Output

Symptom: Model generates invalid SMILES string

Solution: The model returns None for invalid molecules. Try:

Rephrasing the edit prompt
Using beam search with more beams
Running multiple times for different outputs

CUDA Out of Memory

Symptom: RuntimeError: CUDA out of memory

Solution: Use CPU or smaller batch:

pipeline = InferencePipeline(
    task="text_based_molecule_editing",
    model="molt5",
    model_ckpt="./checkpoints/server/text_based_molecule_editing_biot5.ckpt",
    device="cpu"  # Fallback to CPU
)

Example

Input: aspirin
Prompt: "This molecule should be more soluble in water"

Original SMILES: CC(=O)Oc1ccccc1C(=O)O
Edited SMILES:   CC(=O)Oc1ccc(C(=O)O)cc1C(=O)O

Property Changes:
  LogP: 1.31 → 1.01 (-0.30, more soluble)
  QED:  0.55 → 0.59 (+0.04, better drug-likeness)
  SA:   1.58 → 1.81 (+0.23, slightly harder to synthesize)

Generate diverse druggable molecules for a given target or disease using OpenBioMed's AI-powered drug discovery tools. Use this skill when: (1) Generating drug candidates, molecules, or compounds for a target/disease, (2) Performing structure-based drug design or de novo drug design, (3) Finding or creating molecules that bind to a specific protein target, (4) Discovering potential drugs for a disease name, (5) Designing molecules with specific properties (LogP, QED, docking scores). The skill handles target identification, structure retrieval, molecule generation, and in silico evaluation.

2026-03-201.1k

drug-drug-interaction-analysis

PharMolix/OpenBioMed

Analyze potential drug-drug interactions (DDI) for up to 5 drugs using KEGG DDI database. Use this skill when: (1) Checking interactions between multiple medications, (2) Assessing DDI risk for drug combinations, (3) Understanding interaction mechanisms and severity, (4) Analyzing CYP enzyme involvement in DDIs.

2026-03-191.1k

target-drug-report

PharMolix/OpenBioMed

Generate comprehensive drug development progress reports for disease therapeutic targets. Use when user asks about target drug pipeline, clinical trials, or research progress. Triggers on phrases like "target report", "drug development progress", "clinical trial summary", "靶点报告", "药物研发进展", "竞品分析", "专利分析".

2026-03-191.1k

biomed-skill-creator

PharMolix/OpenBioMed

Create new biomedical skills or improve existing ones for the OpenBioMed toolkit. Use this skill when: (1) Creating a new skill from scratch, (2) Capturing a workflow as a reusable skill, (3) Automating a biomedical task, (4) Improving an existing skill. This skill guides through an interactive process: define intent → design workflow → validate with real data → iterate → evaluate.

2026-03-191.1k

chembl-query

PharMolix/OpenBioMed

Query ChEMBL database for bioactivity data on drug-like compounds. Use this skill when: (1) Finding compounds active against a protein target (target-based search), (2) Getting bioactivity profile for a molecule (molecule-based search), (3) Finding drugs for a disease indication (indication-based search).

2026-03-191.1k

Source

PharMolix

PharMolix/OpenBioMed

View GitHub Repository View Creator Repositories

Install command

Download

Run Skill in Manus

Useful forSOC

Mathematical Science Occupations, All OtherComputer and Mathematical Occupations15-2099L4

name	text-based-molecule-editing
description	Modify molecules based on natural language descriptions using MolT5/BioT5 models. Use this skill when: (1) User wants to modify a molecule to improve specific properties (solubility, potency, etc.), (2) User provides a molecule and asks to "make it more X" or "improve Y", (3) User wants to generate molecule variants guided by text descriptions. Triggers on phrases like "modify this molecule", "edit the molecule", "make it more soluble", "improve drug-likeness", "change the molecule to", "optimize this compound".
license	MIT
category	drug-discovery
tags	["molecule-editing","text-guided","molecular-optimization","de-novo-design"]

Text-Based Molecule Editing

Modify molecular structures guided by natural language property descriptions.

When to Use

User wants to optimize a molecule for specific properties (solubility, binding, drug-likeness)
User provides a molecule and requests property-based modifications
User wants to explore structural variants guided by text descriptions

Workflow

Step 1: Prepare Input Molecule

from open_biomed.data import Molecule
from open_biomed.tools.tool_registry import TOOLS

# Option A: From molecule name (queries PubChem)
tool = TOOLS["molecule_name_request"]
result, _ = tool.run(accession="aspirin")
molecule = result[0]

# Option B: From SMILES directly
molecule = Molecule.from_smiles("CC(=O)Oc1ccccc1C(=O)O")

Step 2: Calculate Baseline Properties (Optional)

qed_tool = TOOLS["molecule_qed"]
logp_tool = TOOLS["molecule_logp"]
sa_tool = TOOLS["molecule_sa"]

qed, _ = qed_tool.run(molecule=molecule)
logp, _ = logp_tool.run(molecule=molecule)
sa, _ = sa_tool.run(molecule=molecule)

Step 3: Run Text-Based Editing

from open_biomed.core.pipeline import InferencePipeline
from open_biomed.data import Text

pipeline = InferencePipeline(
    task="text_based_molecule_editing",
    model="molt5",
    model_ckpt="./checkpoints/server/text_based_molecule_editing_biot5.ckpt",
    device="cuda:0"
)

outputs = pipeline.run(
    molecule=molecule,
    text=Text.from_str("This molecule should be more soluble in water"),
)
edited_molecule = outputs[0][0]

Step 4: Compare Properties

qed_new, _ = qed_tool.run(molecule=edited_molecule)
logp_new, _ = logp_tool.run(molecule=edited_molecule)

print(f"Original SMILES: {molecule.smiles}")
print(f"Edited SMILES: {edited_molecule.smiles}")
print(f"LogP change: {logp[0]:.2f} → {logp_new[0]:.2f}")

Expected Outputs

Step	Output	Description
Step 1	`Molecule` object	Input molecule with SMILES
Step 2	`float` values	QED (0-1), LogP, SA scores
Step 3	`Molecule` object	Edited molecule with new structure
Step 4	Comparison	Before/after property summary

Interpretation Guide

LogP (Lipophilicity)

Value	Solubility	Interpretation
< 0	High water solubility	Very hydrophilic
0-2	Moderate	Good balance for oral drugs
2-5	Low water solubility	May need formulation help
> 5	Very lipophilic	Poor absorption likely

QED (Quantitative Estimate of Drug-likeness)

Value	Quality	Interpretation
> 0.7	Excellent	Highly drug-like
0.5-0.7	Good	Acceptable drug-likeness
0.3-0.5	Moderate	May need optimization
< 0.3	Poor	Significant liabilities

SA (Synthetic Accessibility)

Value	Difficulty	Interpretation
1-3	Easy	Straightforward synthesis
3-5	Moderate	Some challenges
5-7	Difficult	Complex synthesis needed
> 7	Very difficult	Likely impractical

Error Handling

Model Checkpoint Not Found

Symptom: FileNotFoundError for checkpoint file

Solution: Ensure checkpoint exists at ./checkpoints/server/text_based_molecule_editing_biot5.ckpt

import os
ckpt_path = "./checkpoints/server/text_based_molecule_editing_biot5.ckpt"
if not os.path.exists(ckpt_path):
    raise FileNotFoundError(f"Download checkpoint to: {ckpt_path}")

Invalid SMILES Output

Symptom: Model generates invalid SMILES string

Solution: The model returns None for invalid molecules. Try:

Rephrasing the edit prompt
Using beam search with more beams
Running multiple times for different outputs

CUDA Out of Memory

Symptom: RuntimeError: CUDA out of memory

Solution: Use CPU or smaller batch:

pipeline = InferencePipeline(
    task="text_based_molecule_editing",
    model="molt5",
    model_ckpt="./checkpoints/server/text_based_molecule_editing_biot5.ckpt",
    device="cpu"  # Fallback to CPU
)

Example

Input: aspirin
Prompt: "This molecule should be more soluble in water"

Original SMILES: CC(=O)Oc1ccccc1C(=O)O
Edited SMILES:   CC(=O)Oc1ccc(C(=O)O)cc1C(=O)O

Property Changes:
  LogP: 1.31 → 1.01 (-0.30, more soluble)
  QED:  0.55 → 0.59 (+0.04, better drug-likeness)
  SA:   1.58 → 1.81 (+0.23, slightly harder to synthesize)

text-based-molecule-editing

Text-Based Molecule Editing

When to Use

Workflow

Step 1: Prepare Input Molecule

Step 2: Calculate Baseline Properties (Optional)

Step 3: Run Text-Based Editing

Step 4: Compare Properties

Expected Outputs

Interpretation Guide

LogP (Lipophilicity)

QED (Quantitative Estimate of Drug-likeness)

SA (Synthetic Accessibility)

Error Handling

Model Checkpoint Not Found

Invalid SMILES Output

CUDA Out of Memory

Example

See Also

Text-Based Molecule Editing

When to Use

Workflow

Step 1: Prepare Input Molecule

Step 2: Calculate Baseline Properties (Optional)

Step 3: Run Text-Based Editing

Step 4: Compare Properties

Expected Outputs

Interpretation Guide

LogP (Lipophilicity)

QED (Quantitative Estimate of Drug-likeness)

SA (Synthetic Accessibility)

Error Handling

Model Checkpoint Not Found

Invalid SMILES Output

CUDA Out of Memory

Example

See Also

text-based-molecule-editing

Text-Based Molecule Editing

When to Use

Workflow

Step 1: Prepare Input Molecule

Step 2: Calculate Baseline Properties (Optional)

Step 3: Run Text-Based Editing

Step 4: Compare Properties

Expected Outputs

Interpretation Guide

LogP (Lipophilicity)

QED (Quantitative Estimate of Drug-likeness)

SA (Synthetic Accessibility)

Error Handling

Model Checkpoint Not Found

Invalid SMILES Output

CUDA Out of Memory

Example

See Also

More from this repository

More from this repository

Text-Based Molecule Editing

When to Use

Workflow

Step 1: Prepare Input Molecule

Step 2: Calculate Baseline Properties (Optional)

Step 3: Run Text-Based Editing

Step 4: Compare Properties

Expected Outputs

Interpretation Guide

LogP (Lipophilicity)

QED (Quantitative Estimate of Drug-likeness)

SA (Synthetic Accessibility)

Error Handling

Model Checkpoint Not Found

Invalid SMILES Output

CUDA Out of Memory

Example

See Also