with one click
ToolUniverse workflow — Chemical Safety
npx skills add https://github.com/lamm-mit/scienceclaw --skill chemical-safetyCopy and paste this command into Claude Code to install the skill
ToolUniverse workflow — Chemical Safety
npx skills add https://github.com/lamm-mit/scienceclaw --skill chemical-safetyCopy and paste this command into Claude Code to install the skill
Onboard and manage Paperclip AI for research-paper knowledge and agent orchestration
Generate a structured scientific post and publish it to Infinite. Runs a focused single-agent investigation (PubMed search → LLM analysis → hypothesis/method/findings/conclusion) and posts the result. Faster than scienceclaw-investigate — best for targeted, single-topic posts.
Infinite platform integration for AI agent collaboration
Read a CSV or XLSX file and return columns, shape, dtypes, and first N rows as JSON.
Execute arbitrary Python code and return stdout. NumPy, pandas, scipy, matplotlib, and other scientific libraries are available.
Generate a structured scientific PDF report from a JSON description. Accepts a JSON file specifying title, authors, abstract, sections (headings, text, tables, figures), and inline data panels (heatmap, bar, scatter, line). Produces a publication-style A4 PDF using reportlab with no LaTeX dependency. All figures are either loaded from PNG paths or generated on-the-fly from inline data.
| name | chemical-safety |
| description | ToolUniverse workflow — Chemical Safety |
| source | https://github.com/mims-harvard/ToolUniverse/tree/main/skills/tooluniverse-chemical-safety |
| metadata | null |
Comprehensive chemical safety and toxicology analysis integrating predictive AI models, curated toxicogenomics databases, regulatory safety data, and chemical-biological interaction networks. Generates structured risk assessment reports with evidence grading.
Triggers:
Use Cases:
get_tool_info before calling unfamiliar toolsGrade every toxicity claim by evidence strength:
| Tier | Symbol | Criteria | Examples |
|---|---|---|---|
| T1 | [T1] | Direct human evidence, regulatory finding | FDA boxed warning, clinical trial toxicity, human case reports |
| T2 | [T2] | Animal studies, validated in vitro | Nonclinical toxicology, AMES positive, animal LD50 |
| T3 | [T3] | Computational prediction, association data | ADMET-AI prediction, CTD association, QSAR model |
| T4 | [T4] | Database annotation, text-mined | Literature mention, database entry without validation |
Evidence grades MUST appear in:
Chemical/Drug Query
|
+-- PHASE 0: Compound Disambiguation (ALWAYS FIRST)
| +-- Resolve name -> SMILES, PubChem CID, ChEMBL ID
| +-- Get molecular formula, weight, canonical structure
|
+-- PHASE 1: Predictive Toxicology (ADMET-AI)
| +-- Mutagenicity (AMES)
| +-- Hepatotoxicity (DILI, ClinTox)
| +-- Carcinogenicity
| +-- Acute toxicity (LD50)
| +-- Skin reactions
| +-- Stress response pathways
| +-- Nuclear receptor activity
|
+-- PHASE 2: ADMET Properties
| +-- Absorption: BBB penetrance, bioavailability
| +-- Distribution: clearance, volume of distribution
| +-- Metabolism: CYP interactions (1A2, 2C9, 2C19, 2D6, 3A4)
| +-- Physicochemical: solubility, lipophilicity, pKa
|
+-- PHASE 3: Toxicogenomics (CTD)
| +-- Chemical-gene interactions
| +-- Chemical-disease associations
| +-- Affected biological pathways
|
+-- PHASE 4: Regulatory Safety (FDA Labels)
| +-- Boxed warnings (Black Box)
| +-- Contraindications
| +-- Adverse reactions
| +-- Warnings and precautions
| +-- Nonclinical toxicology
|
+-- PHASE 5: Drug Safety Profile (DrugBank)
| +-- Toxicity data
| +-- Contraindications
| +-- Drug interactions affecting safety
|
+-- PHASE 6: Chemical-Protein Interactions (STITCH)
| +-- Direct chemical-protein binding
| +-- Interaction confidence scores
| +-- Off-target effects
|
+-- PHASE 7: Structural Alerts (ChEMBL)
| +-- Known toxic substructures (PAINS, Brenk)
| +-- Structural alert flags
|
+-- SYNTHESIS: Integrated Risk Assessment
+-- Aggregate all evidence tiers
+-- Risk classification (Low/Medium/High/Critical)
+-- Data gaps and recommendations
CRITICAL: Resolve compound identity before any analysis.
| Input Format | Resolution Strategy |
|---|---|
| Drug name (e.g., "Aspirin") | PubChem_get_CID_by_compound_name -> get SMILES from properties |
| SMILES string | Use directly for ADMET-AI; resolve to CID for other tools |
| PubChem CID | PubChem_get_compound_properties_by_CID -> get SMILES + name |
| ChEMBL ID | ChEMBL_get_molecule -> get SMILES + properties |
PubChem_get_CID_by_compound_name(name=<compound_name>)PubChem_get_compound_properties_by_CID(cid=<cid>)ConnectivitySMILES, CanonicalSMILES, or IsomericSMILES depending on response format)name, smiles, cid, formula, weight, inchi## Compound Identity
| Property | Value |
|----------|-------|
| **Name** | Acetaminophen |
| **PubChem CID** | 1983 |
| **SMILES** | CC(=O)Nc1ccc(O)cc1 |
| **Formula** | C8H9NO2 |
| **Molecular Weight** | 151.16 |
| **InChI** | InChI=1S/C8H9NO2/... |
When: SMILES is available (from Phase 0 or provided directly)
Objective: Run comprehensive AI-predicted toxicity endpoints
All ADMET-AI tools take the same parameter format:
| Tool | Predicted Endpoints | Parameter |
|---|---|---|
ADMETAI_predict_toxicity | AMES, Carcinogens_Lagunin, ClinTox, DILI, LD50_Zhu, Skin_Reaction, hERG | smiles: list[str] |
ADMETAI_predict_stress_response | Stress response pathway activation (ARE, ATAD5, HSE, MMP, p53) | smiles: list[str] |
ADMETAI_predict_nuclear_receptor_activity | AhR, AR, ER, PPARg, Aromatase nuclear receptor activity | smiles: list[str] |
ADMETAI_predict_toxicity(smiles=[resolved_smiles])ADMETAI_predict_stress_response(smiles=[resolved_smiles])ADMETAI_predict_nuclear_receptor_activity(smiles=[resolved_smiles])### Toxicity Predictions [T3]
| Endpoint | Prediction | Interpretation | Concern Level |
|----------|-----------|---------------|---------------|
| AMES Mutagenicity | Inactive | No mutagenic signal | Low |
| Carcinogenicity | Inactive | No carcinogenic signal | Low |
| ClinTox | Active | Clinical toxicity signal | HIGH |
| DILI | Active | Drug-induced liver injury risk | HIGH |
| LD50 (Zhu) | 2.45 log(mg/kg) | ~282 mg/kg (moderate) | Medium |
| Skin Reaction | Inactive | No skin sensitization signal | Low |
| hERG Inhibition | Active | Cardiac arrhythmia risk | HIGH |
*All predictions from ADMET-AI. Evidence tier: [T3] (computational prediction)*
When: SMILES is available
Objective: Full ADMET characterization beyond toxicity
| Tool | Properties Predicted | Parameter |
|---|---|---|
ADMETAI_predict_BBB_penetrance | Blood-brain barrier crossing probability | smiles: list[str] |
ADMETAI_predict_bioavailability | Oral bioavailability (F20%, F30%) | smiles: list[str] |
ADMETAI_predict_clearance_distribution | Clearance, VDss, half-life, PPB | smiles: list[str] |
ADMETAI_predict_CYP_interactions | CYP1A2, 2C9, 2C19, 2D6, 3A4 inhibition/substrate | smiles: list[str] |
ADMETAI_predict_physicochemical_properties | LogP, LogD, LogS, MW, pKa | smiles: list[str] |
ADMETAI_predict_solubility_lipophilicity_hydration | Aqueous solubility, lipophilicity, hydration free energy | smiles: list[str] |
### ADMET Profile [T3]
#### Absorption
| Property | Value | Interpretation |
|----------|-------|----------------|
| BBB Penetrance | Yes | Crosses blood-brain barrier |
| Bioavailability (F20%) | 85% | Good oral absorption |
#### Distribution
| Property | Value | Interpretation |
|----------|-------|----------------|
| VDss | 1.2 L/kg | Moderate tissue distribution |
| PPB | 92% | Highly protein bound |
#### Metabolism
| CYP Enzyme | Substrate | Inhibitor |
|------------|-----------|-----------|
| CYP1A2 | No | No |
| CYP2C9 | Yes | No |
| CYP2C19 | No | No |
| CYP2D6 | No | No |
| CYP3A4 | Yes | Yes (DDI risk) |
#### Excretion
| Property | Value | Interpretation |
|----------|-------|----------------|
| Clearance | 8.5 mL/min/kg | Moderate clearance |
| Half-life | 6.2 h | Moderate half-life |
When: Compound name is resolved
Objective: Map chemical-gene-disease relationships from curated CTD data
| Tool | Function | Parameter |
|---|---|---|
CTD_get_chemical_gene_interactions | Genes affected by chemical | input_terms: str (chemical name) |
CTD_get_chemical_diseases | Diseases linked to chemical exposure | input_terms: str (chemical name) |
CTD_get_chemical_gene_interactions(input_terms=compound_name)CTD_get_chemical_diseases(input_terms=compound_name)### Toxicogenomics (CTD) [T2/T3]
#### Chemical-Gene Interactions (Top 20)
| Gene | Interaction | Type | Evidence |
|------|------------|------|----------|
| CYP1A2 | increases expression | mRNA | [T2] curated |
| TP53 | affects activity | protein | [T2] curated |
| ... | ... | ... | ... |
**Total interactions found**: 156
**Top affected pathways**: Xenobiotic metabolism, Apoptosis, DNA damage response
#### Chemical-Disease Associations (Top 10)
| Disease | Association Type | Evidence |
|---------|-----------------|----------|
| Liver Neoplasms | marker/mechanism | [T2] curated |
| Contact Dermatitis | therapeutic | [T2] curated |
| ... | ... | ... |
When: Compound has an approved drug name
Objective: Extract regulatory safety information from FDA drug labels
| Tool | Information Retrieved | Parameter |
|---|---|---|
FDA_get_boxed_warning_info_by_drug_name | Black box warnings (most serious) | drug_name: str |
FDA_get_contraindications_by_drug_name | Absolute contraindications | drug_name: str |
FDA_get_adverse_reactions_by_drug_name | Known adverse reactions | drug_name: str |
FDA_get_warnings_by_drug_name | Warnings and precautions | drug_name: str |
FDA_get_nonclinical_toxicology_info_by_drug_name | Animal toxicology data | drug_name: str |
FDA_get_carcinogenic_mutagenic_fertility_by_drug_name | Carcinogenicity/mutagenicity/fertility data | drug_name: str |
### Regulatory Safety (FDA) [T1]
#### Boxed Warning
**PRESENT** - Hepatotoxicity risk with doses >4g/day. Liver failure reported. [T1]
#### Contraindications
- Severe hepatic impairment [T1]
- Known hypersensitivity [T1]
#### Adverse Reactions (by frequency)
| Reaction | Frequency | Severity |
|----------|-----------|----------|
| Nausea | Common (>1%) | Mild |
| Hepatotoxicity | Rare (<0.1%) | Severe |
| ... | ... | ... |
#### Nonclinical Toxicology [T2]
- **Carcinogenicity**: No carcinogenic potential in 2-year rat/mouse studies
- **Mutagenicity**: Negative in Ames assay and in vivo micronucleus test
- **Fertility**: No effects on fertility at doses up to 10x human dose
When: Compound is a known drug
Objective: Retrieve curated drug safety data from DrugBank
| Tool | Information | Parameters |
|---|---|---|
drugbank_get_safety_by_drug_name_or_drugbank_id | Toxicity, contraindications | query: str, case_sensitive: bool, exact_match: bool, limit: int |
drugbank_get_safety_by_drug_name_or_drugbank_id(query=drug_name, case_sensitive=False, exact_match=False, limit=5)When: Compound can be identified by name or SMILES
Objective: Map chemical-protein interaction network for off-target assessment
| Tool | Function | Parameters |
|---|---|---|
STITCH_resolve_identifier | Resolve chemical name to STITCH ID | identifier: str, species: int (9606=human) |
STITCH_get_chemical_protein_interactions | Get chemical-protein interactions | identifiers: list[str], species: int, required_score: int |
STITCH_get_interaction_partners | Get interaction network | identifiers: list[str], species: int, limit: int |
STITCH_resolve_identifier(identifier=compound_name, species=9606)STITCH_get_chemical_protein_interactions(identifiers=[stitch_id], species=9606, required_score=700)When: ChEMBL molecule ID is available (from Phase 0)
Objective: Check for known toxic substructures
| Tool | Function | Parameters |
|---|---|---|
ChEMBL_search_compound_structural_alerts | Find structural alert matches | molecule_chembl_id: str, limit: int |
ChEMBL_search_compound_structural_alerts(molecule_chembl_id=chembl_id, limit=20)Always the final section. Integrates all evidence into actionable risk classification.
| Risk Level | Criteria |
|---|---|
| CRITICAL | FDA boxed warning present OR multiple [T1] toxicity findings OR active DILI + active hERG |
| HIGH | FDA warnings present OR [T2] animal toxicity OR multiple active ADMET endpoints |
| MEDIUM | Some [T3] predictions positive OR CTD disease associations OR structural alerts |
| LOW | All ADMET endpoints negative AND no FDA/DrugBank safety flags AND no CTD concerns |
| INSUFFICIENT DATA | Fewer than 3 phases returned data; cannot make confident assessment |
## Integrated Risk Assessment
### Overall Risk Classification: [HIGH]
### Evidence Summary
| Dimension | Finding | Evidence Tier | Concern |
|-----------|---------|--------------|---------|
| ADMET Toxicity | DILI active, hERG active | [T3] | HIGH |
| FDA Label | Boxed warning for hepatotoxicity | [T1] | CRITICAL |
| CTD Toxicogenomics | 156 gene interactions, liver neoplasms | [T2] | HIGH |
| DrugBank | Known hepatotoxicity at high doses | [T2] | HIGH |
| STITCH | Binds CYP3A4, hERG | [T3] | MEDIUM |
| Structural Alerts | 2 Brenk alerts | [T3] | MEDIUM |
### Key Safety Concerns
1. **Hepatotoxicity** [T1]: FDA boxed warning + ADMET-AI DILI prediction + CTD liver disease associations
2. **Cardiac Risk** [T3]: ADMET-AI hERG prediction + STITCH hERG interaction
3. **Drug Interactions** [T3]: CYP3A4 substrate/inhibitor, potential DDI risk
### Data Gaps
- [ ] No in vivo genotoxicity data available
- [ ] STITCH interaction scores moderate (700-900)
- [ ] No environmental exposure data
### Recommendations
1. Avoid doses >4g/day (hepatotoxicity threshold) [T1]
2. Monitor liver function in chronic use [T1]
3. Screen for CYP3A4 interactions before co-administration [T3]
4. Consider cardiac monitoring for at-risk patients [T3]
Before finalizing any report, verify:
Critical Parameter Notes (verified from source code):
| Tool | Parameter Name | Type | Notes |
|---|---|---|---|
| All ADMETAI tools | smiles | list[str] | Always a list, even for single compound |
| All CTD tools | input_terms | str | Chemical name, MeSH name, CAS RN, or MeSH ID |
| All FDA tools | drug_name | str | Brand or generic drug name |
| drugbank_get_safety_* | query, case_sensitive, exact_match, limit | str, bool, bool, int | All 4 required |
| STITCH_resolve_identifier | identifier, species | str, int | species=9606 for human |
| STITCH_get_chemical_protein_interactions | identifiers, species, required_score | list[str], int, int | required_score=400 default |
| PubChem_get_CID_by_compound_name | name | str | Compound name (not SMILES) |
| PubChem_get_compound_properties_by_CID | cid | int | Numeric CID |
| ChEMBL_search_compound_structural_alerts | molecule_chembl_id | str | ChEMBL ID (e.g., "CHEMBL112") |
{status: "success", data: {...}} with prediction values{status, data} with label text{data: [...]} with drug records{IdentifierList: {CID: [...]}} (may or may not have data wrapper)CID, MolecularWeight, ConnectivitySMILES, IUPACNameInput: SMILES string for new molecule
Workflow: Phase 0 (SMILES->CID) -> Phase 1 (toxicity) -> Phase 2 (ADMET) -> Phase 7 (structural alerts) -> Synthesis
Output: Predictive safety profile for novel compound
Input: Drug name (e.g., "Acetaminophen")
Workflow: All phases (0-7 + Synthesis)
Output: Complete safety dossier with regulatory + predictive + database evidence
Input: Chemical name (e.g., "Bisphenol A")
Workflow: Phase 0 -> Phase 1 -> Phase 2 -> Phase 3 (CTD, key for env chemicals) -> Phase 6 -> Synthesis
Output: Environmental health risk assessment focused on gene-disease associations
Input: Multiple SMILES strings
Workflow: Phase 0 -> Phase 1 (batch) -> Phase 2 (batch) -> Comparative table -> Synthesis
Output: Comparative toxicity table ranking compounds by safety
Input: Chemical name + specific gene or disease interest
Workflow: Phase 0 -> Phase 3 (CTD expanded) -> Literature search -> Synthesis
Output: Detailed chemical-gene-disease mechanistic analysis
All analyses generate a structured markdown report with progressive sections:
# Chemical Safety & Toxicology Report: [Compound Name]
**Generated**: YYYY-MM-DD HH:MM
**Compound**: [Name] | SMILES: [SMILES] | CID: [CID]
## Executive Summary
[2-3 sentence overview with risk classification and key findings, all graded]
## 1. Compound Identity
[Phase 0 results - disambiguation table]
## 2. Predictive Toxicology
[Phase 1 results - ADMET-AI toxicity endpoints]
## 3. ADMET Profile
[Phase 2 results - absorption, distribution, metabolism, excretion]
## 4. Toxicogenomics
[Phase 3 results - CTD chemical-gene-disease relationships]
## 5. Regulatory Safety
[Phase 4 results - FDA label information]
## 6. Drug Safety Profile
[Phase 5 results - DrugBank data]
## 7. Chemical-Protein Interactions
[Phase 6 results - STITCH network]
## 8. Structural Alerts
[Phase 7 results - ChEMBL alerts]
## 9. Integrated Risk Assessment
[Synthesis - risk classification, evidence summary, data gaps, recommendations]
## Appendix: Methods and Data Sources
[Tool versions, databases queried, date of access]
Chemical Safety & Toxicology Assessment Skill provides comprehensive safety evaluation by integrating:
Outputs: Structured markdown report with risk classification, evidence grading, and actionable recommendations
Best for: Drug safety assessment, chemical hazard profiling, environmental toxicology, ADMET characterization, toxicogenomic analysis
Total tools integrated: 25+ tools across 6 databases