m4-api

Use the M4 Python API to query clinical datasets programmatically. Use when writing code to access clinical databases, executing SQL via Python, or performing multi-step data analysis.

Install command

npx skills add https://github.com/hannesill/m4 --skill m4-api

Copy and paste this command into Claude Code to install the skill.

Source

hannesill/m4

Stars: 31

Forks: 12

Updated: May 30, 2026 at 18:11

name	m4-api
description	Use the M4 Python API to query clinical datasets programmatically. Use when writing code to access clinical databases, executing SQL via Python, or performing multi-step data analysis.
tier	community
category	system

M4 Python API

The M4 Python API provides programmatic access to clinical datasets for code execution environments. It mirrors the MCP tools but returns native Python types (DataFrames, dicts) instead of formatted strings.

When to Use the API vs MCP Tools

Use the Python API when:

Complex clinical analysis - Multi-step analyses that require intermediate results, joins across queries, or statistical computations
Large result sets - Query results with thousands of rows can be stored in DataFrames without dumping into context
Mathematical operations - Aggregations, percentile calculations, statistical tests, and counting that benefit from pandas/numpy
Iterative exploration - Building up analysis through multiple queries where each step informs the next
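
As a rough illustration of the multi-step case, the sketch below joins two intermediate results and computes per-group statistics with pandas. The two DataFrames are hand-built stand-ins for what execute_query() might return; their column names and values are illustrative assumptions, not real MIMIC-IV output.

```python
import pandas as pd

# Stand-ins for two execute_query() results (illustrative values only).
patients = pd.DataFrame(
    {"subject_id": [1, 2, 3, 4], "gender": ["F", "M", "F", "M"]}
)
stays = pd.DataFrame(
    {"subject_id": [1, 2, 3, 4], "los_days": [2.0, 5.0, 4.0, 9.0]}
)

# Join the intermediate results in memory instead of re-querying.
merged = patients.merge(stays, on="subject_id", how="inner")

# Per-group statistics that would be awkward to read off formatted text.
summary = merged.groupby("gender")["los_days"].agg(
    n="count", median="median", p90=lambda s: s.quantile(0.9)
)
print(summary)
```

Keeping both frames in variables means later steps can reuse them without another round trip to the backend.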

Use MCP tools when:

Simple one-off queries where the result fits comfortably in context
Interactive exploration where you want to see results immediately

Required Workflow

You must follow this sequence:

Choose a dataset name and pass it explicitly, or create M4Client(dataset=...)
get_schema(dataset=...) / get_table_info(..., dataset=...) - Explore available tables
execute_query() - Run SQL queries

from m4 import get_schema, get_table_info, execute_query

dataset = "mimic-iv"  # or "mimic-iv-demo", "eicu", "mimic-iv-note"

# Step 1: Explore schema
schema = get_schema(dataset=dataset)
print(schema['tables'])  # List of table names

# Step 2: Inspect specific tables before querying
info = get_table_info("mimiciv_hosp.patients", dataset=dataset)
print(info['schema'])  # DataFrame with column names, types
print(info['sample'])  # DataFrame with sample rows

# Step 3: Execute queries
df = execute_query(
    "SELECT gender, COUNT(*) as n FROM mimiciv_hosp.patients GROUP BY gender",
    dataset=dataset,
)
# Returns pd.DataFrame - use pandas operations freely
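
One way to enforce the explore-then-query order in code is a small guard that refuses to run SQL against a table the schema does not list. This is a sketch, not part of M4: query_known_table is a hypothetical helper, and the schema and query functions are passed in as parameters so the snippet runs with the stand-in stubs shown. In real use, get_schema and execute_query from m4 would be passed instead, assuming they behave as documented here and that schema['tables'] holds names in the same form used in the SQL.

```python
from typing import Any, Callable


def query_known_table(
    sql: str,
    table: str,
    dataset: str,
    get_schema: Callable[..., dict],
    execute_query: Callable[..., Any],
) -> Any:
    """Run sql only if table appears in the dataset's schema listing."""
    tables = get_schema(dataset=dataset)["tables"]
    if table not in tables:
        raise ValueError(f"{table!r} not found in {dataset!r}; known tables: {tables}")
    return execute_query(sql, dataset=dataset)


# Stand-in stubs so the sketch runs without M4 installed (hypothetical data).
def fake_get_schema(dataset: str) -> dict:
    return {"backend_info": "stub", "tables": ["mimiciv_hosp.patients"]}


def fake_execute_query(sql: str, dataset: str) -> str:
    return f"ran on {dataset}: {sql}"


result = query_known_table(
    "SELECT COUNT(*) FROM mimiciv_hosp.patients",
    table="mimiciv_hosp.patients",
    dataset="mimic-iv-demo",
    get_schema=fake_get_schema,
    execute_query=fake_execute_query,
)
print(result)
```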

API Reference

Dataset Management

Function	Returns	Description
`list_datasets()`	`list[str]`	Available dataset names
`M4Client(dataset=...)`	`M4Client`	Preferred explicit client for one dataset
`client.with_dataset(name)`	`M4Client`	New client with the same session context and a different dataset
`client.switch_dataset(name)`	`M4Client`	Mutate a client to another dataset for notebook-style sessions

Tabular Data (requires TABULAR modality)

Function	Returns	Description
`get_schema(dataset=...)`	`dict`	`{'backend_info': str, 'tables': list[str]}`
`get_table_info(table, dataset=..., show_sample=True)`	`dict`	`{'schema': DataFrame, 'sample': DataFrame}`
`execute_query(sql, dataset=...)`	`DataFrame`	Query results as pandas DataFrame

backend_info summarizes the backend and dataset. Local DuckDB paths are hidden unless M4_PATH_DISCLOSURE=1 is set for the process.

Clinical Notes (requires NOTES modality)

Function	Returns	Description
`search_notes(query, dataset=..., note_type, limit, snippet_length)`	`dict`	`{'results': dict[str, DataFrame]}`
`get_note(note_id, dataset=..., max_length)`	`dict`	`{'text': str, 'subject_id': int, ...}`
`list_patient_notes(subject_id, dataset=..., note_type, limit)`	`dict`	`{'notes': dict[str, DataFrame]}`
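
Since search_notes() returns one DataFrame per dictionary key rather than a single table, a likely next step is to flatten the result. The sketch below assumes the keys are note types and that each DataFrame has note_id and snippet columns; neither is confirmed by the table above, and the data is a hand-built stand-in.

```python
import pandas as pd

# Stand-in for search_notes(...)["results"]; keys and columns are assumptions.
results = {
    "discharge": pd.DataFrame(
        {"note_id": ["d1", "d2"], "snippet": ["...sepsis...", "...septic..."]}
    ),
    "radiology": pd.DataFrame(
        {"note_id": ["r1"], "snippet": ["...sepsis workup..."]}
    ),
}

# Stack the per-key frames, keeping the key as a regular column.
flat = (
    pd.concat(results, names=["note_type", "row"])
    .reset_index(level="note_type")
    .reset_index(drop=True)
)
print(flat[["note_type", "note_id"]])
```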

Error Handling

M4 uses a hierarchy of exceptions. Catch specific types to handle errors appropriately:

M4Error (base)
├── DatasetError      # Dataset doesn't exist or not configured
├── QueryError        # SQL syntax error, table not found, query failed
└── ModalityError     # Tool incompatible with dataset (e.g., notes on tabular-only)

Recovery patterns:

from m4 import execute_query, DatasetError, QueryError, ModalityError

try:
    df = execute_query("SELECT * FROM mimiciv_hosp.patients", dataset="mimic-iv")
except DatasetError as e:
    # Dataset missing, not initialized, or misspelled.
    # Recovery: check list_datasets() and m4 status --dataset mimic-iv.
    print(f"Dataset problem: {e}")
except QueryError as e:
    # SQL error or table not found
    # Recovery: check table name with get_schema(), fix SQL syntax
    print(f"Query failed: {e}")
except ModalityError as e:
    # Tried notes function on tabular-only dataset
    # Recovery: pass dataset="mimic-iv-note" to notes functions
    print(f"Modality problem: {e}")
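
Because every exception derives from M4Error, a final except M4Error clause can act as a catch-all after the specific handlers. The sketch below uses locally defined stand-in classes that mirror the hierarchy above so that it runs without M4 installed; classify is a hypothetical helper, and the real classes would be imported from m4 instead.

```python
# Stand-in classes mirroring the documented hierarchy (not the real m4 classes).
class M4Error(Exception): ...
class DatasetError(M4Error): ...
class QueryError(M4Error): ...
class ModalityError(M4Error): ...


def classify(exc: Exception) -> str:
    """Map an exception to a recovery hint, most specific type first."""
    try:
        raise exc
    except DatasetError:
        return "check dataset name"
    except QueryError:
        return "check SQL and table names"
    except ModalityError:
        return "use a dataset with the required modality"
    except M4Error:
        # Reached only for M4 errors not matched by a handler above.
        return "unexpected M4 failure"


print(classify(QueryError("no such table")))
print(classify(M4Error("something else")))
```

The base-class handler has to come last; placed first, it would shadow the three specific handlers.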

Displaying Results

Use show() from the vitrine module to present query results to the researcher in the browser:

from pathlib import Path

from m4 import execute_query
from vitrine import show

df = execute_query(
    "SELECT gender, COUNT(*) as n FROM mimiciv_hosp.patients GROUP BY gender",
    dataset="mimic-iv",
)
Path("output").mkdir(exist_ok=True)  # to_csv does not create missing directories
df.to_csv("output/demographics.csv", index=False)  # Save for reproducibility
show(df, title="Demographics", study="my-study")   # Show for review

For blocking review (agent waits for researcher approval), use show(df, wait=True, prompt="Proceed?"). For the full display API, use the vitrine-api skill.

Dataset Selection

Important: Dataset selection is explicit. Prefer M4Client(dataset=...) when several calls target the same dataset, or pass dataset=... to each convenience function. For a long-lived session, use client.with_dataset(...) to create a new client for another dataset without mutating the current one. Use client.switch_dataset(...) only for single-session, notebook-style workflows where mutation is expected.

from m4 import M4Client, execute_query

# One client for repeated calls against the same dataset
client = M4Client(dataset="mimic-iv")
df1 = client.query("SELECT COUNT(*) FROM mimiciv_hosp.patients")

# New client for another dataset; `client` still targets mimic-iv
eicu_client = client.with_dataset("eicu")
df2 = eicu_client.query("SELECT COUNT(*) FROM patient")

client.switch_dataset("mimic-iv-note")  # mutates client and its execution context

# Convenience function: the dataset is passed explicitly on each call
df3 = execute_query("SELECT COUNT(*) FROM patient", dataset="eicu")
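
The difference between the two switching methods is the usual copy-versus-mutate distinction. The toy class below is not M4's implementation; ToyClient is a hypothetical stand-in that models only the behavior described above, where with_dataset returns a new object and switch_dataset changes the existing one.

```python
from dataclasses import dataclass, replace


@dataclass
class ToyClient:
    """Hypothetical stand-in that models only the dataset attribute."""

    dataset: str

    def with_dataset(self, name: str) -> "ToyClient":
        # Non-mutating: the original client keeps its dataset.
        return replace(self, dataset=name)

    def switch_dataset(self, name: str) -> "ToyClient":
        # Mutating: every reference to this client sees the change.
        self.dataset = name
        return self


client = ToyClient("mimic-iv")
alias = client  # a second reference, e.g. held by another function

eicu = client.with_dataset("eicu")
print(client.dataset, eicu.dataset)

client.switch_dataset("mimic-iv-note")
print(alias.dataset)
```

The second print shows why mutation is risky in long-lived sessions: code holding `alias` now queries a different dataset without having asked for it.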

MCP Tool Equivalence

The Python API mirrors the MCP tools but returns native Python types instead of formatted strings:

MCP Tool	Python Function	MCP Returns	Python Returns
`execute_query`	`execute_query()`	Formatted string	`pd.DataFrame`
`get_database_schema`	`get_schema()`	Formatted string	`dict` with `tables` list
`get_table_info`	`get_table_info()`	Formatted string	`dict` with `schema`/`sample` DataFrames

Use the Python API when you need to:

Chain queries in analysis pipelines
Perform pandas operations on results
Avoid parsing formatted output
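
Chaining typically means feeding identifiers from one result into the next query. The sketch below builds an IN (...) filter from integer IDs; in_clause is a hypothetical helper, and the IDs are made-up stand-ins for a column such as df['subject_id'] from an earlier execute_query() call. It coerces values to int so that only numeric literals reach the SQL text.

```python
from typing import Iterable


def in_clause(column: str, ids: Iterable[int]) -> str:
    """Build 'column IN (1, 2, 3)' from integer IDs, de-duplicated and sorted."""
    unique_ids = sorted({int(i) for i in ids})
    if not unique_ids:
        # An empty IN () is invalid SQL; match nothing instead.
        return "1 = 0"
    return f"{column} IN ({', '.join(str(i) for i in unique_ids)})"


cohort_ids = [10002, 10001, 10002]  # stand-in for a first query's ID column
sql = (
    "SELECT subject_id, hadm_id FROM mimiciv_hosp.admissions "
    f"WHERE {in_clause('subject_id', cohort_ids)}"
)
print(sql)
```

For very large cohorts, a join inside a single SQL query is usually a better fit than a long literal list.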

NOTE: All queries use canonical schema.table names (e.g., mimiciv_hosp.patients, mimiciv_icu.icustays). These names work on both the local DuckDB backend and the BigQuery backend, so table names do not need to be adjusted per backend.