Ejecuta cualquier Skill en Manus
con un clic

Ejecuta cualquier Skill en Manus con un clic

modal

Estrellas11

Forks1

Actualizado24 de marzo de 2026, 19:28

Run Python code in the cloud with serverless containers, GPUs, and autoscaling. Use when deploying ML models, running batch processing jobs, scheduling compute-intensive tasks, or serving APIs that require GPU acceleration or dynamic scaling.

Instalación

Instalar con Codex o Claude Copia este prompt, pégalo en Codex, Claude u otro asistente, y deja que revise la página de la skill y la instale por ti.

Ejecutar en Manus

Fuente

yulonglin

yulonglin/dotfiles

Abrir repositorio de GitHub Ver repositorios del creador

Descarga

Ejecutar en Manus

Ocupaciones relacionadasSOC

Basado en la clasificación ocupacional SOC

Desarrolladores de softwareOcupaciones informáticas y matemáticas·SOC 15-1252

Explorador de archivos

6 archivos

SKILL.md

readonly

Más de este repositorio

mismo repositorio

commit-push-sync

yulonglin/dotfiles

This skill should be used when the user asks to "commit and push", "commit push", "sync changes", "push changes", "commit and sync", or "update remote". Handles the full workflow of committing changes, pulling with rebase, and pushing to remote.

2026-06-2511

sweep-ai-safety

yulonglin/dotfiles

Sweep recent AI safety research from curated sources (Anthropic alignment science / red team, OpenAI, GDM, Apollo, Redwood, METR, FAR AI, Truthful AI, alphaxiv, arXiv) and surface items matching tracked topic terms (inoculation prompting, reward hacking, exploration hacking, metagaming, eval gaming, OOCR, scheming, alignment faking, sandbagging, etc.). Use when asked to "sweep AI safety", "what's new in alignment", "any recent papers on X", "weekly safety digest", or for staying current on AI safety literature.

2026-06-1011

anthropic-style

yulonglin/dotfiles

Anthropic visual style for plots, diagrams, slides, and web. Use when creating any visual output that should have Anthropic's look-and-feel — matplotlib charts, TikZ diagrams, HTML/CSS, or presentations.

2026-06-0711

check-bib-references

yulonglin/dotfiles

Catch LLM-fabricated citations in BibTeX files. Verifies arXiv/OpenReview entries against live metadata (titles, first authors), then guides manual verification of authorless prose claims. Use before submitting papers, after any LLM-assisted citation generation, or when a reference smells off.

2026-05-2411

check-prose-claims

yulonglin/dotfiles

Fact-check prose claims in slides, reports, PDFs, and papers — statistics, comparatives, attributions, causal claims, quotes. Two-pass extract-then-verify protocol with strict numerical precision and a doc-only mode. Use when the user asks to "check the claims in this deck", "fact-check this report", "audit this PDF", "verify the numbers in these slides", or before publishing/shipping any externally-facing document with quantitative claims. Complements `check-bib-references` (which handles BibTeX entries) — this skill handles the prose around them.

2026-05-2411

log-gap

yulonglin/dotfiles

Log a one-line knowledge gap to the project's gaps.md file. Use when the user is surprised by Claude's answer, says "I didn't know that", "wait what", or wants to record a misconception they just discovered. Format "I assumed X but actually Y". Personal misconception log — much higher learning signal than feedback memories.

2026-05-0211

name	modal
description	Run Python code in the cloud with serverless containers, GPUs, and autoscaling. Use when deploying ML models, running batch processing jobs, scheduling compute-intensive tasks, or serving APIs that require GPU acceleration or dynamic scaling.

Modal

Overview

Modal is a serverless platform for running Python code in the cloud with minimal configuration. Execute functions on powerful GPUs, scale automatically to thousands of containers, and pay only for compute used.

When to Use This Skill

GPU-accelerated training or inference
Batch processing / parallel .map() over inputs
Scheduled jobs (cron)
Serverless ML API endpoints
Scientific computing on specialized hardware

Setup

uv add --dev modal     # dev dep — only needed locally, not in containers
modal token new        # opens browser for auth → saves to ~/.modal.toml

Core Concepts

Images: Container Environment

debian_slim() is the default base — a minimal Debian image with Python. PyTorch bundles its own CUDA, so you don't need an nvidia base image.

Preferred: uv_sync() for existing projects — reads pyproject.toml + uv.lock:

image = (
    modal.Image.debian_slim(python_version="3.12")
    .uv_sync()  # installs from lockfile — deps never drift
)

Alternative: explicit packages (for standalone scripts without pyproject.toml):

image = (
    modal.Image.debian_slim(python_version="3.12")
    .uv_pip_install("torch==2.5.1", "transformers==4.46.0")  # pin tightly
)

Pin versions for reproducibility. Each change invalidates the image cache layer.

Also available: .pip_install_from_pyproject("pyproject.toml") (uses pip, slower than uv).

Getting Local Code into Containers

Three methods, from most to least common:

# Named package — best for multi-file projects (requires __init__.py)
image = image.add_local_python_source("my_package")

# Entire directory — includes non-.py files too
image = image.add_local_dir("src/", remote_path="/root/src")

# Single file — when you only need one script
image = image.add_local_file("train.py", "/root/train.py")

IMPORTANT — add_local_python_source(".") requires a proper Python package (directory with __init__.py). It fails with ModuleNotMountable("no package specified for '.'") for loose scripts. Use a named package: add_local_python_source("my_package").

IMPORTANT — ordering constraint: add_local_file / add_local_python_source MUST come AFTER all build steps (uv_pip_install, run_function, run_commands). Modal mounts these at container startup, not build time. Build steps after local mounts cause: InvalidError('An image tried to run a build step after using image.add_local_*'). Set copy=True to copy into the image layer if you need build steps after.

By default (copy=False), files are added at container startup (fast re-deploys — image doesn't rebuild when code changes). Set copy=True to bake into the image layer (needed if subsequent build steps depend on those files).

Functions

Functions run in the cloud. Import heavy packages inside the function body — they exist in the container but may not be installed locally:

@app.function(image=image, gpu="A10G")
def train():
    import torch  # ← inside body, not at top of file
    assert torch.cuda.is_available()

Why inside the body? Modal serializes function definitions locally and runs them remotely. Top-level import torch would fail locally if torch isn't installed on your laptop.

Alternative — use the imports() context manager for module-level imports:

with image.imports():
    import torch  # deferred — only runs in container

Calling Functions

@app.local_entrypoint()
def main():
    result = train.remote()  # runs on Modal
    print(result)

CLI args are auto-parsed from local_entrypoint type hints:

@app.local_entrypoint()
def main(lr: float = 0.001, epochs: int = 10):
    train.remote(lr, epochs)

Run: modal run train.py --lr 0.01 --epochs 20

GPUs

@app.function(gpu="A10G")       # 24GB, cost-effective
@app.function(gpu="L40S")       # 48GB, best value for inference
@app.function(gpu="A100")       # 40/80GB, training
@app.function(gpu="H100")       # top-tier training
@app.function(gpu="H100:4")     # multi-GPU
@app.function(gpu=["H100", "A100-40GB:2"])  # fallback chain

PyTorch bundles CUDA — debian_slim works fine, no nvidia base image needed.

Volumes: Persistent Storage

Network-attached filesystem that persists across runs. Good for datasets and experiment outputs. Not as fast as local SSD (image layers) — use volumes for data that changes, not for static model weights.

vol = modal.Volume.from_name("my-data", create_if_missing=True)

@app.function(volumes={"/data": vol})
def save_results():
    with open("/data/results.json", "w") as f:
        json.dump(results, f)
    vol.commit()  # persist changes (also auto-commits on exit)

Model Weights: Bake into Image

For static model weights, bake into the image via run_function() — not a volume. Image layers are cached on local SSD; volumes are network reads on every cold start.

HF_SECRET = modal.Secret.from_name("huggingface-secret")

def download_model():
    from huggingface_hub import snapshot_download
    # Use unsloth/ mirrors to avoid HF gated model access issues
    snapshot_download("unsloth/Llama-3.2-1B-Instruct", cache_dir="/models")

image = (
    modal.Image.debian_slim(python_version="3.12")
    .uv_sync()
    .uv_pip_install("huggingface_hub[hf_transfer]")  # fast Rust-based downloads
    .env({"HF_HOME": "/models", "HF_HUB_ENABLE_HF_TRANSFER": "1"})
    .run_function(download_model, secrets=[HF_SECRET])
    # add_local_* MUST come after all build steps
)

First build downloads the model; subsequent builds reuse the cached image layer.

Secrets

modal secret create huggingface-secret HF_TOKEN="hf_xxx"

@app.function(secrets=[modal.Secret.from_name("huggingface-secret")])
def use_model():
    import os
    token = os.environ["HF_TOKEN"]

Also: Secret.from_dotenv(), Secret.from_dict({...}).

Environment Variables

Bake into the image with .env():

image = modal.Image.debian_slim().env({
    "HF_HOME": "/models",
    "CUDA_VISIBLE_DEVICES": "0",
})

These are set at build time and available in every container.

Parallel Execution

@app.function()
def process(item_id: int):
    return heavy_computation(item_id)

@app.local_entrypoint()
def main():
    results = list(process.map(range(1000)))  # auto-parallelized

Autoscaling

@app.function(
    max_containers=100,
    min_containers=2,       # keep warm
    buffer_containers=5,    # idle buffer for bursts
    scaledown_window=60,    # seconds before scale-down
)
def inference():
    pass

Web Endpoints

@app.function()
@modal.web_endpoint(method="POST")
def predict(data: dict):
    return {"result": model(data["input"])}

Deploy: modal deploy script.py → HTTPS URL.

Scheduled Jobs

@app.function(schedule=modal.Cron("0 2 * * *"))  # daily 2 AM
def nightly_job():
    pass

Complete Example: Research Experiment on GPU

"""Run an experiment on Modal with persistent output."""
import modal

app = modal.App("my-experiment")

image = (
    modal.Image.debian_slim(python_version="3.12")
    .uv_sync()
    .add_local_python_source("my_experiment")  # package with __init__.py
)

vol = modal.Volume.from_name("experiment-data", create_if_missing=True)

@app.function(image=image, gpu="A10G", volumes={"/data": vol}, timeout=600)
def run_experiment():
    from my_experiment import train  # local code, importable via add_local_python_source
    results = train(output_dir="/data/results")
    vol.commit()
    return results

@app.local_entrypoint()
def main():
    results = run_experiment.remote()
    print(results)
    print("Download: modal volume get experiment-data results/ results/")

Decision Guide

Scenario	Approach
Existing project with pyproject.toml	`uv_sync()`
Standalone script	`uv_pip_install("pkg==1.2.3")`
Multi-file project (package with `__init__.py`)	`add_local_python_source("pkg_name")`
Multi-file project (loose scripts)	Multiple `add_local_file()` calls
Static model weights	`run_function()` at build time (image layer)
Dynamic data / outputs	Volume
Credentials / tokens	`modal.Secret`
Config values	`image.env({...})` or function args

References

Local references in references/:

functions.md — Decorators, .remote(), .map(), classes, async, retries
gpu.md — GPU types, multi-GPU, fallback chains, PyTorch setup
images.md — Base images, packages, local code mounting, caching
secrets.md — Environment variables, auth patterns
scheduled-jobs.md — Cron, periodic tasks

For volumes, scaling, and the latest API changes, use context7 (mcp__context7__query-docs with library ID /llmstxt/modal_llms-full_txt) or https://modal.com/docs/guide.