name	docker
description	Execute research code inside isolated Docker containers for safe replication, experiments, and benchmarks. Use when the user selects Docker as the execution environment or asks to run code safely, in isolation, or in a sandbox.
allowed-tools	Bash(docker:*)

Docker Sandbox

Run research code inside Docker containers while Feynman stays on the host. The container gets the project files, runs the commands, and results sync back.

When to use

User selects "Docker Sandbox" as the execution environment in /replicate or /autoresearch
Running untrusted code from a paper's repository
Experiments that install packages or modify system state
Any time the user asks to run something "safely" or "isolated"

How it works

Build or pull an appropriate base image for the research code
Mount the project directory into the container
Run experiment commands inside the container
Results write back to the mounted directory

Running commands in a container

For Python research code (most common):

docker run --rm -v "$(pwd)":/workspace -w /workspace python:3.11 bash -c "
  pip install -r requirements.txt &&
  python train.py
"

For projects with a Dockerfile:

docker build -t feynman-experiment .
docker run --rm -v "$(pwd)/results":/workspace/results feynman-experiment

For GPU workloads:

docker run --rm --gpus all -v "$(pwd)":/workspace -w /workspace pytorch/pytorch:latest bash -c "
  pip install -r requirements.txt &&
  python train.py
"

Choosing the base image

Research type	Base image
Python ML/DL	`pytorch/pytorch:latest` or `tensorflow/tensorflow:latest-gpu`
Python general	`python:3.11`
Node.js	`node:20`
R / statistics	`rocker/r-ver:4`
Julia	`julia:1.10`
Multi-language	`ubuntu:24.04` with manual installs

Persistent containers

For iterative experiments (like /autoresearch), create a named container instead of --rm. Choose a descriptive name based on the experiment:

docker create --name <name> -v "$(pwd)":/workspace -w /workspace python:3.11 tail -f /dev/null
docker start <name>
docker exec <name> bash -c "pip install -r requirements.txt"
docker exec <name> bash -c "python train.py"

This preserves installed packages across iterations. Clean up with:

docker stop <name> && docker rm <name>

Notes

The mounted workspace syncs results back to the host automatically
Containers are network-enabled by default — add --network none for full isolation
For GPU access, Docker must be configured with the NVIDIA Container Toolkit

name	docker
description	Execute research code inside isolated Docker containers for safe replication, experiments, and benchmarks. Use when the user selects Docker as the execution environment or asks to run code safely, in isolation, or in a sandbox.
allowed-tools	Bash(docker:*)