com um clique
com um clique
Find AI models on Replicate using search and curated collections.
Prompting techniques for AI image generation and editing models on Replicate. Use when writing prompts for image models or building image generation features.
Prompting techniques for AI video generation models on Replicate. Use when writing prompts for video models or building video generation features.
Package and build custom AI models with Cog for deployment on Replicate. Use when creating a cog.yaml or predict.py, defining model inputs and outputs, loading model weights at setup time, building Docker images for ML models, serving locally with cog serve or cog predict, or porting a HuggingFace, GitHub, or ComfyUI model to run on Replicate. Trigger on phrases like "build a model", "package a model", "create a Cog model", "wrap a model", "containerize an AI model", "predict.py", "cog.yaml", "BasePredictor", or "Cog container", and when referencing cog.run, github.com/replicate/cog, or github.com/replicate/cog-examples. Covers GPU and CUDA setup, pget for fast weight downloads, async predictors with continuous batching, streaming outputs, and cold-boot optimization for image, video, audio, and LLM models. For pushing built models to Replicate, see publish-models. For running existing models, see run-models.
Push and publish custom AI models to Replicate, and set up CI/CD for releasing new model versions safely. Use when running cog push, deploying a model to Replicate, releasing a new version, validating a model with cog-safe-push before publishing, configuring a Replicate deployment, setting up GitHub Actions for model releases, or porting a community model to an official one. Trigger on phrases like "push a model to Replicate", "publish a model", "deploy a model", "release a new version", "cog push", "cog-safe-push", "model CI", "r8.im", or "schema compatibility", and when referencing github.com/replicate/cog-safe-push or github.com/replicate/model-ci-template. Covers cog push, the full cog-safe-push config (test cases, fuzz, deployment, official_model), GitHub Actions patterns, multi-model matrix pushes, and post-publish monitoring. Assumes you already have a working Cog project; see build-models if you need to package one first.
Compare Replicate models by cost, speed, quality, and capabilities.
| name | run-models |
| description | Run AI models on Replicate via predictions, webhooks, and streaming. |
https://replicate.com/{owner}/{model}/llms.txtAccept: text/markdown when requesting docs pages for Markdown responses.Prefer: wait header when creating a prediction for a blocking synchronous response. Only recommended for very fast models. Max 60 seconds.POST /v1/predictions endpoint, as it supports both official and community models.minimum, maximum, enum values). Don't generate values that violate them.starting -> processing -> succeeded / failed / canceled.owner/name format. Community models require owner/name:version_id.POST /v1/predictions endpoint handles both.webhook to an HTTPS URL when creating a prediction. Replicate POSTs the full prediction object when it completes.webhook_events_filter: start, output, logs, completed.Webhook-ID, Webhook-Timestamp, and Webhook-Signature headers. Get the signing secret from GET /v1/webhooks/default/secret.lifetime to auto-cancel predictions that run too long (e.g. 30s, 5m, 1h). Measured from creation time.stream URL in the response. Use SSE to receive incremental output.