Run any Skill in Manus with one click

$pwd:

models-selector

Name: Models Selector
Author: Azure-Samples

// Select Microsoft Foundry models and regions, then configure azd environment variables (AZURE_LOCATION and AI_PROJECT_DEPLOYMENTS). Use when the user wants to choose, compare, or deploy AI models in Azure, set model deployments for azd provisioning, pick a region for AI workloads, or configure which models to deploy in their Azure AI Foundry project. Also use when user mentions model selection, model availability, region selection for AI, or azd env set for models.

Run Skill in Manus

$ git log --oneline --stat

stars:20

forks:4

updated:May 6, 2026 at 14:02

SKILL.md

readonly

package.json

"author": "Azure-Samples"

"repository": "Azure-Samples/Spec2Cloud"

View GitHub Repository

$ install --globalskills.sh

$ download --local

Run Skill in Manus

[HINT] Download the complete skill directory including SKILL.md and all related files

Run any Skill with one click

name

models-selector

description

Select Microsoft Foundry models and regions, then configure azd environment variables (AZURE_LOCATION and AI_PROJECT_DEPLOYMENTS). Use when the user wants to choose, compare, or deploy AI models in Azure, set model deployments for azd provisioning, pick a region for AI workloads, or configure which models to deploy in their Azure AI Foundry project. Also use when user mentions model selection, model availability, region selection for AI, or azd env set for models.

Foundry Models Selector

Configure AZURE_LOCATION and AI_PROJECT_DEPLOYMENTS azd environment variables by selecting the right models and regions for the user's needs.

Preferred Models by Task & Modality

Use this section to recommend models based on what the user needs. Always suggest the best-fit model first, then offer alternatives.

Chat & General Purpose (Text In / Text Out)

Priority	Model	Format	Version	Why
Best	`gpt-5-mini`	OpenAI	`2025-08-07`	Best balance of capability, speed, and cost. No registration required.
Strong	`gpt-4.1-mini`	OpenAI	`2025-04-14`	Fast, cheap, 1M token context, great for most tasks.
Strong	`gpt-5-nano`	OpenAI	`2025-08-07`	Cheapest GPT-5 series. No registration required.
Premium	`gpt-5`	OpenAI	`2025-08-07`	Top-tier reasoning. Registration required.
Premium	`gpt-5.1`	OpenAI	`2025-11-13`	Latest reasoning model. Registration required.
Alt	`DeepSeek-V3.2`	DeepSeek	`1`	Strong open-weight alternative. All regions.
Alt	`Llama-4-Maverick-17B-128E-Instruct-FP8`	Meta	`1`	Open-weight, 1M context, multimodal. All regions.

Advanced Reasoning & Complex Problem Solving

Priority	Model	Format	Version	Why
Best	`o4-mini`	OpenAI	`2025-04-16`	Best reasoning-to-cost ratio.
Strong	`o3`	OpenAI	`2025-04-16`	Stronger reasoning, higher cost.
Strong	`DeepSeek-R1-0528`	DeepSeek	`1`	Open-weight reasoning model. All regions.
Premium	`gpt-5`	OpenAI	`2025-08-07`	Full GPT-5 reasoning. Registration required.
Premium	`gpt-5.1`	OpenAI	`2025-11-13`	Latest reasoning. Registration required.
Alt	`grok-4-fast-reasoning`	xAI	`1`	External reasoning alternative. All regions.
Alt	`Kimi-K2-Thinking`	Moonshot	`1`	Open-weight, 262K context, tool calling. All regions.

Coding & Development

Priority	Model	Format	Version	Why
Best	`gpt-4.1`	OpenAI	`2025-04-14`	1M token context, great for large codebases.
Strong	`gpt-4.1-mini`	OpenAI	`2025-04-14`	Cost-efficient coding with large context.
Strong	`codex-mini`	OpenAI	`2025-05-16`	Fine-tuned o4-mini for code. East US 2 & Sweden Central only.
Alt	`grok-code-fast-1`	xAI	`1`	External coding specialist. Registration required.
Alt	`DeepSeek-V3.2`	DeepSeek	`1`	Strong coding performance, all regions.

Multimodal (Text + Image Input)

Priority	Model	Format	Version	Why
Best	`gpt-5-mini`	OpenAI	`2025-08-07`	Vision + text, cost-efficient.
Strong	`gpt-4.1`	OpenAI	`2025-04-14`	1M context with vision.
Strong	`o4-mini`	OpenAI	`2025-04-16`	Vision + reasoning.
Alt	`Llama-4-Maverick-17B-128E-Instruct-FP8`	Meta	`1`	Open-weight multimodal, 1M context.
Alt	`Kimi-K2.5`	Moonshot	`1`	Multimodal reasoning. All regions.

Embeddings

Priority	Model	Format	Version	Why
Best	`text-embedding-3-large`	OpenAI	`1`	Best quality embeddings, 3072 dimensions.
Smaller	`text-embedding-3-small`	OpenAI	`1`	Cheaper, 1536 dimensions.
Legacy	`text-embedding-ada-002`	OpenAI	`2`	Older model, still widely used.

Image Generation

Priority	Model	Format	Version	Why
Best	`gpt-image-1`	OpenAI	`1`	Best quality. Registration required. Limited regions.
Alt	`dall-e-3`	OpenAI	`3.0`	Generally available. East US, Australia East, Sweden Central.
Alt	`FLUX.2-pro`	BlackForestLabs	`1`	Multi-reference support. All regions.

Document Processing

Priority	Model	Format	Version	Why
Best	`mistral-document-ai-2505`	MistralAI	`1`	PDF/image to text specialist. All regions.

Cost-Optimized Routing

Priority	Model	Format	Version	Why
Best	`model-router`	Microsoft	`2025-11-18`	Automatically routes to optimal model. Up to 60% savings. East US 2 & Sweden Central only.

How to Set Environment Variables

Set Azure Location

azd env set AZURE_LOCATION "<region>"

Example:

azd env set AZURE_LOCATION "eastus2"

Set Model Deployments

azd env set AI_PROJECT_DEPLOYMENTS "[{'name':'<deployment-name>', 'model': {'name': '<model-name>','format': '<format>', 'version':'<version>'},'sku': {'name': '<sku-name>','capacity':<capacity>}}]"

Single model example:

azd env set AI_PROJECT_DEPLOYMENTS "[{'name':'gpt-5-mini', 'model': {'name': 'gpt-5-mini','format': 'OpenAI', 'version':'2025-08-07'},'sku': {'name': 'GlobalStandard','capacity':100}}]"

Multiple models example:

azd env set AI_PROJECT_DEPLOYMENTS "[{'name':'gpt-4.1-mini', 'model': {'name': 'gpt-4.1-mini','format': 'OpenAI', 'version':'2025-04-14'},'sku': {'name': 'GlobalStandard','capacity':100}}, {'name':'DeepSeek-V3.2', 'model': {'name': 'DeepSeek-V3.2','format': 'DeepSeek', 'version':'1'},'sku': {'name': 'GlobalStandard','capacity':100}}]"

Deployment Entry Format

Each deployment entry in the JSON array:

{
  'name': '<deployment-name>',        # Deployment name (typically same as model name)
  'model': {
    'name': '<model-name>',           # Exact model name from tables below
    'format': '<format>',             # Provider format (see Model Catalog)
    'version': '<version>'            # Model version string
  },
  'sku': {
    'name': '<sku-name>',             # Usually 'GlobalStandard'
    'capacity': <number>              # TPM in thousands (e.g., 100 = 100K TPM)
  }
}

Model Catalog

Azure OpenAI Models (format: `OpenAI`)

Model Name	Version	Modality	Context	SKU
`gpt-5.2-codex`	`2026-01-14`	Text+Image→Text (Reasoning, Codex)	400K	GlobalStandard
`gpt-5.2`	`2025-12-11`	Text+Image→Text (Reasoning)	400K	GlobalStandard
`gpt-5.2-chat`	`2025-12-11`	Text→Text (Preview)	128K	GlobalStandard
`gpt-5.1`	`2025-11-13`	Text+Image→Text (Reasoning)	400K	GlobalStandard
`gpt-5.1-chat`	`2025-11-13`	Text→Text (Preview, Reasoning)	128K	GlobalStandard
`gpt-5.1-codex`	`2025-11-13`	Text+Image→Text (Codex)	400K	GlobalStandard
`gpt-5.1-codex-mini`	`2025-11-13`	Text+Image→Text (Codex)	400K	GlobalStandard
`gpt-5.1-codex-max`	`2025-12-04`	Text+Image→Text (Codex)	400K	GlobalStandard
`gpt-5`	`2025-08-07`	Text+Image→Text (Reasoning)	400K	GlobalStandard
`gpt-5-mini`	`2025-08-07`	Text+Image→Text (Reasoning)	400K	GlobalStandard
`gpt-5-nano`	`2025-08-07`	Text+Image→Text (Reasoning)	400K	GlobalStandard
`gpt-5-chat`	`2025-10-03`	Text+Image→Text (Preview)	128K	GlobalStandard
`gpt-5-codex`	`2025-09-11`	Text+Image→Text (Codex)	400K	GlobalStandard
`gpt-5-pro`	`2025-10-06`	Text+Image→Text (Reasoning)	400K	GlobalStandard
`gpt-4.1`	`2025-04-14`	Text+Image→Text	1M (128K std)	GlobalStandard
`gpt-4.1-mini`	`2025-04-14`	Text+Image→Text	1M (128K std)	GlobalStandard
`gpt-4.1-nano`	`2025-04-14`	Text+Image→Text	1M (128K std)	GlobalStandard
`o4-mini`	`2025-04-16`	Text+Image→Text (Reasoning)	200K	GlobalStandard
`o3`	`2025-04-16`	Text+Image→Text (Reasoning)	200K	GlobalStandard
`o3-mini`	`2025-01-31`	Text→Text (Reasoning)	200K	GlobalStandard
`o3-pro`	`2025-06-10`	Text+Image→Text (Reasoning)	200K	GlobalStandard
`o1`	`2024-12-17`	Text+Image→Text (Reasoning)	200K	GlobalStandard
`codex-mini`	`2025-05-16`	Text+Image→Text (o4-mini fine-tuned)	200K	GlobalStandard
`gpt-4o`	`2024-11-20`	Text+Image→Text	128K	GlobalStandard
`gpt-4o-mini`	`2024-07-18`	Text+Image→Text	128K	GlobalStandard
`computer-use-preview`	`2025-03-11`	Text+Image→Text (Computer Use)	8K	GlobalStandard
`text-embedding-3-large`	`1`	Text→Embedding (3072 dim)	8K	GlobalStandard
`text-embedding-3-small`	`1`	Text→Embedding (1536 dim)	8K	GlobalStandard
`text-embedding-ada-002`	`2`	Text→Embedding (1536 dim)	8K	GlobalStandard
`dall-e-3`	`3.0`	Text→Image	4K	Standard
`gpt-image-1`	`1`	Text→Image	4K	GlobalStandard
`sora-2`	`1`	Text→Video	-	GlobalStandard

Azure OpenAI OSS Models (format: `OpenAI-OSS`)

Model Name	Version	Modality	Context	SKU
`gpt-oss-120b`	`1`	Text→Text (Reasoning, Preview)	131K	GlobalStandard

DeepSeek Models (format: `DeepSeek`)

Model Name	Version	Modality	Context	SKU
`DeepSeek-V3.2-Speciale`	`1`	Text→Text (Reasoning)	128K	GlobalStandard
`DeepSeek-V3.2`	`1`	Text→Text (Reasoning)	128K	GlobalStandard
`DeepSeek-V3.1`	`1`	Text→Text (Tool calling)	131K	GlobalStandard
`DeepSeek-V3-0324`	`1`	Text→Text (Tool calling)	131K	GlobalStandard
`DeepSeek-R1-0528`	`1`	Text→Text (Reasoning)	163K	GlobalStandard
`DeepSeek-R1`	`1`	Text→Text (Reasoning)	163K	GlobalStandard

Meta Models (format: `Meta`)

Model Name	Version	Modality	Context	SKU
`Llama-4-Maverick-17B-128E-Instruct-FP8`	`1`	Text+Image→Text	1M	GlobalStandard
`Llama-3.3-70B-Instruct`	`1`	Text→Text	128K	GlobalStandard

Microsoft Models (format: `Microsoft`)

Model Name	Version	Modality	Context	SKU
`model-router`	`2025-11-18`	Text+Image→Text (Auto-routing)	200K	GlobalStandard
`MAI-DS-R1`	`1`	Text→Text (Reasoning)	163K	GlobalStandard

Mistral Models (format: `MistralAI`)

Model Name	Version	Modality	Context	SKU
`Mistral-Large-3`	`1`	Text+Image→Text (Tool calling)	-	GlobalStandard
`mistral-document-ai-2505`	`1`	Image/PDF→Text	-	GlobalStandard

Cohere Models (format: `Cohere`)

Model Name	Version	Modality	Context	SKU
`Cohere-command-a`	`1`	Text→Text (Tool calling)	131K	GlobalStandard
`Cohere-rerank-v4.0-pro`	`1`	Text→Text (Rerank)	-	GlobalStandard
`Cohere-rerank-v4.0-fast`	`1`	Text→Text (Rerank)	-	GlobalStandard
`embed-v-4-0`	`1`	Text+Image→Embedding	512	GlobalStandard

Moonshot AI Models (format: `Moonshot`)

Model Name	Version	Modality	Context	SKU
`Kimi-K2.5`	`1`	Text+Image→Text (Reasoning, Tool calling)	262K	GlobalStandard
`Kimi-K2-Thinking`	`1`	Text→Text (Reasoning, Tool calling)	262K	GlobalStandard

xAI Models (format: `xAI`)

Model Name	Version	Modality	Context	SKU
`grok-4`	`1`	Text→Text (Tool calling)	262K	GlobalStandard
`grok-4-fast-reasoning`	`1`	Text+Image→Text (Tool calling)	128K	GlobalStandard
`grok-4-fast-non-reasoning`	`1`	Text+Image→Text (Tool calling)	128K	GlobalStandard
`grok-code-fast-1`	`1`	Text→Text (Tool calling)	256K	GlobalStandard
`grok-3`	`1`	Text→Text (Tool calling)	131K	GlobalStandard
`grok-3-mini`	`1`	Text→Text (Tool calling)	131K	GlobalStandard

Black Forest Labs Models (format: `BlackForestLabs`)

Model Name	Version	Modality	Context	SKU
`FLUX.2-pro`	`1`	Text+Image→Image	-	GlobalStandard
`FLUX.1-Kontext-pro`	`1`	Text+Image→Image	-	GlobalStandard
`FLUX-1.1-pro`	`1`	Text→Image	-	GlobalStandard

Region Availability

Regions with Maximum Azure OpenAI Model Coverage

These regions have the broadest model availability. Prefer these when needing the widest selection:

Region	Azure ID	Coverage
East US 2	`eastus2`	All models — Best overall coverage. Has every Azure OpenAI model + audio + image + video + codex.
Sweden Central	`swedencentral`	Near-complete — All core models + audio + image + video. Missing only a few niche models vs East US 2.
Central US	`centralus`	Good coverage — GPT-5 series, GPT-4.1, o-series, plus some audio models.

Regions with Good Azure OpenAI Coverage

These regions support all mainstream models (GPT-5-mini/nano, GPT-4.1 series, o-series, GPT-4o, embeddings):

Region	Azure ID
Australia East	`australiaeast`
Canada East	`canadaeast`
Japan East	`japaneast`
Korea Central	`koreacentral`
UK South	`uksouth`
Switzerland North	`switzerlandnorth`
South India	`southindia`

Regions with Standard Azure OpenAI Coverage

Support GPT-4.1 series, o-series, GPT-4o, embeddings, but fewer GPT-5 variants:

Region	Azure ID
Brazil South	`brazilsouth`
East US	`eastus`
France Central	`francecentral`
Germany West Central	`germanywestcentral`
Italy North	`italynorth`
North Central US	`northcentralus`
Norway East	`norwayeast`
Poland Central	`polandcentral`
South Africa North	`southafricanorth`
South Central US	`southcentralus`
Spain Central	`spaincentral`
UAE North	`uaenorth`
West Europe	`westeurope`
West US	`westus`
West US 3	`westus3`

Non-Azure-OpenAI Models (DeepSeek, Meta, Cohere, xAI, Mistral, Moonshot, Black Forest Labs)

Most non-OpenAI models sold directly by Azure are available in all Global Standard regions listed above. Notable exceptions:

Mistral-Large-3: West US 3 only
model-router: East US 2 and Sweden Central only
codex-mini, o3-pro: East US 2 and Sweden Central only

Registration-Required Models

These models require application approval before use:

gpt-5, gpt-5-pro, gpt-5-codex, gpt-5.1, gpt-5.1-codex, gpt-5.1-codex-max, gpt-5.2, gpt-5.2-codex
computer-use-preview
gpt-image-1, gpt-image-1-mini, gpt-image-1.5
grok-code-fast-1, grok-4

Models not requiring registration: gpt-5-mini, gpt-5-nano, gpt-5-chat, gpt-4.1 series, o4-mini, o3, all DeepSeek, Meta, Cohere, Moonshot, and most xAI models.

Workflow

Ask the user what task/modality they need (chat, reasoning, coding, embeddings, image gen, etc.)
Recommend models from "Preferred Models by Task" section above
Confirm which region works for them (suggest eastus2 or swedencentral for maximum coverage)
Confirm capacity needs (default to 100 for chat models, 30 for embedding models)
Run the azd env set commands:

azd env set AZURE_LOCATION "<selected-region>"
azd env set AI_PROJECT_DEPLOYMENTS "[<deployment-entries>]"

Quick-Start Examples

Basic Chat App

azd env set AZURE_LOCATION "eastus2"
azd env set AI_PROJECT_DEPLOYMENTS "[{'name':'gpt-5-mini', 'model': {'name': 'gpt-5-mini','format': 'OpenAI', 'version':'2025-08-07'},'sku': {'name': 'GlobalStandard','capacity':100}}]"

RAG Application (Chat + Embeddings)

azd env set AZURE_LOCATION "eastus2"
azd env set AI_PROJECT_DEPLOYMENTS "[{'name':'gpt-5-mini', 'model': {'name': 'gpt-5-mini','format': 'OpenAI', 'version':'2025-08-07'},'sku': {'name': 'GlobalStandard','capacity':100}}, {'name':'text-embedding-3-large', 'model': {'name': 'text-embedding-3-large','format': 'OpenAI', 'version':'1'},'sku': {'name': 'GlobalStandard','capacity':30}}]"

Coding Agent

azd env set AZURE_LOCATION "eastus2"
azd env set AI_PROJECT_DEPLOYMENTS "[{'name':'gpt-4.1', 'model': {'name': 'gpt-4.1','format': 'OpenAI', 'version':'2025-04-14'},'sku': {'name': 'GlobalStandard','capacity':100}}, {'name':'o4-mini', 'model': {'name': 'o4-mini','format': 'OpenAI', 'version':'2025-04-16'},'sku': {'name': 'GlobalStandard','capacity':50}}]"

Multi-Provider Setup

azd env set AZURE_LOCATION "eastus2"
azd env set AI_PROJECT_DEPLOYMENTS "[{'name':'gpt-4.1-mini', 'model': {'name': 'gpt-4.1-mini','format': 'OpenAI', 'version':'2025-04-14'},'sku': {'name': 'GlobalStandard','capacity':100}}, {'name':'DeepSeek-R1-0528', 'model': {'name': 'DeepSeek-R1-0528','format': 'DeepSeek', 'version':'1'},'sku': {'name': 'GlobalStandard','capacity':100}}]"

Cost-Optimized with Model Router

azd env set AZURE_LOCATION "eastus2"
azd env set AI_PROJECT_DEPLOYMENTS "[{'name':'model-router', 'model': {'name': 'model-router','format': 'Microsoft', 'version':'2025-11-18'},'sku': {'name': 'GlobalStandard','capacity':100}}]"

Reference Links

name

models-selector

description

Foundry Models Selector

Configure AZURE_LOCATION and AI_PROJECT_DEPLOYMENTS azd environment variables by selecting the right models and regions for the user's needs.

Preferred Models by Task & Modality

Use this section to recommend models based on what the user needs. Always suggest the best-fit model first, then offer alternatives.

Chat & General Purpose (Text In / Text Out)

Priority	Model	Format	Version	Why
Best	`gpt-5-mini`	OpenAI	`2025-08-07`	Best balance of capability, speed, and cost. No registration required.
Strong	`gpt-4.1-mini`	OpenAI	`2025-04-14`	Fast, cheap, 1M token context, great for most tasks.
Strong	`gpt-5-nano`	OpenAI	`2025-08-07`	Cheapest GPT-5 series. No registration required.
Premium	`gpt-5`	OpenAI	`2025-08-07`	Top-tier reasoning. Registration required.
Premium	`gpt-5.1`	OpenAI	`2025-11-13`	Latest reasoning model. Registration required.
Alt	`DeepSeek-V3.2`	DeepSeek	`1`	Strong open-weight alternative. All regions.
Alt	`Llama-4-Maverick-17B-128E-Instruct-FP8`	Meta	`1`	Open-weight, 1M context, multimodal. All regions.

Advanced Reasoning & Complex Problem Solving

Priority	Model	Format	Version	Why
Best	`o4-mini`	OpenAI	`2025-04-16`	Best reasoning-to-cost ratio.
Strong	`o3`	OpenAI	`2025-04-16`	Stronger reasoning, higher cost.
Strong	`DeepSeek-R1-0528`	DeepSeek	`1`	Open-weight reasoning model. All regions.
Premium	`gpt-5`	OpenAI	`2025-08-07`	Full GPT-5 reasoning. Registration required.
Premium	`gpt-5.1`	OpenAI	`2025-11-13`	Latest reasoning. Registration required.
Alt	`grok-4-fast-reasoning`	xAI	`1`	External reasoning alternative. All regions.
Alt	`Kimi-K2-Thinking`	Moonshot	`1`	Open-weight, 262K context, tool calling. All regions.

Coding & Development

Priority	Model	Format	Version	Why
Best	`gpt-4.1`	OpenAI	`2025-04-14`	1M token context, great for large codebases.
Strong	`gpt-4.1-mini`	OpenAI	`2025-04-14`	Cost-efficient coding with large context.
Strong	`codex-mini`	OpenAI	`2025-05-16`	Fine-tuned o4-mini for code. East US 2 & Sweden Central only.
Alt	`grok-code-fast-1`	xAI	`1`	External coding specialist. Registration required.
Alt	`DeepSeek-V3.2`	DeepSeek	`1`	Strong coding performance, all regions.

Multimodal (Text + Image Input)

Priority	Model	Format	Version	Why
Best	`gpt-5-mini`	OpenAI	`2025-08-07`	Vision + text, cost-efficient.
Strong	`gpt-4.1`	OpenAI	`2025-04-14`	1M context with vision.
Strong	`o4-mini`	OpenAI	`2025-04-16`	Vision + reasoning.
Alt	`Llama-4-Maverick-17B-128E-Instruct-FP8`	Meta	`1`	Open-weight multimodal, 1M context.
Alt	`Kimi-K2.5`	Moonshot	`1`	Multimodal reasoning. All regions.

Embeddings

Priority	Model	Format	Version	Why
Best	`text-embedding-3-large`	OpenAI	`1`	Best quality embeddings, 3072 dimensions.
Smaller	`text-embedding-3-small`	OpenAI	`1`	Cheaper, 1536 dimensions.
Legacy	`text-embedding-ada-002`	OpenAI	`2`	Older model, still widely used.

Image Generation

Priority	Model	Format	Version	Why
Best	`gpt-image-1`	OpenAI	`1`	Best quality. Registration required. Limited regions.
Alt	`dall-e-3`	OpenAI	`3.0`	Generally available. East US, Australia East, Sweden Central.
Alt	`FLUX.2-pro`	BlackForestLabs	`1`	Multi-reference support. All regions.

Document Processing

Priority	Model	Format	Version	Why
Best	`mistral-document-ai-2505`	MistralAI	`1`	PDF/image to text specialist. All regions.

Cost-Optimized Routing

Priority	Model	Format	Version	Why
Best	`model-router`	Microsoft	`2025-11-18`	Automatically routes to optimal model. Up to 60% savings. East US 2 & Sweden Central only.

How to Set Environment Variables

Set Azure Location

azd env set AZURE_LOCATION "<region>"

Example:

azd env set AZURE_LOCATION "eastus2"

Set Model Deployments

azd env set AI_PROJECT_DEPLOYMENTS "[{'name':'<deployment-name>', 'model': {'name': '<model-name>','format': '<format>', 'version':'<version>'},'sku': {'name': '<sku-name>','capacity':<capacity>}}]"

Single model example:

azd env set AI_PROJECT_DEPLOYMENTS "[{'name':'gpt-5-mini', 'model': {'name': 'gpt-5-mini','format': 'OpenAI', 'version':'2025-08-07'},'sku': {'name': 'GlobalStandard','capacity':100}}]"

Multiple models example:

azd env set AI_PROJECT_DEPLOYMENTS "[{'name':'gpt-4.1-mini', 'model': {'name': 'gpt-4.1-mini','format': 'OpenAI', 'version':'2025-04-14'},'sku': {'name': 'GlobalStandard','capacity':100}}, {'name':'DeepSeek-V3.2', 'model': {'name': 'DeepSeek-V3.2','format': 'DeepSeek', 'version':'1'},'sku': {'name': 'GlobalStandard','capacity':100}}]"

Deployment Entry Format

Each deployment entry in the JSON array:

{
  'name': '<deployment-name>',        # Deployment name (typically same as model name)
  'model': {
    'name': '<model-name>',           # Exact model name from tables below
    'format': '<format>',             # Provider format (see Model Catalog)
    'version': '<version>'            # Model version string
  },
  'sku': {
    'name': '<sku-name>',             # Usually 'GlobalStandard'
    'capacity': <number>              # TPM in thousands (e.g., 100 = 100K TPM)
  }
}

Model Catalog

Azure OpenAI Models (format: `OpenAI`)

Model Name	Version	Modality	Context	SKU
`gpt-5.2-codex`	`2026-01-14`	Text+Image→Text (Reasoning, Codex)	400K	GlobalStandard
`gpt-5.2`	`2025-12-11`	Text+Image→Text (Reasoning)	400K	GlobalStandard
`gpt-5.2-chat`	`2025-12-11`	Text→Text (Preview)	128K	GlobalStandard
`gpt-5.1`	`2025-11-13`	Text+Image→Text (Reasoning)	400K	GlobalStandard
`gpt-5.1-chat`	`2025-11-13`	Text→Text (Preview, Reasoning)	128K	GlobalStandard
`gpt-5.1-codex`	`2025-11-13`	Text+Image→Text (Codex)	400K	GlobalStandard
`gpt-5.1-codex-mini`	`2025-11-13`	Text+Image→Text (Codex)	400K	GlobalStandard
`gpt-5.1-codex-max`	`2025-12-04`	Text+Image→Text (Codex)	400K	GlobalStandard
`gpt-5`	`2025-08-07`	Text+Image→Text (Reasoning)	400K	GlobalStandard
`gpt-5-mini`	`2025-08-07`	Text+Image→Text (Reasoning)	400K	GlobalStandard
`gpt-5-nano`	`2025-08-07`	Text+Image→Text (Reasoning)	400K	GlobalStandard
`gpt-5-chat`	`2025-10-03`	Text+Image→Text (Preview)	128K	GlobalStandard
`gpt-5-codex`	`2025-09-11`	Text+Image→Text (Codex)	400K	GlobalStandard
`gpt-5-pro`	`2025-10-06`	Text+Image→Text (Reasoning)	400K	GlobalStandard
`gpt-4.1`	`2025-04-14`	Text+Image→Text	1M (128K std)	GlobalStandard
`gpt-4.1-mini`	`2025-04-14`	Text+Image→Text	1M (128K std)	GlobalStandard
`gpt-4.1-nano`	`2025-04-14`	Text+Image→Text	1M (128K std)	GlobalStandard
`o4-mini`	`2025-04-16`	Text+Image→Text (Reasoning)	200K	GlobalStandard
`o3`	`2025-04-16`	Text+Image→Text (Reasoning)	200K	GlobalStandard
`o3-mini`	`2025-01-31`	Text→Text (Reasoning)	200K	GlobalStandard
`o3-pro`	`2025-06-10`	Text+Image→Text (Reasoning)	200K	GlobalStandard
`o1`	`2024-12-17`	Text+Image→Text (Reasoning)	200K	GlobalStandard
`codex-mini`	`2025-05-16`	Text+Image→Text (o4-mini fine-tuned)	200K	GlobalStandard
`gpt-4o`	`2024-11-20`	Text+Image→Text	128K	GlobalStandard
`gpt-4o-mini`	`2024-07-18`	Text+Image→Text	128K	GlobalStandard
`computer-use-preview`	`2025-03-11`	Text+Image→Text (Computer Use)	8K	GlobalStandard
`text-embedding-3-large`	`1`	Text→Embedding (3072 dim)	8K	GlobalStandard
`text-embedding-3-small`	`1`	Text→Embedding (1536 dim)	8K	GlobalStandard
`text-embedding-ada-002`	`2`	Text→Embedding (1536 dim)	8K	GlobalStandard
`dall-e-3`	`3.0`	Text→Image	4K	Standard
`gpt-image-1`	`1`	Text→Image	4K	GlobalStandard
`sora-2`	`1`	Text→Video	-	GlobalStandard

Azure OpenAI OSS Models (format: `OpenAI-OSS`)

Model Name	Version	Modality	Context	SKU
`gpt-oss-120b`	`1`	Text→Text (Reasoning, Preview)	131K	GlobalStandard

DeepSeek Models (format: `DeepSeek`)

Model Name	Version	Modality	Context	SKU
`DeepSeek-V3.2-Speciale`	`1`	Text→Text (Reasoning)	128K	GlobalStandard
`DeepSeek-V3.2`	`1`	Text→Text (Reasoning)	128K	GlobalStandard
`DeepSeek-V3.1`	`1`	Text→Text (Tool calling)	131K	GlobalStandard
`DeepSeek-V3-0324`	`1`	Text→Text (Tool calling)	131K	GlobalStandard
`DeepSeek-R1-0528`	`1`	Text→Text (Reasoning)	163K	GlobalStandard
`DeepSeek-R1`	`1`	Text→Text (Reasoning)	163K	GlobalStandard

Meta Models (format: `Meta`)

Model Name	Version	Modality	Context	SKU
`Llama-4-Maverick-17B-128E-Instruct-FP8`	`1`	Text+Image→Text	1M	GlobalStandard
`Llama-3.3-70B-Instruct`	`1`	Text→Text	128K	GlobalStandard

Microsoft Models (format: `Microsoft`)

Model Name	Version	Modality	Context	SKU
`model-router`	`2025-11-18`	Text+Image→Text (Auto-routing)	200K	GlobalStandard
`MAI-DS-R1`	`1`	Text→Text (Reasoning)	163K	GlobalStandard

Mistral Models (format: `MistralAI`)

Model Name	Version	Modality	Context	SKU
`Mistral-Large-3`	`1`	Text+Image→Text (Tool calling)	-	GlobalStandard
`mistral-document-ai-2505`	`1`	Image/PDF→Text	-	GlobalStandard

Cohere Models (format: `Cohere`)

Model Name	Version	Modality	Context	SKU
`Cohere-command-a`	`1`	Text→Text (Tool calling)	131K	GlobalStandard
`Cohere-rerank-v4.0-pro`	`1`	Text→Text (Rerank)	-	GlobalStandard
`Cohere-rerank-v4.0-fast`	`1`	Text→Text (Rerank)	-	GlobalStandard
`embed-v-4-0`	`1`	Text+Image→Embedding	512	GlobalStandard

Moonshot AI Models (format: `Moonshot`)

Model Name	Version	Modality	Context	SKU
`Kimi-K2.5`	`1`	Text+Image→Text (Reasoning, Tool calling)	262K	GlobalStandard
`Kimi-K2-Thinking`	`1`	Text→Text (Reasoning, Tool calling)	262K	GlobalStandard

xAI Models (format: `xAI`)

Model Name	Version	Modality	Context	SKU
`grok-4`	`1`	Text→Text (Tool calling)	262K	GlobalStandard
`grok-4-fast-reasoning`	`1`	Text+Image→Text (Tool calling)	128K	GlobalStandard
`grok-4-fast-non-reasoning`	`1`	Text+Image→Text (Tool calling)	128K	GlobalStandard
`grok-code-fast-1`	`1`	Text→Text (Tool calling)	256K	GlobalStandard
`grok-3`	`1`	Text→Text (Tool calling)	131K	GlobalStandard
`grok-3-mini`	`1`	Text→Text (Tool calling)	131K	GlobalStandard

Black Forest Labs Models (format: `BlackForestLabs`)

Model Name	Version	Modality	Context	SKU
`FLUX.2-pro`	`1`	Text+Image→Image	-	GlobalStandard
`FLUX.1-Kontext-pro`	`1`	Text+Image→Image	-	GlobalStandard
`FLUX-1.1-pro`	`1`	Text→Image	-	GlobalStandard

Region Availability

Regions with Maximum Azure OpenAI Model Coverage

These regions have the broadest model availability. Prefer these when needing the widest selection:

Region	Azure ID	Coverage
East US 2	`eastus2`	All models — Best overall coverage. Has every Azure OpenAI model + audio + image + video + codex.
Sweden Central	`swedencentral`	Near-complete — All core models + audio + image + video. Missing only a few niche models vs East US 2.
Central US	`centralus`	Good coverage — GPT-5 series, GPT-4.1, o-series, plus some audio models.

Regions with Good Azure OpenAI Coverage

These regions support all mainstream models (GPT-5-mini/nano, GPT-4.1 series, o-series, GPT-4o, embeddings):

Region	Azure ID
Australia East	`australiaeast`
Canada East	`canadaeast`
Japan East	`japaneast`
Korea Central	`koreacentral`
UK South	`uksouth`
Switzerland North	`switzerlandnorth`
South India	`southindia`

Regions with Standard Azure OpenAI Coverage

Support GPT-4.1 series, o-series, GPT-4o, embeddings, but fewer GPT-5 variants:

Region	Azure ID
Brazil South	`brazilsouth`
East US	`eastus`
France Central	`francecentral`
Germany West Central	`germanywestcentral`
Italy North	`italynorth`
North Central US	`northcentralus`
Norway East	`norwayeast`
Poland Central	`polandcentral`
South Africa North	`southafricanorth`
South Central US	`southcentralus`
Spain Central	`spaincentral`
UAE North	`uaenorth`
West Europe	`westeurope`
West US	`westus`
West US 3	`westus3`

Non-Azure-OpenAI Models (DeepSeek, Meta, Cohere, xAI, Mistral, Moonshot, Black Forest Labs)

Most non-OpenAI models sold directly by Azure are available in all Global Standard regions listed above. Notable exceptions:

Mistral-Large-3: West US 3 only
model-router: East US 2 and Sweden Central only
codex-mini, o3-pro: East US 2 and Sweden Central only

Registration-Required Models

These models require application approval before use:

gpt-5, gpt-5-pro, gpt-5-codex, gpt-5.1, gpt-5.1-codex, gpt-5.1-codex-max, gpt-5.2, gpt-5.2-codex
computer-use-preview
gpt-image-1, gpt-image-1-mini, gpt-image-1.5
grok-code-fast-1, grok-4

Models not requiring registration: gpt-5-mini, gpt-5-nano, gpt-5-chat, gpt-4.1 series, o4-mini, o3, all DeepSeek, Meta, Cohere, Moonshot, and most xAI models.

Workflow

Ask the user what task/modality they need (chat, reasoning, coding, embeddings, image gen, etc.)
Recommend models from "Preferred Models by Task" section above
Confirm which region works for them (suggest eastus2 or swedencentral for maximum coverage)
Confirm capacity needs (default to 100 for chat models, 30 for embedding models)
Run the azd env set commands:

azd env set AZURE_LOCATION "<selected-region>"
azd env set AI_PROJECT_DEPLOYMENTS "[<deployment-entries>]"

Quick-Start Examples

Basic Chat App

azd env set AZURE_LOCATION "eastus2"
azd env set AI_PROJECT_DEPLOYMENTS "[{'name':'gpt-5-mini', 'model': {'name': 'gpt-5-mini','format': 'OpenAI', 'version':'2025-08-07'},'sku': {'name': 'GlobalStandard','capacity':100}}]"

RAG Application (Chat + Embeddings)

azd env set AZURE_LOCATION "eastus2"
azd env set AI_PROJECT_DEPLOYMENTS "[{'name':'gpt-5-mini', 'model': {'name': 'gpt-5-mini','format': 'OpenAI', 'version':'2025-08-07'},'sku': {'name': 'GlobalStandard','capacity':100}}, {'name':'text-embedding-3-large', 'model': {'name': 'text-embedding-3-large','format': 'OpenAI', 'version':'1'},'sku': {'name': 'GlobalStandard','capacity':30}}]"

Coding Agent

azd env set AZURE_LOCATION "eastus2"
azd env set AI_PROJECT_DEPLOYMENTS "[{'name':'gpt-4.1', 'model': {'name': 'gpt-4.1','format': 'OpenAI', 'version':'2025-04-14'},'sku': {'name': 'GlobalStandard','capacity':100}}, {'name':'o4-mini', 'model': {'name': 'o4-mini','format': 'OpenAI', 'version':'2025-04-16'},'sku': {'name': 'GlobalStandard','capacity':50}}]"

Multi-Provider Setup

azd env set AZURE_LOCATION "eastus2"
azd env set AI_PROJECT_DEPLOYMENTS "[{'name':'gpt-4.1-mini', 'model': {'name': 'gpt-4.1-mini','format': 'OpenAI', 'version':'2025-04-14'},'sku': {'name': 'GlobalStandard','capacity':100}}, {'name':'DeepSeek-R1-0528', 'model': {'name': 'DeepSeek-R1-0528','format': 'DeepSeek', 'version':'1'},'sku': {'name': 'GlobalStandard','capacity':100}}]"

Cost-Optimized with Model Router

azd env set AZURE_LOCATION "eastus2"
azd env set AI_PROJECT_DEPLOYMENTS "[{'name':'model-router', 'model': {'name': 'model-router','format': 'Microsoft', 'version':'2025-11-18'},'sku': {'name': 'GlobalStandard','capacity':100}}]"

models-selector

Foundry Models Selector

Preferred Models by Task & Modality

Chat & General Purpose (Text In / Text Out)

Advanced Reasoning & Complex Problem Solving

Coding & Development

Multimodal (Text + Image Input)

Embeddings

Image Generation

Document Processing

Cost-Optimized Routing

How to Set Environment Variables

Set Azure Location

Set Model Deployments

Deployment Entry Format

Model Catalog

Azure OpenAI Models (format: OpenAI)

Azure OpenAI OSS Models (format: OpenAI-OSS)

DeepSeek Models (format: DeepSeek)

Meta Models (format: Meta)

Microsoft Models (format: Microsoft)

Mistral Models (format: MistralAI)

Cohere Models (format: Cohere)

Moonshot AI Models (format: Moonshot)

xAI Models (format: xAI)

Black Forest Labs Models (format: BlackForestLabs)

Region Availability

Regions with Maximum Azure OpenAI Model Coverage

Regions with Good Azure OpenAI Coverage

Regions with Standard Azure OpenAI Coverage

Non-Azure-OpenAI Models (DeepSeek, Meta, Cohere, xAI, Mistral, Moonshot, Black Forest Labs)

Registration-Required Models

Workflow

Quick-Start Examples

Basic Chat App

RAG Application (Chat + Embeddings)

Coding Agent

Multi-Provider Setup

Cost-Optimized with Model Router

Reference Links

Foundry Models Selector

Preferred Models by Task & Modality

Chat & General Purpose (Text In / Text Out)

Advanced Reasoning & Complex Problem Solving

Coding & Development

Multimodal (Text + Image Input)

Embeddings

Image Generation

Document Processing

Cost-Optimized Routing

How to Set Environment Variables

Set Azure Location

Set Model Deployments

Deployment Entry Format

Model Catalog

Azure OpenAI Models (format: OpenAI)

Azure OpenAI OSS Models (format: OpenAI-OSS)

DeepSeek Models (format: DeepSeek)

Meta Models (format: Meta)

Microsoft Models (format: Microsoft)

Mistral Models (format: MistralAI)

Cohere Models (format: Cohere)

Moonshot AI Models (format: Moonshot)

xAI Models (format: xAI)

Black Forest Labs Models (format: BlackForestLabs)

Region Availability

Regions with Maximum Azure OpenAI Model Coverage

Regions with Good Azure OpenAI Coverage

Regions with Standard Azure OpenAI Coverage

Non-Azure-OpenAI Models (DeepSeek, Meta, Cohere, xAI, Mistral, Moonshot, Black Forest Labs)

Registration-Required Models

Workflow

Quick-Start Examples

Basic Chat App

RAG Application (Chat + Embeddings)

Coding Agent

Multi-Provider Setup

Cost-Optimized with Model Router

Reference Links

Azure OpenAI Models (format: `OpenAI`)

Azure OpenAI OSS Models (format: `OpenAI-OSS`)

DeepSeek Models (format: `DeepSeek`)

Meta Models (format: `Meta`)

Microsoft Models (format: `Microsoft`)

Mistral Models (format: `MistralAI`)

Cohere Models (format: `Cohere`)

Moonshot AI Models (format: `Moonshot`)

xAI Models (format: `xAI`)

Black Forest Labs Models (format: `BlackForestLabs`)

Azure OpenAI Models (format: `OpenAI`)

Azure OpenAI OSS Models (format: `OpenAI-OSS`)

DeepSeek Models (format: `DeepSeek`)

Meta Models (format: `Meta`)

Microsoft Models (format: `Microsoft`)

Mistral Models (format: `MistralAI`)

Cohere Models (format: `Cohere`)

Moonshot AI Models (format: `Moonshot`)

xAI Models (format: `xAI`)

Black Forest Labs Models (format: `BlackForestLabs`)