一键在 Manus 中运行任何 Skill

$pwd:

documentdb-high-availability

Name: Documentdb High Availability
Author: Azure

// High availability, business-continuity, and disaster-recovery best practices for Azure DocumentDB — enabling in-region HA with availability zones (99.99% SLA), adding active-passive cross-region replica clusters (99.995% SLA), and understanding automatic backup retention. Use when designing production topology, planning failover, provisioning DR, picking regions, or reviewing cluster architecture.

在 Manus 中运行

$ git log --oneline --stat

stars:5

forks:4

updated:2026年5月12日 20:36

文件资源管理器

4 个文件

SKILL.md

readonly

name	documentdb-high-availability
description	High availability, business-continuity, and disaster-recovery best practices for Azure DocumentDB — enabling in-region HA with availability zones (99.99% SLA), adding active-passive cross-region replica clusters (99.995% SLA), and understanding automatic backup retention. Use when designing production topology, planning failover, provisioning DR, picking regions, or reviewing cluster architecture.
license	MIT

High Availability, Replication & DR — Azure DocumentDB

Azure DocumentDB's resiliency model has three layers. Pick the right combination for the workload — production-critical workloads should use all three.

Layer	What it protects against	SLA contribution	Automatic?
In-region HA (standby shard per primary, synchronous replication)	Node / zone failures within a region	99.99%	✅ Failover is automatic; connection string is unchanged
Cross-region replica (active-passive, asynchronous)	Regional outage; provides read scale-out	+ 0.005% → 99.995% combined	❌ Promotion is customer-triggered (shared-responsibility DR); HA must be re-enabled on the promoted cluster
Automatic backups (35 d active / 7 d deleted clusters)	Accidental deletion or corruption	—	✅ Continuous, no perf impact

Replication model at a glance

Primary ↔ standby shard (in-region): synchronous — every write commits to both before the client gets an ack, so failover is lossless and reads on the standby (after promotion) are strongly consistent. With HA on, each shard has 6 replicas in total: 3 LRS replicas under the primary shard + 3 LRS replicas under the standby shard. In AZ-enabled regions the primary and standby sit in different availability zones.
Primary cluster ↔ cross-region replica: asynchronous — design for eventual consistency on the replica. Some writes acknowledged on the primary may not yet be on the replica, so regional promotion has a non-zero RPO (recent writes can be lost). Replication lag scales with the primary's write intensity and the load on both clusters.
Without HA: each shard uses locally-redundant storage (LRS) with 3 synchronous Azure Storage replicas. Single-replica failures are auto-healed by Azure Storage (CRC checks + network checksums protect against silent corruption), but a zone or region failure can cause downtime and possible data loss. HA is also a prerequisite for availability-zone placement.

Applications connect to a cluster through a single connection string and endpoint regardless of shard count. The multi-shard topology is fully abstracted — a 16-shard cluster looks like one MongoDB endpoint to the driver.

Feature comparison

How HA and cross-region replicas protect different failure modes:

Failure scenario	Feature	No data loss (RPO = 0)	Survives region-wide outage	Automatic failover	Connection string preserved
Physical shard / zone failure	In-region HA	✅ (synchronous)	❌	✅	✅
Regional outage	Cross-region replica	❌ (asynchronous; RPO > 0)	✅	❌ (customer-triggered)	✅ ¹

¹ Only when the application uses the Global read-write connection string (<cluster>.global.mongocluster.cosmos.azure.com). The cluster-specific / "self" connection string becomes read-only after promotion.

Best-practice decision matrix

Scenario	Recommendation
Production cluster	Enable HA
Need 99.99% SLA	Enable HA
Need 99.995% SLA	Enable HA and create a cross-region replica
Automatic failover from node/zone failure	Enable HA
Cross-region disaster recovery	Create a replica cluster
Read scale-out within a single region (analytics / reporting offload)	Create a same-region replica (no DR benefit; you can have only one replica per primary, so this trades cross-region DR for in-region read offload)
Read scale-out across regions	Create a replica cluster
Availability-zone placement required	Enable HA (HA is required for AZ support)
Non-production / dev-test cluster	Disable HA to reduce cost
Recover from accidental delete/modify	Automatic backups (35-day retention for active clusters)

Rules

ha-enable-for-production — Enable HA on all production clusters for the 99.99% SLA, automatic failover, zone redundancy, and zero-data-loss synchronous replication.
ha-cross-region-replica — Add an active-passive replica (cross-region for DR + read scale-out, same-region for pure read scale-out); HA + cross-region replica = 99.995% SLA. Includes the post-promotion runbook (re-enable HA, update connection strings, restore the replica).
ha-backup-retention — Automatic backups are taken continuously and retained for 35 days on active clusters and 7 days on deleted clusters. Use them to recover from accidental deletes or modifications.

References

Best practices for HA and cross-region replication in Azure DocumentDB
Reliability in Azure DocumentDB
Availability and disaster recovery in Azure DocumentDB — behind the scenes — cluster anatomy, 6-replica HA layout, replication-lag drivers

related-skills.json

同仓库

documentdb-indexing.md

from "Azure/documentdb-agent-kit"

Index-type selection and shape guidance for Azure DocumentDB — when to use single-field, compound (ESR), multikey, wildcard, hashed, 2dsphere, TTL, and vector indexes; query-pattern → index-shape cookbook; per-collection index budget; DocumentDB-specific preference for `textSearch` over community `$text`. Use when designing or reviewing indexes, choosing an index type for a query pattern, or deciding whether an additional index is worth the write cost.

2026-05-185

documentdb-mcp-setup.md

from "Azure/documentdb-agent-kit"

Guide users through configuring the DocumentDB MCP server for Azure DocumentDB. Use this skill when a user has the DocumentDB MCP server installed but hasn't configured the required environment variables, or when they ask about connecting to Azure DocumentDB and don't have the credentials set up.

2026-05-185

documentdb-natural-language-querying.md

from "Azure/documentdb-agent-kit"

Generate read-only DocumentDB/MongoDB queries (find) or aggregation pipelines using natural language, with collection schema context and sample documents. Use this skill whenever the user asks to write, create, or generate queries for Azure DocumentDB, wants to filter/query/aggregate data, asks "how do I query...", needs help with query syntax, or discusses finding/filtering/grouping documents. Also use for translating SQL-like requests to MongoDB syntax. Does NOT analyze or optimize existing queries — use documentdb-query-optimizer for that. Requires DocumentDB MCP server.

2026-05-185

documentdb-sharding.md

from "Azure/documentdb-agent-kit"

Horizontal sharding (partitioning) for Azure DocumentDB collections — when to shard vs stay single-shard, how to pick a shard key for read-heavy vs write-heavy workloads, the logical/physical shard mental model, scaling out vs scaling up, hot-partition diagnosis, and the `sh.shardCollection` / `sh.reshardCollection` commands. Use when deciding whether to shard a collection, choosing or changing a shard key, sizing a cluster, or troubleshooting uneven storage / throughput across physical shards.

2026-05-125

documentdb-storage.md

from "Azure/documentdb-agent-kit"

Storage configuration guidance for Azure DocumentDB — when and how to use Premium SSD v2 high-performance storage, IOPS/bandwidth caps that are gated by compute tier (not disk size), Premium SSD v2 limitations (no CMK, migration paths, disk-hydration sequencing), and storage capacity change limits. Use when picking a storage type at cluster creation, sizing for I/O-intensive workloads, migrating from Premium SSD to Premium SSD v2, or sequencing compute/storage/HA changes on a Premium SSD v2 cluster.

2026-05-125

documentdb-azure-deployment.md

from "Azure/documentdb-agent-kit"

Deploy an Azure DocumentDB cluster (`Microsoft.DocumentDB/mongoClusters`) end-to-end — Bicep (primary), Azure CLI one-shot, Terraform, or portal. Covers resource-group creation, cluster parameters (tier, storage, server version, sharding, HA), firewall rule configuration, retrieving the connection string, and teardown. Use when the user asks to provision, create, deploy, or spin up an Azure DocumentDB cluster, or wants infrastructure-as-code for one.

2026-05-085

package.json

"author": "Azure"

"repository": "Azure/documentdb-agent-kit"

打开 GitHub 仓库查看创作者相关仓库

$ install --global

$ download --local

在 Manus 中运行

$ useful --forSOC

数据库架构师计算机与数学类职业15-1243L4

name	documentdb-high-availability
description	High availability, business-continuity, and disaster-recovery best practices for Azure DocumentDB — enabling in-region HA with availability zones (99.99% SLA), adding active-passive cross-region replica clusters (99.995% SLA), and understanding automatic backup retention. Use when designing production topology, planning failover, provisioning DR, picking regions, or reviewing cluster architecture.
license	MIT

High Availability, Replication & DR — Azure DocumentDB

Azure DocumentDB's resiliency model has three layers. Pick the right combination for the workload — production-critical workloads should use all three.

Layer	What it protects against	SLA contribution	Automatic?
In-region HA (standby shard per primary, synchronous replication)	Node / zone failures within a region	99.99%	✅ Failover is automatic; connection string is unchanged
Cross-region replica (active-passive, asynchronous)	Regional outage; provides read scale-out	+ 0.005% → 99.995% combined	❌ Promotion is customer-triggered (shared-responsibility DR); HA must be re-enabled on the promoted cluster
Automatic backups (35 d active / 7 d deleted clusters)	Accidental deletion or corruption	—	✅ Continuous, no perf impact

Replication model at a glance

Primary ↔ standby shard (in-region): synchronous — every write commits to both before the client gets an ack, so failover is lossless and reads on the standby (after promotion) are strongly consistent. With HA on, each shard has 6 replicas in total: 3 LRS replicas under the primary shard + 3 LRS replicas under the standby shard. In AZ-enabled regions the primary and standby sit in different availability zones.
Primary cluster ↔ cross-region replica: asynchronous — design for eventual consistency on the replica. Some writes acknowledged on the primary may not yet be on the replica, so regional promotion has a non-zero RPO (recent writes can be lost). Replication lag scales with the primary's write intensity and the load on both clusters.
Without HA: each shard uses locally-redundant storage (LRS) with 3 synchronous Azure Storage replicas. Single-replica failures are auto-healed by Azure Storage (CRC checks + network checksums protect against silent corruption), but a zone or region failure can cause downtime and possible data loss. HA is also a prerequisite for availability-zone placement.

Applications connect to a cluster through a single connection string and endpoint regardless of shard count. The multi-shard topology is fully abstracted — a 16-shard cluster looks like one MongoDB endpoint to the driver.

Feature comparison

How HA and cross-region replicas protect different failure modes:

Failure scenario	Feature	No data loss (RPO = 0)	Survives region-wide outage	Automatic failover	Connection string preserved
Physical shard / zone failure	In-region HA	✅ (synchronous)	❌	✅	✅
Regional outage	Cross-region replica	❌ (asynchronous; RPO > 0)	✅	❌ (customer-triggered)	✅ ¹

Best-practice decision matrix

Scenario	Recommendation
Production cluster	Enable HA
Need 99.99% SLA	Enable HA
Need 99.995% SLA	Enable HA and create a cross-region replica
Automatic failover from node/zone failure	Enable HA
Cross-region disaster recovery	Create a replica cluster
Read scale-out within a single region (analytics / reporting offload)	Create a same-region replica (no DR benefit; you can have only one replica per primary, so this trades cross-region DR for in-region read offload)
Read scale-out across regions	Create a replica cluster
Availability-zone placement required	Enable HA (HA is required for AZ support)
Non-production / dev-test cluster	Disable HA to reduce cost
Recover from accidental delete/modify	Automatic backups (35-day retention for active clusters)

Rules

ha-enable-for-production — Enable HA on all production clusters for the 99.99% SLA, automatic failover, zone redundancy, and zero-data-loss synchronous replication.
ha-cross-region-replica — Add an active-passive replica (cross-region for DR + read scale-out, same-region for pure read scale-out); HA + cross-region replica = 99.995% SLA. Includes the post-promotion runbook (re-enable HA, update connection strings, restore the replica).
ha-backup-retention — Automatic backups are taken continuously and retained for 35 days on active clusters and 7 days on deleted clusters. Use them to recover from accidental deletes or modifications.

References

Best practices for HA and cross-region replication in Azure DocumentDB
Reliability in Azure DocumentDB
Availability and disaster recovery in Azure DocumentDB — behind the scenes — cluster anatomy, 6-replica HA layout, replication-lag drivers

documentdb-high-availability

High Availability, Replication & DR — Azure DocumentDB

Replication model at a glance

Feature comparison

Best-practice decision matrix

Rules

References

同仓库更多 Skills

同仓库更多 Skills

High Availability, Replication & DR — Azure DocumentDB

Replication model at a glance

Feature comparison

Best-practice decision matrix

Rules

References