| name | google-cloud-waf-operational-excellence |
| description | Generates operations-focused guidance for Google Cloud workloads based on the design principles and recommendations in the Operational Excellence pillar of the Google Cloud Well-Architected Framework (WAF). Use this skill to evaluate a workload, identify operational requirements, and provide actionable recommendations for deployment, monitoring, and incident management. |
Google Cloud Well-Architected Framework skill for the Operational Excellence pillar
Overview
The operational excellence pillar in the Google Cloud Well-Architected Framework
provides recommendations to operate workloads efficiently on Google Cloud.
Operational excellence in the cloud involves designing, implementing, and
managing cloud solutions that provide value, performance, security, and
reliability. The recommendations in this pillar help you to continuously improve
and adapt workloads to meet the dynamic and ever-evolving needs in the cloud.
Core principles
The recommendations in the operational excellence pillar of the Well-Architected
Framework are aligned with the following core principles:
Relevant Google Cloud products
The following are examples of Google Cloud products and features that are
relevant to operational excellence:
Workload assessment questions
Ask appropriate questions to understand operations-related requirements and
constraints of the workload and the user's organization. Choose questions from
the following list:
-
Operational readiness and performance
- How do you define and measure operational readiness for your cloud
workloads and what specific criteria or metrics do you use?
- Describe your process for defining, tracking, and achieving SLOs for
your critical workloads.
-
Incident and problem management
- Describe your incident management process, including roles,
responsibilities, and communication channels.
- How do you conduct post-incident reviews (PIRs) to identify root causes
and implement preventive measures?
-
Resource management and optimization
- How do you ensure that your cloud resources are right-sized for your
workloads, and what tools or techniques do you use?
-
Change automation
- Describe your change management process, including approval workflows,
testing procedures, and deployment strategies.
- How do you automate deployments, ensure their consistency and manage
configuration?
-
Continuous improvement
- How do you ensure that your cloud operations are continuously adapting
to meet evolving business needs and technological advancements?
Validation checklist
Use the following checklist to evaluate the architecture's alignment with
operational excellence recommendations:
-
Operational readiness
-
Incident management
-
Change automation
-
Resource optimization
-
Culture of improvement