AWS Well-Architected Framework

Tanner Jones
Nerd For Tech
Published in
8 min readMay 2, 2023

--

Image Source

Hello, I hope that this article finds you well in your learning journey. In this short article I will discuss the Well-Architected Framework in the cloud. It is important to to understand the importance of a solid design framework in the cloud. The cloud is allows you to be agile and and focus on providing services to meet a business need rather than focusing on the underlying infrastructure. This is one of the major contributors to the wide adoption the cloud. “Using the Framework helps you learn architectural best practices for designing and operating secure, reliable, efficient, cost-effective, and sustainable workloads in the AWS Cloud.” (Well-Architected Framework, 2023)

The 6 pillars outlines architectural best practices for designing workloads that run in the cloud. If you are planning on working in the cloud or studying for the AWS certificate this short explanation will come in handy! AWS provides a great learning resources such as whitepapers. I want to break these 6 pillars so that they are easy to understand.

6 Pillars of Well-Architected Framework

  1. Operational Excellence
  2. Security
  3. Reliability
  4. Performance Efficiency
  5. Cost Optimization
  6. Sustainability
Image Source

Operational Excellence

The goal of operational excellence is to get new features and bug fixes into customers’ hands quickly and reliably. The software within the workloads need to be built correctly while consistently delivering a great customer experience. The pillar focuses on operating correctly and reliably. There are design principles and best practices to achieve operational excellence.

Design Principles:

These design principles are pretty strait forward so I won’t go into more detail. You can find out more here.

  • Perform operations as code
  • Make frequent, small, reversible changes
  • Refine operations procedures frequently
  • Anticipate failure
  • Learn from all operational failures

Best Practices:

  • Organization: Understand the organization’s priorities, organizational structure, and how the organization supports the team members.
  • Prepare: Have to understand the workloads and their expected behaviors. You will then be able to design them to provide insight to their status and build the procedures to support them.
  • Operate: By understanding the health of the workload and operations, you can identify when organizational and business outcomes may become at risk, or are at risk, and respond appropriately.
  • Evolve: Implement frequent small incremental changes based on the lessons learned from your operations activities and evaluate the improvement.

To learn more about the the pillar look at the whitepaper.

Whitepaper: Operational excellence

Security

Describes how to take advantage of cloud technologies to protect data, systems, and assets to improve the security posture. This topic is dense and requires in-depth knowledge into the internal works of AWS. I will highlight a few topics that are important to understand. It is worth noting that there are entire careers that revolve around securing the workloads. This is my particular field of interest and plan on writing more articles on this topic.

Cloud Responsibility Model

This outlines the responsibilities of the cloud provider, in case it is AWS and responsibilities of the customer. It is worth noting that there are grey areas within this model and it is difficult for customers. Here is a use-case looking at a data breach that demonstrates the cloud responsibility model.

AWS Responsibility— Responsible for protecting the infrastructure that runs all of the services offered in the AWS Cloud. This infrastructure is composed of the hardware, software, networking, and facilities that run AWS Cloud services.

Customer Responsibility — Customer responsibility will be determined by the AWS Cloud services that a customer selects. This determines the amount of configuration work the customer must perform as part of their security responsibilities.

AWS Account Management

Organize workloads in separate accounts and group accounts based on function, compliance requirements.

Operating your workloads securely

Operating workloads securely covers the whole lifecycle of a workload from design, to build, to run, and to ongoing improvement.

Identity and access management

Must grant your users and applications access to resources in your AWS accounts.

Detection

Detection consists of two parts: detection of unexpected or unwanted configuration changes, and the detection of unexpected behavior.

There are other topics that fall under this pillar such as data protection, application security, infrastructure protection, and incident response. There is enough information to study all of these topics independently.

To learn more about the the pillar look at the whitepaper.

Whitepaper: Security

Reliability

Encompasses the ability of a workload to perform its intended function correctly and consistently when it’s expected to.

Design Principles

Automatically recover from failure: By monitoring a workload for key performance indicators (KPIs), you can start automation when a threshold is breached.

Test recovery procedures: In an on-premises environment, testing is often conducted to prove that the workload works in a particular scenario.

Scale horizontally to increase aggregate workload availability: Replace one large resource with multiple small resources to reduce the impact of a single failure on the overall workload.

Stop guessing capacity: A common cause of failure in on-premises workloads is resource saturation, when the demands placed on a workload exceed the capacity of that workload such as DoS attack.

Manage change in automation: Changes to your infrastructure should be made using automation. The changes that must be managed, tracked and reviewed.

Best Practices

Foundations: Before architecting any system, foundational requirements that influence reliability should be in place.

Workload architecture: A reliable workload starts with upfront design decisions for both software and infrastructure.

Change management: Changes to the workload or environment must be anticipated and accommodated to achieve reliable operation of the workload.

Failure management: Requires that the workload is aware of failures as they occur and take action to avoid impact on availability.

To learn more about the the pillar look at the whitepaper.

Whitepaper: Reliability

Performance Efficiency

Focuses on the efficient use of computing resources to meet requirements, and how to maintain efficiency as demand changes and technologies evolve.

Design Principles

Democratize advanced technologies: Make advanced technology implementation smoother for your team by delegating complex tasks to your cloud vendor.

Go global in minutes: Deploying your workload in multiple AWS Regions around the world permits you to provide lower latency and a better experience for your customers at minimal cost.

Use serverless architectures: Serverless architectures remove the need for you to run and maintain physical servers for traditional compute activities.

Experiment more often: Can quickly carry out comparative testing using different types of instances, storage, or configurations.

Consider mechanical sympathy: Understand how cloud services are consumed and always use the technology approach that aligns with your workload goals.

Best Practices

Selection: Choose the correction components of the workload such as compution and storage.

Review: Must verify that workload components are using the latest technologies and approaches to continually improve performance.

Monitoring: Must monitor its performance so that you can remediate any issues before they impact your customers.

Tradeoffs: Depending on your situation, you could trade consistency, durability, and space for time or latency, to deliver higher performance.

To learn more about the the pillar look at the whitepaper.

Whitepaper: Performance Efficiency

Cost Optimization

Help build and operate cost-aware workloads that achieve business outcomes while minimizing costs and allowing your organization to maximize its return on investment.

Design principles

Implement Cloud Financial Management: To achieve financial success and accelerate business value realization in the cloud.

Adopt a consumption model: Pay only for the computing resources that you require and increase or decrease usage depending on business requirements.

Measure overall efficiency: Measure the business output of the workload and the costs associated with delivering it.

Stop spending money on undifferentiated heavy lifting: AWS does the heavy lifting of data center operations like racking, stacking, and powering servers.

Analyze and attribute expenditure: The cloud makes it simple to accurately identify the usage and cost of systems. Permits transparent attribution of IT costs to individual workload owners.

Best Practices

Practice Cloud Financial Management: Builds capability across your organization by implementing organizational wide knowledge building, programs, resources, and processes.

Expenditure and usage awareness: Understanding the organization’s costs and drivers is critical for managing your cost and usage effectively, and identifying cost-reduction opportunities.

Cost effective resources: Using the appropriate services, resources, and configurations for your workloads is key to cost savings.

Manage demand and supply resources: Supply resources to match the workload demand at the time they’re needed. This eliminates the need for costly and wasteful over-provisioning.

Optimize over time: Optimize over time by reviewing new services and implementing them in your workload.

To learn more about the the pillar look at the whitepaper.

Whitepaper: Cost Optimization

Sustainability

The discipline of sustainability addresses the long-term environmental, economic, and societal impact of your business activities.

There are a variety of best practices and improvements processes to help reduce the greenhouse gasses and energy consumed by workloads in the cloud. Automation and monitoring aids in examining trends and reach organizational goals. AWS is setting goals to make the data centers green by using renewable energy and reduce water being used to cool the data centers.

To learn more about the the pillar look at the whitepaper.

Whitepaper: Sustainability

Conclusion

Image Source

AWS Well-Architected Framework aid in the design and deployment of architectural sound workloads based upon best practices for designing and operating secure, reliable, efficient, cost-effective, and sustainable workloads in the AWS Cloud. The short article covered the framework at a high-level and I hope that it aids your learning.

Resources:

Well-Architected Framework. (April, 2023). Amazon Web Services. Retrieved May 2, 2023, from https://docs.aws.amazon.com/wellarchitected/latest/framework/welcome.html

--

--

Tanner Jones
Nerd For Tech

I am passionate about technology and I am curious of how things work. I write to learn and help others learn about a variety of topics. I love the outdoors!