Video description
Reliability in AWS includes the ability of a system to recover from infrastructure or service disruptions. It's essential to acquire computing resources to meet the demand, and mitigate disruptions such as configuration issues or transient network problems.
In this course, you will first explore the key concepts and core services of AWS and Site Reliability Engineering (SRE). We show you step-by-step how to implement a real-world application that is built via the reliability principles defined within the AWS Well-Architected Framework using the SRE approach. So you can increase the reliability of application architectures on AWS by implementing resilience infrastructure and application resilience.
You will be covering some common architectural patterns used every day by real-world AWS solution architects to build reliable systems and implement fault tolerance into an application architecture running on AWS. While learning how to further increase the reliability of application architectures on AWS by implementing multi-region solutions for disaster recovery on a global scale.
By the end of this course, you will have gained a variety of AWS architecture skills that you can then apply to the real world.
What You Will Learn
- Understand the core principles of Site Reliability Engineering, and how cloud computing enables this
- Design applications for fault tolerance, auto-healing, resilience, and reliability
- Examine a simple python microservice ecosystem and understand its limitations
- Identify critical stack components, and redesign them so they re resilient and reliable
- Map design changes to native AWS services with ease
- Deploy redesigned applications in a globally accessible, resilient, and reliable way
Audience
Java developers, software engineers, students, or anyone who needs a thorough, reliable, and easy to understand resource that will help them move ahead in their career, will find this course useful.
Prior experience with coding in Java is assumed.
About The Author
Malcolm Orr: Malcolm Orr Is a Principal Architect in AWS Professional Services. He holds 7 AWS certifications along with CKAD and spends his time working with AWS customers to build, deploy and manage cloud native applications and microservices. Before AWS, Malcolm has worked in a number of roles including author, contractor, chief startup dogs body and advisory practice lead and enjoys the solving technical challenges.
Table of contents
- Chapter 1 : The Basics of Site Reliability Engineering
- Chapter 2 : Gaining Resilience and Reliability On AWS
- Chapter 3 : Accepting Failure In Multi-Tier Applications
- Chapter 4 : Deploying Py-Simple On AWS
- Chapter 5 : Designing Py-Global
- Chapter 6 : Deploying a Resilient, Fault Tolerant Py-Global Application
- Chapter 7 : Surviving Failure of a Global Scale
Product information
- Title: Site Reliability Engineering on AWS
- Author(s):
- Release date: June 2020
- Publisher(s): Packt Publishing
- ISBN: 9781800205970
You might also like
video
Site Reliability Engineering Fundamentals
Over the past five years, the ideas behind site reliability engineering (SRE) have caught fire because …
book
System Design on AWS
Enterprises building complex and large-scale applications in the cloud face multiple challenges. From figuring out the …
video
Amazon Web Services (AWS), 3rd Edition
18+ Hours of Video Instruction More than 18 Hours of Video Instruction Covering Cloud Computing and …
book
Site Reliability Engineering
The overwhelming majority of a software system's lifespan is spent in use, not in design or …