×

AWS Summit Online Australia & New Zealand

First Name
Last Name
Company Name
Country
CDN Province
US State
India State
AU State
Postal Code
Phone Number
Job Role
Industry
This information is associated with my:
I acknowledge that I have read and agree to the AWS Privacy Policy and AWS Event Code of Conduct.
Compliance Opt-in
By completing this form, I agree that I'd like to receive information from Amazon Web Services, Inc. and its affiliates related to AWS services, events and special offers, and my AWS needs by email and post. You may unsubscribe at any time by following the instructions in the communications received. Your information will be handled in accordance with the AWS Privacy Policy.
Thank you!
Error - something went wrong!

The art of successful Kubernetes failures

In the real world, things don't always go the way you want them to. Even when you’ve designed your Kubernetes cluster and the services it hosts to be highly available, scalable, and resilient, sometimes they fail anyway. These failures, if used correctly, can help you gain a deep understanding of how your system works and can act as tools that help spread knowledge throughout your engineering community. In this session, we cover expert techniques for defining and reviewing cluster-, node-, and pod-level metrics and for watching Kubernetes-based services before they fail. You also learn how to perform post-failure analyses that drives learning and meaningful improvement.

Speaker:
Mitch Beaumont, Principal Solutions Architect, Amazon Web Services

Download presentation:
The art of successful Kubernetes failures

Resource:
Operational Insights for Containers and Containerised Applications

Previous Video
Operations for serverless
Operations for serverless

Traditional operations are designed for long living infrastructure, low to medium velocity of changes, and ...

Next Video
Building resilient applications using chaos engineering on AWS
Building resilient applications using chaos engineering on AWS

With the wide adoption of microservices and large-scale distributed systems, architectures have grown incre...