Improve the Reliability of Your Kubernetes Clusters

Reliability and high availability are key features of Kubernetes, but even the most resilient systems can fail. Applications crash, hardware breaks and nodes can go offline. These failures can have damaging and unpredictable consequences for organizations, especially those that are unprepared.

We’ll explore how to improve the availability and reliability of Kubernetes clusters using the discipline of Chaos Engineering. You will also have an opportunity to ask questions of our experts during our live Q&A segment.

In this webinar:

You will learn how to use Chaos Engineering to safely inject failure into your applications and nodes in order to detect weaknesses.
We’ll walk through specific Chaos Experiments for you to run on Kubernetes to ensure you’ve designed a reliable system.
You’ll have specific recommendations for how to harden your infrastructure, improve reliability and keep your applications running smoothly.

Ana Margarita Medina

Senior Chaos Engineer at Gremlin

Ana Margarita is currently working as a senior chaos engineer at Gremlin, helping companies avoid outages by running proactive chaos engineering experiments. Before Gremlin, she has worked at various-sized companies including Google, Uber, SFEFCU and Miami-based startup. Ana is an internationally recognized speaker and has spoken at: AWS re:Invent, KubeCon, DockerCon, DevOpDays, AllDayDevOps, Write/Speak/Code and many others.

Webinar

Think About Your Audience Before Choosing a Webinar Title

Sponsored by gremlin

What You’ll Learn in This Webinar

On-Demand Viewing:

this webinar starts in

What You’ll Learn in This Webinar

About the Hosts