Achieving Reliability through Chaos Engineering with Tammy Bütow




PurePerformance show

Summary: Starting your new job as Infrastructure Engineer in a large bank with your to-be boss and his key architects just leaving feels like Chaos! Maybe that’s why Tammy Butow has made a career in Chaos and Site Reliability Engineering. In this episode, Tammy shares her experiences of bring reliability into highly complex systems at NAB, Digital Ocean, DropBox or now Gremlin through chaos engineering. You learn about the importance to know and baseline your metrics, to define your SLIs and SLOs and to continuously run your fire drills to ensure your system is as reliable as it has to be.<br><br>If you want to learn more check out Tammy’s presentations on speakerdeck and make sure to join the chaosengineering slack channel.<br><br><a href="https://www.linkedin.com/in/tammybutow/" rel="noopener">https://www.linkedin.com/in/tammybutow/</a><br><br><a href="https://speakerdeck.com/tammybutow" rel="noopener">https://speakerdeck.com/tammybutow</a><br><br><a href="https://slofile.com/slack/chaosengineering" rel="noopener">https://slofile.com/slack/chaosengineering</a>