The Psychology of Chaos Engineering

A presentation at Conf42: Chaos Engineering 2020 in January 2020 in London, UK by Matt Stratton

Slide 1

Slide 1

Slide 2

Slide 2

Slide 3

Slide 3

let’s set some agreement

Slide 4

Slide 4

Chaos Engineering is the discipline of experimenting on a system in order to build confidence in the system’s capability to withstand turbulent conditions in production.

https://principlesofchaos.org/

Slide 5

Slide 5

Netflix Technology Blog, 2011 By running Chaos Monkey in the middle of a business day, in a carefully monitored environment with engineers standing by to address any problems, we can still learn the lessons about the weaknesses of our system, and build automatic recovery mechanisms to deal with them. So next time an instance fails at 3 am on a Sunday, we won’t even notice.

https://bit.ly/netflix-chaos

Slide 6

Slide 6

perceptions

Slide 7

Slide 7

“isn’t all engineering chaotic?”

Slide 8

Slide 8

“isn’t all engineering chaotic?”

Slide 9

Slide 9

It’s not about breaking things

Slide 10

Slide 10

It’s not about breaking things

Slide 11

Slide 11

look, I know you know this

Slide 12

Slide 12

I’m gonna say it anyway

Slide 13

Slide 13

these are experiments

Slide 14

Slide 14

Slide 15

Slide 15

we know this, Matty

Slide 16

Slide 16

how we talk about things matters

Slide 17

Slide 17

people get nervous

Slide 18

Slide 18

“You want to do what in production??”

Slide 19

Slide 19

use your monitoring like it’s for real because it is

Slide 20

Slide 20

Something’s broken… …it’s your fault

Slide 21

Slide 21

but what about the people?

Slide 22

Slide 22

how does it make you feel to know Netflix practices chaos engineering?

Slide 23

Slide 23

what about your bank?

Slide 24

Slide 24

Slide 25

Slide 25

data, such as it is

Slide 26

Slide 26

Slide 27

Slide 27

Slide 28

Slide 28

management can get… …nervous

Slide 29

Slide 29

consider your words

Slide 30

Slide 30

it’s about the philosophy

Slide 31

Slide 31

Slide 32

Slide 32

safety first

Slide 33

Slide 33

• Know your conditions - when will you shut down the experiment? • This isn’t about causing stress on your people - be transparent • There are humans at the other end of those numbers

Slide 34

Slide 34

https://speaking.mattstratton.com

Slide 35

Slide 35

Pagey Says….