I learned a new acronym: STONITH

The term refers to how clustered systems handle a failure. During normal operation, the systems in the cluster, called nodes, constantly run checks on their neighbors. If the nodes sense that one of their neighbors has stopped responding, they force the failing node offline, typically by powering it off, so that a half-dead node cannot keep touching shared data. Since a working node is killing the failing node, it is executing a STONITH: Shoot The Other Node In The Head.

Gotta love that one.
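
For the curious, here is a rough sketch of the check-and-shoot idea in Python. It is not how any real cluster manager does it: the Node class, the timeout value, and the stonith() function are all made up for illustration, and the "power off" step just flips a flag where a real setup would talk to a power switch or management controller.

```python
from dataclasses import dataclass


@dataclass
class Node:
    """Hypothetical record of what one node knows about a neighbor."""
    name: str
    last_heartbeat: float  # time (in seconds) the neighbor last responded
    powered_on: bool = True


def find_dead_neighbors(nodes, now, timeout):
    """Return powered-on nodes that have been silent for longer than timeout."""
    return [n for n in nodes if n.powered_on and now - n.last_heartbeat > timeout]


def stonith(node):
    """Shoot the other node in the head.

    Assumption: a real cluster would issue a power-off through fencing
    hardware here. This sketch only marks the node as off.
    """
    node.powered_on = False


def monitor_once(nodes, now, timeout=5.0):
    """Run one round of checks and fence every unresponsive node."""
    fenced = []
    for node in find_dead_neighbors(nodes, now, timeout):
        stonith(node)
        fenced.append(node.name)
    return fenced


if __name__ == "__main__":
    cluster = [Node("a", last_heartbeat=100.0), Node("b", last_heartbeat=90.0)]
    # At time 101, node "b" has been silent for 11 seconds and gets fenced.
    print(monitor_once(cluster, now=101.0))  # prints ['b']
```

The node is marked as off before anything else happens, which mirrors the point of the whole exercise: make sure the failing node is really down before the rest of the cluster carries on without it.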

