recall

← recall

death spiral term

load on remaining nodes increases as nodes fail, accelerating failure

10 nodes handle traffic fine. One fails → 9 nodes get 11% more load → another fails → 8 nodes get even more load → boom. Fix: shed load proactively, autoscale faster than failure cascades, design for >2x peak headroom.

topics: failure-modes, scaling