death spiral term
load on remaining nodes increases as nodes fail, accelerating failure
10 nodes handle traffic fine. One fails → 9 nodes get 11% more load → another fails → 8 nodes get even more load → boom. Fix: shed load proactively, autoscale faster than failure cascades, design for >2x peak headroom.