partial failure term

some components work while others don't, and different parts of the system disagree about the current state

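The defining problem of distributed systems. A single machine tends to fail as a whole: it either works or it doesn't. A distributed system nearly always has some component that is down, slow, or unreachable while the rest keeps running, and a caller often cannot tell a crashed peer from a slow network. Most distributed-systems vocabulary (timeouts, retries, idempotency, quorums, consensus) exists to deal with this.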
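A minimal sketch of why this is hard, assuming a toy in-process "network" that can drop either the request or the reply. The names `Server`, `Timeout`, `call_increment`, and `observe` are hypothetical and exist only for illustration. The point is that the caller sees the same timeout in both cases, while the server ends up in different states.

```python
class Timeout(Exception):
    """Raised when the caller gives up waiting for a reply."""


class Server:
    """Hypothetical remote component holding a counter."""

    def __init__(self):
        self.counter = 0

    def increment(self):
        self.counter += 1
        return self.counter


def call_increment(server, drop_request=False, drop_reply=False):
    """Simulated remote call over a lossy link (illustrative only)."""
    if drop_request:
        # The request never arrives; the server does nothing.
        raise Timeout()
    result = server.increment()
    if drop_reply:
        # The server did the work, but the caller never hears back.
        raise Timeout()
    return result


def observe(server, **faults):
    """Return what the caller can actually see: 'ok' or 'timeout'."""
    try:
        call_increment(server, **faults)
        return "ok"
    except Timeout:
        return "timeout"


lost_request = Server()
lost_reply = Server()

# Both calls look identical from the caller's side...
seen_a = observe(lost_request, drop_request=True)
seen_b = observe(lost_reply, drop_reply=True)

# ...but the two servers now disagree with each other.
print(seen_a, lost_request.counter)
print(seen_b, lost_reply.counter)
```

Blindly retrying after the timeout would apply the increment twice in the lost-reply case, which is why idempotent operations matter when failures are partial.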

topics: failure-modes, distributed-systems
