Skip to content
STUB

State vs Computation

The pattern: separate where data lives from where work happens. Computation scales horizontally — more workers, cheap, ephemeral. State scales differently — replicas, careful consistency, durable. Coupling them, by putting state inside compute, limits both: you can’t autoscale without losing data, and you can’t recover state without replaying compute. Pulling them apart is what makes pods restartable, lakehouses multi-engine, and serverless viable.

The trade-off: operational simplicity vs. flexibility. A monolith with co-located state is easier to reason about for small systems — one process, one disk, one backup. Splitting state from compute (services + DB; lakehouse separating storage from compute engines) costs more upfront (network in the hot path, transactional boundaries to manage) but unlocks independent scaling, multi-engine reads, and easier blue/green deploys. The 2020s lakehouse insight — “storage is dumb cheap; compute is expensive; separate them” — is this pattern made canonical at data-platform scale.

[Deepen Year 3 Phase 15 — the lakehouse insight is the canonical 2020s example.]