Traces, metrics, logs — the three pillars and the fourth nobody talks about: profiling. How to instrument distributed systems so you can debug them when they fail at 3am.

Girish Sharma

In-Depth Reads

Distributed Systems Engineering — Part 5: Observability at Scale

Distributed Systems Engineering — Part 4: CRDT and Conflict-Free Collaboration

Distributed Systems Engineering — Part 3: Building Reliable Message Queues

Distributed Systems Engineering — Part 2: Consensus Algorithms Demystified

Distributed Systems Engineering — Part 1: Clocks, Time & Causality