Reads

Literature I have read or am currently reading

Site Reliability Engineering

O'Reilly

Essential reading for understanding how to build and maintain large-scale production systems. Learning the practices Google uses to keep services running reliably at scale teaches you how to think about infrastructure, monitoring, and incident response in ways that actually matter.