Crafting sustainable on-call rotations – Increment: On-Callhttps://increment.com/on-call/crafting-sustainable-on-call-rotations/
Distributed Systems Observability [Book]https://www.oreilly.com/library/view/distributed-systems-observability/9781492033431/
Google - Site Reliability Engineeringhttps://landing.google.com/sre/sre-book/chapters/monitoring-distributed-systems/
Google - Site Reliability Engineeringhttps://landing.google.com/sre/sre-book/toc/
High Scalability -http://highscalability.com/
Microservices vs The Worldhttps://adamdallis.com/2019/02/09/microservices-vs-the-world/
Some items from my "reliability list"https://rachelbythebay.com/w/2019/07/21/reliability/
You Are Not Google – Bradfieldhttps://blog.bradfieldcs.com/you-are-not-google-84912cf44afb
latency: a primerhttps://igor.io/latency/