This document contains notes from a presentation on cloud capacity planning and monitoring challenges. It discusses the importance of capacity and performance management to understand how to safely support customer needs. It notes that cloud systems use the same queue management formulas as traditional systems. The document emphasizes understanding systems at scale through proper monitoring of logs, metrics, alerts and trends. It also stresses that tools alone are not enough and that recovery must be part of the design through practices like blue/green deployments and backups.
25. 2015 ROBERT BIGOS
Known unknowns and unknown unknowns
Reports that say that something hasn't happened are always
interesting to me, because as we know, there are known knowns;
there are things we know we know. We also know there are known
unknowns; that is to say we know there are some things we do not
know. But there are also unknown unknowns -- the ones we don't
know we don't know.
Donald Rumsfeld, February 12th, 2002 DOD News Briefing
Source: http://www.defense.gov/transcripts/transcript.aspx?transcriptid=2636