Variance and standard deviation measure how spread out a dataset is around its mean — the core question in statistics beyond just knowing the average. Two datasets can have identical means but completely different distributions: {50, 50, 50, 50} and {0, 0, 100, 100} both have means of 50, but the first has zero variance and the second has a variance of 2,500. Understanding this spread is essential for evaluating investment risk, quality control in manufacturing, research significance testing, and any situation where the average alone is insufficient information about a distribution.
The Empirical Rule: What Standard Deviation Tells You
For normally distributed data, the empirical rule (68-95-99.7 rule) states: approximately 68% of values fall within 1 standard deviation of the mean, 95% within 2 standard deviations, and 99.7% within 3 standard deviations. These percentages are specific to normal distributions but serve as useful approximations for many real-world distributions that are roughly bell-shaped.
Investment application: Jason, 43, in Seattle, Washington analyzes monthly stock returns with a mean of 1.2% and standard deviation of 4.3%. Under the empirical rule: 68% of months fall between 1.2% - 4.3% = -3.1% and 1.2% + 4.3% = 5.5%. 95% of months fall between 1.2% - 8.6% = -7.4% and 1.2% + 8.6% = 9.8%. In any given month, a loss worse than -7.4% should occur roughly 2.5% of the time — about once every 40 months, or once every 3.3 years. This is probabilistic risk assessment from standard deviation.