<<Up     Contents

Algorithms for calculating variance

The variance of a population is defined as the mean squared deviation from the mean. That mouthful says the same as the formula below.

There is another formula for calculating variance which you may see. It uses the sum of all the data and the sum of the squares. The formula is:

This formula was introduced when the prevailing calculators made it much easier to sum squares and the raw data than to sum the squared deviations. Because this formula can result in loss of precision, it should no longer be recommended except for small exercises.

The method of calculation may be more easily understood from the table below where the mean is 8.

ixixi-mean(xi-mean)2
(index)(datum)(deviation)(squared deviation)
15-39
27-11
3800
41024
51024
n=5sum=40018


Note that the column of deviations sums to zero. This is always the case. Note also that we round the standard deviation to one more than the number of significant digits in the mean.

back to Variance

wikipedia.org dumped 2003-03-17 with terodump