Variability
As important as measures of central tendency are, they do not provide us with a complete picture of the underlying scores. We may know what the averages are, but what about the distribution of grades - the 'spread' of the data? Will you feel the same about two possible grading schemes with the same average - one where there will be only five A's and five F's and a second where there will no A's or F's? Experience suggests that most students would care, although there is no agreement as to the preferred scheme. Those with good academic records tend to favor the first distribution while those with weak records tend to favor the second.
The differences between the two grade distributions will be captured in the variability of the scores. As with the previous discussion of central tendency, there are a number of measures of variation. We will look at range, variance, and standard deviation. For an on-line discussion of measures of variability you should check out the UCLA On-line Statistics Course, the Electronic Textbook, Introductory Statistics: Concepts, Models, and Applications Copyright 1996 by David W. Stockburger, Mathematics 220DX Statistics at the New Hampshire College, DAU the Stat refresher, and Hyperstat Online by David Lane at Rice University.
To demonstrate the measures of variability we will use our two grading examples. The frequency diagram of the exam scores for ECN202 and the histogram of the ECN201 grades are repeated below.


Range: This is the easy one. Once you have your data sorted to identify your median, simply look at the highest and lowest values. The range is the difference between the lowest and highest values. In the ECN202 grade example, grades range from 70 to 100. No one received a score higher than 100 or lower than 70.
In the ECN201 example, the scores ranged from 7 to 29 for the entire class and from 12 to 27 for the sample.
Variance: How likely is it that we would get a score close to the average? Did most of the students receive similar scores or were they spread out roughly equally over the entire range. The variance can be thought of as a measure of variability derived by a two-step process. In the first we compute a new variable that equals the test score minus the mean score (approximately 84.5). The result appears in the second column below. If we add the deviation in the first row (7.9) to the mean (84.5) we get the score (92.3) [Note: there may be a small difference due to rounding].
ECN201 Grades
| Score | Deviation | Deviation 2 |
| 92.3 | 7.9 | 62.264 |
| 85.7 | 1.2 | 1.54463 |
| 93.1 | 8.7 | 75.5893 |
| 76.6 | -7.9 | 61.8804 |
| 86.5 | 2.1 | 4.33774 |
| 70.4 | -14.0 | 196.835 |
| 77.1 | -7.4 | 54.6925 |
| 71.8 | -12.6 | 159.417 |
| 93.4 | 8.9 | 79.1603 |
| 87.0 | 2.5 | 6.2721 |
| 93.8 | 9.3 | 87.0956 |
| 77.2 | -7.2 | 52.1999 |
| 87.5 | 3.1 | 9.45035 |
| 94.7 | 10.3 | 105.67 |
| 72.0 | -12.5 | 155.804 |
| 77.5 | -7.0 | 48.4706 |
| 87.6 | 3.2 | 10.0862 |
| 95.2 | 10.8 | 115.687 |
| 72.6 | -11.8 | 139.925 |
| 77.9 | -6.6 | 43.3304 |
| 87.9 | 3.5 | 11.9836 |
| 95.2 | 10.8 | 115.994 |
| 96.1 | 11.7 | 135.747 |
| 73.4 | -11.0 | 121.861 |
| 77.9 | -6.6 | 43.0169 |
| 88.8 | 4.4 | 19.0616 |
| 89.0 | 4.5 | 20.667 |
| 96.9 | 12.4 | 154.054 |
| 73.9 | -10.6 | 112.082 |
| 78.1 | -6.3 | 39.9078 |
| 79.8 | -4.7 | 21.633 |
| 89.6 | 5.2 | 26.8889 |
| 97.5 | 13.1 | 171.007 |
| 98.3 | 13.8 | 191.654 |
| 74.2 | -10.3 | 106.006 |
| 74.4 | -10.0 | 100.544 |
| 80.3 | -4.2 | 17.3537 |
| 91.8 | 7.3 | 53.3722 |
| 91.8 | 7.3 | 53.4916 |
| 98.7 | 14.3 | 204.316 |
| 74.8 | -9.7 | 93.6733 |
| 80.5 | -4.0 | 15.6263 |
| 80.6 | -3.9 | 14.8779 |
| 92.0 | 7.5 | 56.2552 |
| 99.7 | 15.2 | 231.843 |
| 75.2 | -9.3 | 86.1429 |
| 75.4 | -9.0 | 81.7234 |
| 82.7 | -1.7 | 3.02864 |
| 92.0 | 7.5 | 56.686 |
| 75.7 | -8.8 | 76.5842 |
| 83.2 | -1.3 | 1.69538 |
| 76.3 | -8.2 | 66.7191 |
Total |
Total |
Total |
| 4391.6 | 0.0 | 3975.21 |
| 84.454 | Mean |
|
Variance |
76.45 | |
Standard deviation |
8.74 |
In the second step we square all of the deviation terms in column 2 that generates column 3. We now add these squared terms and divide by the number of observations to get the variance. All other things equal, the greater the variance the greater the spread of the scores.
What would the distribution look like if the variance were larger? What you would see would be fewer observations close to the mean and more observations near the upper and lower limits.
Returning to our ECN201 grades, we find that the variance for the entire class the variance is 19.12. You would expect that the variance is smaller in the sample given that there are no scores in the sample below 12, and this is precisely what you find. The variance in the sample is 16.86 points.
|
Sample |
Population |
|
|
Standard Deviation |
4.106375 |
4.372357 |
|
Sample Variance |
16.86232 |
19.1175 |
|
Kurtosis |
-0.42382 |
0.090843 |
|
Skewness |
-0.46205 |
-0.43279 |
|
Range |
15 |
22 |
|
Minimum |
12 |
8 |
|
Maximum |
27 |
30 |
|
Sum |
490 |
2580 |
|
Count |
24 |
119 |
Standard Deviation: There is one problem with the variance - it is influenced by the size of the variable being analyzed which will make comparisons of different score distributions impossible. For example, if one teacher used a 4-point scale and another used the 100-point scale, then there would be a real problem comparing the variability in grades for the two classes. To allow for this comparability, we can 'normalize' the variance by taking its square root. The result is the standard deviation which in the ECN202 example is 8.74 points. In the ECN201 example, the standard deviations for the entire class and the sample are 4.37 and 4.11.
In the next section where we examine probability and probability distributions, the standard deviation will take on special significance.