Professional Documents
Culture Documents
16-1
CHAPTER 16
LESSON
OUTLINE
In Chapter 10, we studied inferential methods for comparing the means of two
populations. Now we will study analysis of variance, or ANOVA, which
provides methods for comparing two or more population means. You should
be familiar with the chapter that discusses analysis of variance in your
textbook before beginning this chapter.
16.1
16.2
16.3
16.4
16.5
16.6
The F-distribution
One-Way ANOVA: The Logic
One-Way ANOVA: The Procedure
Multiple Comparisons*
The Kruskal-Wallis Test*
Problems
16-2
Section 16.3
16-3
Table 16 - 1
Energy
consumption
for samples
of
households
in four U.S.
regions
Northeast
15
10
13
14
13
Midwest
17
12
18
13
15
12
South
11
7
9
13
West
10
12
8
7
9
16-4
Solution Type the data into two variables named, ENERGY and REGION. ENERGY
should contain all 20 data values in the four samples. REGION should take
on the four values, 1, 2, 3, and 4, which associate the case with a region. The
values of REGION, 1, 2, 3, and 4, should be associated with the value labels,
Northeast, Midwest, South, and West, respectively.
Step 1: State the null and alternative hypotheses.
Let 1, 2, 3, and 4 denote last years mean energy consumptions for
households in the Northeast, Midwest, South, and West, respectively. The
null and alternative hypotheses are:
H 0 : 1 = 2 = 3 = 4 (mean consumptions are all equal)
H a : Not all the mean consumptions are all equal
Step 2: Decide on the significance level, .
The test is to be performed at the 5% significance level. Thus = 0.05.
Step 3: Compute the value of the test statistic.
1. Test the hypotheses by choosing Analyze > Compare Means >
One-Way ANOVA to open the One-Way ANOVA dialog box
(Figure 16 - 1).
Figure 16 - 1
One-Way
ANOVA
dialog box
2.
Paste the variable ENERGY into the Dependent List box and the
variable REGION into the Factor box.
3.
Section 16.3
16-5
Figure 16 - 2
ANOVA table
from OneWay ANOVA
procedure
16-6
Section 16.4
16-7
Figure 16 - 3
One-Way
ANOVA:
Post Hoc
Multiple
Comparisons
dialog box
16-8
For example, the confidence interval for the mean difference between the
Northeast and Midwest regions is 5.429 to 2.429. Two population means are
significantly different if their confidence interval does not include 0. This is
true for the Midwest and South regions, for example. SPSS provides another
table, the Homogeneous Subsets table (Figure 16 - 5), to help decipher which
population means are different and which are equal.
Figure 16 - 5
Homogeneous subsets
table from
Tukey
multiple
comparison
procedure
Means that are lined up together in a column under Subset for alpha = 0.05
are judged equal by the Tukey multiple comparison method. Means that are
in separate columns are judged not equal. That is, there is sufficient evidence
the population means for the regions, West, South, and Northeast are equal;
and the population means for the regions, Northeast and Midwest are equal.
Further, since West and Midwest are in different columns there is sufficient
evidence that they are not equal. These results have a 95% family confidence
level.
Section 16.5
16-9
Table 16 - 2
Number
miles driven
(1000s) last
year for
independent
samples of
cars, buses,
and trucks
Cars
19.9
15.3
2.2
6.8
34.2
8.3
12.0
7.0
9.5
1.1
Buses
1.8
7.2
7.2
6.5
13.3
25.4
Trucks
24.6
37.0
21.2
23.6
23.0
15.3
57.1
14.5
26.0
Preliminary data analysis (not shown) suggest that the distributions of miles
driven have roughly the same shape for cars, buses, and trucks but that those
distributions are far from normal. Thus the appropriate test is the KruskalWallis procedure. At the 5% significance level, do the data provide sufficient
evidence to conclude that a difference exists in last years mean number of
miles driven among cars, buses, and trucks?
Solution The Kruskal-Wallis test is performed by the Tests for Several Independent
Samples dialog box. Type the data into two variables named, MILES and
VEHICLE, in a new data file. MILES should contain all 25 data values in
the three samples. VEHICLE should take on the values, 1, 2, and 3,
associated with the value labels, Cars, Buses, and Trucks, respectively.
Step 1: State the null and alternative hypotheses
Let 1, 2, and 3 denote last years mean number of miles driven for cars,
buses, and trucks, respectively. The null and alternative hypotheses are:
H 0 : 1 = 2 = 3 (mean miles driven are all equal)
H a : Not all the means all equal
Step 2: Decide on the significance level, .
The test is to be performed at the 5% significance level. Thus = 0.05.
Copyright 2012 Pearson Education, Inc. Publishing as Addison-Wesley.
16-10
Next, we need to specify the minimum and maximum integer values for the
grouping variable. The minimum value must be less than the maximum value.
Cases associated with values outside the bounds are excluded during the
analysis. This option is supplied so that the Kruskal-Wallis procedure can be
performed on a subset of the samples.
3. Click the Define Range button to open the Several Independent
Samples: Define Range dialog box (Figure 16 7).
Section 16.5
16-11
Figure 16 7
Several
Independent
Samples:
Define
Range dialog
box
The Ranks table (Figure 16 8) displays the mean ranks for each of the three
samples. If the sample means are equal we would expect the mean ranks to be
approximately equal.
Figure 16 8
Ranks table
from KruskalWallis
procedure
The Test Statistics table (Figure 16 9) gives the chi-square test statistic,
degrees of freedom associated with the test statistic, and the p-value of the
hypothesis test.
Figure 16 9
Test
Statistics
table from
KruskalWallis
procedure
Copyright 2012 Pearson Education, Inc. Publishing as Addison-Wesley.
16-12
16.6 Problems
Problem 16.8
Problem 16.10
Section 16.6
Problem 16.48
Table 16 3
Running Times
in minutes
Problem 16.49
Table 16 4
Number of
Copepods
16-13
Movie fans use the annual Leonard Maltin Movie Guide for facts, cast
members, and reviews of over 21,000 films. The movies are rated form 4
stars (4*), indicating a very good movie to 1 star (1*) which Leonard Maltin
refers to as a BOMB. Table 16 3 gives the running times, in minutes, of a
random sample of films listed in one years guide. At the 1% significance
level, do the data provide sufficient evidence to conclude that a difference
exists in mean running times among the four rating groups?
1* or 1.5*
75
95
84
86
58
85
2* or 2.5*
97
70
105
119
87
95
3* or 3.5*
101
89
97
103
86
100
4*
101
135
93
117
126
119
Copepods are tiny crustaceans that are an essential link in the estuarine food
web. Marine scientists G. Weiss, G. McManus, and H. Harvey at the
Chesapeake Biological Laboratory in Maryland designed an experiment to
determine whether dietary lipid (fat) content is important in the population
growth of a Chesapeake Bay copepod. Their findings were published as the
paper Development and Lipid Composition of the Harpacticoid Copepod
Nitocra Spinipes Reared on Different Diets (Marine Ecology Progress
Series, vol. 132, pp. 57-61). Independent random samples of copepods were
placed in containers containing lipid-rich diatoms, bacteria, or leafy
macroalgae. There were 12 containers total, four replicates per diet. Five
gravid (egg-bearing) females were placed in each container. Table 16 4
shows the number of copepods in each container after 14 days.
Diatoms
426
467
438
497
Bacteria
303
301
293
328
Macroalgae
277
324
302
272
16-14
Problem 16.95
Refer to Problem 16.49. Apply the Tukey multiple comparison method to the
data in Table 16 3. Use a family confidence level of 95%
Problem 16.129 Indications are that Americans have become more aware of the dangers of
excessive fat intake in their diets, although some reversal of this awareness
appears to have developed in recent years. The U.S. Department of
Agriculture publishes data on annual consumption of selected beverages in
Food Consumption, Prices, and Expenditures. Independent random samples
of lowfat-milk consumptions, measured in gallons, for 1980, 1995, and 2005
are given in Table 16 5.
Table 16 5
Lowfat milk
consumptions,
in gallons, for
1980, 1995,
and 2005
1980
1995
2005
11.1
10.7
8.6
9.4
9.2
15.1
11.6
8.3
15.5
16.0
16.1
14.7
11.5
17.1
16.2
11.2
12.7
17.4
17.1
13.4
11.4
13.9
14.6
15.2