Professional Documents
Culture Documents
I
rl
ill
USINGSTATISTICS
@T.C.ResortProperties
CHI.SOUARE
TESTFORTHE DIFFERENCE 12.6 KRUSKAL.WALLISRANK TEST:
BETWEENTWO PROPORTIONS NON PARAMETRICANALYSIS
(TNDEPENDENT
SAMPLES) FORTHE ONE.WAYANOVA
CHI.SOUARE TESTFORDIFFERENCES 12.7 P (CD.ROMTOP'q CHLSOUARETESTFOR
AMONG MORETHAN TWO PROPORNONS A VARIANCEOR STANDARDDEVIATION
TheMarascuilo
Procedure
EXCELCOMPANIONTO CHAPTER12
CHI.SOUARE
TESTOF INDEPENDENCE El2.l UsingtheChi-Square
Testfor the
DifferenceBetween
TwoProportions
MCNEMARTESTFORTHE DIFFERENCE
812.2 UsingtheChi-Square
Testfor the
BETWEENTWO PROPORTIONS
Differences
AmongMoreThanTwoProportions
(RELATED
SAMPLES)
EI2.3 UsingtheChi-Square
Testof Independence
WLCOXON RANK sUM TEST E12.4UsingtheMcNemar Test
NONPARAMETRICANALYSISFOR 812.5 UsingtheWilcoxonRankSumTest
TWOINDEPENDENT POPULATIONS E12.6UsinstheKruskal-Wallis
RankTest
,12.1 CHI.SOUARE
TESTFORTHEDIFFERENCE
BETWEEN
(TNDEPENDENT
TWO PROPORTTONS SAMPLES)
In Section10.3,you studiedtheZ testfor the differencebetweentwo proportions.In thi
tion, the dataareexaminedfrom a differentperspective. The hypothesis-testing procedu
a test statisticthat is approximatedby a chi-square1X21 distribution.The resultsof this1
areequivalent to thoseof theZtest described in Section10.3.
If you are interestedin comparingthe countsof categoricalresponses betweentwo
pendentgroups,you candevelopa two-waycontingencytable (seeSection2.4)to disp
frequencyof occurrenceof successes and failuresfor eachgroup.In Chapter4, contin
tableswereusedto defineand studyprobability.
To illustratethe contingencytable,returnto the Using Statisticsscenarioconcerni
Resort Properties.On one of the islands,T.C. Resort Propertieshas two hotel
Beachcomber and the Windsurfer).In tabulatingthe responses to the singlequestion'A
likely to choosethis hotel again?"163of 227 guestsat the Beachcomber responded ye
154of 262 guestsat the Windsurferrespondedyes.At the 0.05 level of significance,is
evidenceof a significantdifferencein guestsatisfaction(asmeasured by likelihoodto ret
the hotel)betweenthe two hotels?
The contingencytabledisplayedin Table12.1hastwo rowsandtwo columnsandis
a 2 x 2 contingencytable.The cells in the tableindicatethe frequencyfor eachrow an
umn combination.
12.l: Chi-SquareTestfor the DifferenceBetweenTwo Proportions(IndependentSamples) 463
A B L E1 2 . 1 (GROUP)
coLUMNVAR|ABTE
layouta
o2f x2 ROWVARIABLE Totals
Table
cale Successes xt x2 X
the Failures nt - Xt nz- Xz n-X
ture Totals nr n2
rthe
.ged
lna-
vith where
)tel, X, : numberof successes
in group I
am-
X, : numberof successes
in group2
and
the nt - Xt: numberof failuresin group I
you
tc. nz - Xz: numberof failuresin group2
lan- X: Xt + Xr, thetotal numberof successes
on?
sort n - X: (n, - X) + (nz- Xr), thetotal numberof failures
r, : samplesizein group I
r, : samplesizein group2
n : nt + nr: totalsamPlesize
12.2 HOTEL
2 Contingency CHOOSE
HOTETAGAIN? Beachcomber Windsurfer Total
forthe Hotel
Satisfaction Yes 163 154 3t7
No 64 108 172
Total 227 262 489
Ho:nr: n,
Hl nt + Tc2
X2TESTFORTHE DIFFERENCE
BETWEENTWO PROPORTIONS
The y2 test statisticis equalto the squareddifferencebetweenthe observed
andt
frequencies,divided by the expectedfrequencyin eachcell of the table,
summed
cellsof the table.
*'=,Ar,".Y (12.1)
where
: observedfrequency in a particular
f cell ofa conti
table
To compurethe expected
for eachc.elrpertainingto success(that
l":r::t'.r, is, thecei
group
;ljfi by
iT.:_:T,.'"*:::j*-::i:2:*"!:imurtipry,il.;*nf
p-.rocompute
theexpect"d ,i,"1o.corumn totar)
ror
thecells
ffi;.";';;;,?,;;jJ"li;#,iil::rili#?lil
in thesecondro11n thecontinge-nftabi;),
total)for a groupbV0 - p). yo, _ufi,pry,'ti. .u_pt" size(or colur
The test statistic.h:yn in Equation(r2.r)
approximatelyfollows a chi_square
tio.n TableE.4) with 1 degreeof f.""aon distri
_(see Using a level of
nul
I hvpothesi
sirthecomputel
null hypothesisif the c.,mnrrrprr
*2 +^^*-,^.,^.,t^'
x',"'r'iu,iJ; ; ;;ilffi
o, you ."iaar
from the x2 distributionhavingft"gr";;i;""ao*. ;a:;:'jil!l,li,llH:n:i
"significance
Thus,the decisionrule is
12.1
of rejectionand
whenusing
test for
chi-square
differencebetween
s, with
o
of significance
be closeto zerobecausethe
If the null hypothesisis true,the computed12 statisticshould
cell,fo, and what is theoretically
squareddifferencebetweenwhat is actually observedin each
is false,then there are differencesin the population
expectedf, shouldbe very small.If Ho
to be large. However,what constitutesa
proportionsand the compuiedX2statistii is expected
difference betweenf, andf from a cell
largedifferencein a celf is relative.The sameactual
contributesmore to the 12 test thana cell
statistic
with a smallnumberof expectedfrequencies
with a largenumberof expectedfrequencies'
betweentwo proportions,
To illustratethe use of the chi-squaretest for the difference
Propertiesandthe corresponding
returnto the Using StatisticsscenarioconcerningT.c. Resort
null hypothesis(Ho:,nr: fi2).states
contingencytabledisplayedin Table 12.2onpage463.The of
that thereis no diff-erencebetweenthe proportiin of guestswho are likely to chooseeither
thesehotelsagain.To begin.
X,l Xl 1 6 3+ 1 5 4 - 317 =
0.6483
P=-j= 22'7+262 489
ny+n2
are
p is the estimateof the commonparameterfi, the populationproportionof guestswho
is true' The estimated pro-
iit.ty to chooseeitherof thesehotelsagainif the null hypothesis
of p '
portion of guestswho are not llkely to choosethesehotels again is the complement
size r the
f o
I - 0 . 6 4 8 3 : 0 . 3 5 1 7 . M u l t i p l y i n gt h e s et w o p r o p o r t i o n sb y t h e s a m p l e
Hotel givesthe numUetof guestsexpectedto choosethe Beachcomber again
Beachcomber
hotel again. In a similar manner' multiplying the
andthe numbernot expectedto choosethis
sample size yields the corresponding
t two respectiu"p.opo.iionsby the WindsurferHotel's
I expectedfrequenciesfor that group.
t
E X A M P L E1 2 . 1 COMPUTINGTHE EXPECTEDFREOUENCIES
for eachof the four cellsof Tablel2'2 onpage463'
Computethe expectedfrequencies
SOLUTION
Yes-Beachcomber: p :0.6483 andnr:227,3o.fr:147 '16
Yes-Windsurfer'.p : 0.6483andnr:262, sof": 169'84
No-Beachcomber: | - P :0.3517 andn, : 227,sofn: 79'84
No-Windsurfer: 1 - p : 0.3517andnr-- 262,sof": 92'16
observedfrequencies'
nextto the corresponding
Table12.3presentstheseexpectedfrequencies
466 cHAPTERTwELVE
Chi-SquareTestsand NonparametricTests
TABLE12.3
Comparing
the CHOOSE
HOTEL Beachcomber
Observed(fo)and AGAIN? Observed Expected
Expected(fu)
Frequencies Yes 163 147.16 154 169.84
No 64 79.84 108 92.16
Total 227 227.00 262 262.00
Ho:nr: n,
Hr:n,+n,
12.2
of rejection
iectionwhen
the 12 critical
with1 degreeof
atthe0.05
ofsignificance
Critical Regionof
Value Rejection
',2,3
Excel
for the hotel
satisfaction
data
El2.1 to create
E t z . tt n t n a t
includesthe
for the first
of the worksheet
the basisfor
the chi-square
in cell825.
-cHllNv{8l8, 821}
-$Ul{(Fl3rG14}
-cHl0lST{B25,B21l
-lFF26 < 818, 'Roject fte null hyporhs€h'",
'Do rot reJoct
lhs null hypofhodr'J
Ho:nr: n,
thatthepopulationproportions
againstthealternative arenot equal:
Hi nr+fi2
"12.2 Determinethe critical value of 12 with I degreeof a. Find the expectedfrequencyfor eachcell.
freedomin eachof the followins circumstances: b. Find the 12 statisticfor this contingencytable.Is it
a . : 0 . 0 5n, : 2 8 nificantata:0.05?
b.s:0.025,n:21 Applying the Concepts
c. cr:0.01,n:5
12.5 A sampleof 500 shopperswas selectedin a I
12.3 Usethefollowingcontingency
table: metropolitanareato determinevariousinformation
ffi cerning consumerbehavior.Among the questions
was,"Do you enjoy shoppingfor clothing?"The results
B Total
summarizedin the following contingencytable:
1 20 30 50
2 30 45 75 GENDER
Total 50 75 125
ENJOYSHOPPING
FORCIOTHING Male Female
a. Find the expectedfrequencyfor eachcell.
Yes 136 224 360
b. Comparethe observedand expectedfrequenciesfor
No 104 36 140
eachcell.
Total 240 260 500
c. Computethe 12 statistic.Is it significantat a : 0.05?
12.I : Chi-Square
Testfor the DifferenceBetweenTwoProportions(Independent
Samples) 469
STUDENT
GROUP a . At the 0.05 level of significance,is there
differencebetweenundergraduate andMBA
FUND Undergraduate MBA the proportionwho selectedthe highesrcost
Highest-cost
fund 27 l8 b. Determinethep-valuein (a) and interpret its
Not highest-cost
fund 73 82 c . Comparethe resultsof (a) and (b) to thoseof
f 0.38on page396.
Source: ExtractedfromJ. Choi, D. Laibson, and B. Madrian,
"Why Does the Law of One Practice Fail? An Experiment
on
MutuaI Funds," www.som.yale.edu/faculty/jjcglf/fees.pdf.
Hot f t l : "' -
ftz- ltL
youuseEquation
(12.1)on page464:
(-f"--f")2
u
^ L2r = 5.
a// cells re
where
f: observed
frequency
in a particularcell ofa 2x c con
table
COMPUTING
THEESflMATED pRopoRTtoNFoR c cRoups
OVERALL
(12.3)
fa tingency table rninus I, tirnesthe number of columns in the table minus L For a 2 x c contin-
in gency table, there are c - I degreesof freedom:
J s f f r e e d o r n- ( 2 -
Degreeo l)(r'- 1): c- I
)m
Using the level of significanceo(,you rejectthe null hypothesisif the computed12 test statistic
is greater than f,i'. the upper-tail critical value from a chi-squaredistribution having c - I
degreesof freedom.Therefbre,the decisionrule is
F i s u r e 1 2 . 4i l l u s t r a t e st h e d e c i s i o nr u l e .
he
!S' FIGURE
12.4
Ire Regions of rejection
andnonrejection when
for differences
testing
among c ploportrons
usingthe Xztest
R e g i o no f Critical R e g i o no f
Nonrejection Value Rejection
To illustratethe 12 test for equality of proportionswhcn there are more than two groups,
return to the Using StatisticsscenarioconcerningT.C. Resort Properties.A similar surveywas
recently conductedon a diff'erentisland on which T.C. Resort Propertieshas three diflerent
hotels. Table 12.6 presentsthe responsesto a question concerning whether guestswould be
likely to choosethis hotel again.
T A B L E1 2 . 6 HOTEL
ne 2x 3 Contingency CHOOSE
HOTEL
AGAIN GoldenPalm PalmRoyale PalmPrincess Total
ES TableforGuest
CS SatisfactionSurvey Yes 128 199 186 51 3
2) No 88 JJ 66 187
br Total 216 232 252 700
l)=
X, + X, +...+ X( A
EXAMPLE12.2 COMPUTING
THEEXPECTED
FREOUENCIES
Compute the expectedfrequenciesfor each of the six cells in Table
SOLUTION
Y e s - G o l d e nP a l m :p : 0 . 7 3 3 a n dn r : 2 1 6 , s o { , : 1 5 8 . 3 0
Yes-Palm Royale:p : 0.733andnr: 232, so.fn: 170.02
Yes-Palm Princess:p -- 0.133andnr: 252,so-fn: 184.68
No-Golden Palm:1 - P : 0.267andnr- 216,sof": 57.70
No-Palm Royale:| - p : 0.267andnr:232, so{ : 61.98
No PalmPrincess: I - p :0.267 andnr:252, so.f": 67.32
Table12.7presentstheseexpectedfrequencies.
F 1 , :N o t a l l r n , a r ee q u a l( w h e r ej : 1 , 2 , 3 ) .
You use the observed frequencies from Table 12.6 on page 471 and the expected
from Table 12.7to computethe "72test statistic(given by Equation(12.1) on page
displayed
in Table12.8.
I
You use TableE.4 to find the critical value of the y2 teststatistic.In the guestsatisfaction I
I
survey,becausethreehotelsare evaluate4there are(2 - lX3 - l) : 2 degreesof freedom.
Using cr: 0.05,they2 criticalvaluewith2 degreesof freedomis 5.991.Becausethe computed
test statistic(X2:40.23) is greaterthan this critical value,you rejectthe null hypothesis(see I
Figure12.5).MicrosoftExcel(seeFigure12.6)alsoreportsthep-value.Likewise,because the I
I
p-value is approximately0.0000,which is lessthan cr : 0.05,you rejectthe null hypothesis. !
12.5
of rejection
ection when
for differences
e ProPonrons
0.05level
nce, with
of freedom
5.991
Regionof tI Regionof
Nonrejection I Rejection
Critical
Value
12.6
Excel
for the
satisfaction
Table12.6
E12.2to create
-{81e-0-Fat.r)
-€tlll||V(818, Btll
-SuI(Gl3:11.{}
-clll0lsT{8?5, B2ll
-lF€2t < 810,"Rrjrdrhe rull hyporte.b',
'Do not r.l.cl 1ft6null hypothalr'|
-lF{OR{Bl3
< t, Ci3 < 1, 013 < l, 9tl < 1. Cl4< t, 014< 1},
' kvlold.d,',' bm.r.1
For the Xz test to give accurateresults when dealing with 2 x c contineencytables, all
expectedfrequenciesmust be large.Thereis much debateamongstatisticiansaboutthe defini-
tionof large. Somestatisticians(seereference5) havefound that the test givesaccurateresults
as long as all expectedfrequencies equalor exceed0.5. Otherstatisticians, more conservative
in their approach,requirethat no more than 20o/oof the cells containexpectedfrequenciesless
47 4 CHAPTER TWELVE Chi-SouareTestsand NonoarametricTests
The MarascuiloProcedure
Rejectingthenullhypothesis
inayz testofequalityofproportionsin
a2xctableonlyal
you to reach the conclusion that not all c population proportions are equal. But which of
proportionsdiffer?Becausethe resultof theX2testfor equalityof proportionsdoesnot
ically answerthis question,you need to use a multiple comparisonsproceduresuchas
Marascuiloprocedure.
The Marascuilo procedureenablesyou to makecomparisons betweenall pairsof
First, compute the observed differencespi- pi, (whereT * j'') amongall c(c - l)/2 pairs.
use Equation (12.4) to compute the correspondingcritical rangesfor the Marascuilo proc
CRITICALRANGEFORTHE MARASCUILO
PROCEDURE
P1Q-P) , ni,(r* pi,)
,unf"= ,ffi
Critical (r2.4)
njnj'
You need to compute a different critical range for each pairwise comparison of sample
portions. ln the final step, you compare each of the c(c - l)12 pairs of sample proport
againstits correspondingcriticalrange.You declarea specificpair significantlydifferentif
absolutedifferencein the sampleproportionslpi- prl is greaterthanits criticalrange.
To apply the Marascuiloprocedure,returnto the guestsatisfactionsurvey.Usingthe
test,you concludedthat therewas evidenceof a significantdifferenceamongthe
proportions.Becausethereare threehotels,thereare (3X3 - 1)12: 3 pairwiseco
From Table12.6onpage471,the threesampleproportionsare
x1
Pt=- 0.593
n1
x1 199
Dt:1=-=0.858
n2 232
x, 186
Dt=L=_=0.738
nj 252
=Jrssr=2.448
F
Next, you computethe threepairsof absolutedifferencesin sampleproportions and their
respondingcritical ranges.If the absolutedifferenceis greaterthan its critical range, the
portionsaresignificantlydifferent:
12.2: Chi-squaredTestfor DifferencesAmongMoreThanTwoProportions 475
pj(l-pj)
lPi - Pi4 2.448 +pj,(l-pj,)
( 0 . 8 5 8 X 0 . 1 4_
2 \n ,
l p t- p zl : 1 0 .s9
- 30 .8 s81
:0.26s 1 n ^ o l ( 0 . 5 9 3 X 0 . 4 0 ,7 )
L.aao.,- = v.0992
\ 216 232
- 0 .7 381
: 0.120 ,,o /(0.858X0.142) . (0.7381(0.262\
\ -- - -'\ -----/
l p r- p zl :I0 .8 s8 L
" .a1o.t- + = 0.0880
\ 232 2s2
Thesecomputationsare shownin worksheetformat in Figure 12.7.
12.7
Excel
Proceoure
"Usingthe
Wo*sheets"
E12.2to
about this
12.3 CHI.SOUARE
TESTOF INDEPENDENCE
In Sectionsl2.l andl2.2,youused the y2 testtoevaluatepotentialdifferencesamongpopula-
tion proportions.For a contingencytablethat hasr rows and c columns,you can generalizethe
y2 testas a test ofindependencefor two categoricalvariables.
For
atest
""'"T,l;:ff;H:ffi:;:,'#ffi ,T"'"J'
(that is, thereis no relationshipbetweenthem).
?,: The two categoricalvariablesare dependent
(that is, thereis a relationshipbetweenthem).
Onceagain,you useEquation(12.1)on page464to computethe teststatistic:
(-f,--f")2
u
n2 = 5.
o/t'i,,, .f,
You rejectthe null hypothesisat the cr level of significanceif the computedvalueof the y2
test statisticis greaterthan 74!1,
the upper-tailcritical value from a.hi-.q,ra." distributron
with (r - lXc - l) degreesof freedom(seeFigure12.8).Thus,the decisionrule is
testingfor
c contingelcy
usingthe 12 test
The 72 test of independenceis similar to the y2 test for equality of proportions.The test
statisticsand the decisionrules arethe same,but the statedhypothesesand conclusionsaredif-
ferent.For example,in the guestsatisfactionsurveyof Sections12.1and 12.2,thereis evidence
of a significantdifferencebetweenthe hotelswith respectto the proportionof guestswho
478 cHAPTERTwELVE
Chi-SquareTestsand NonparametricTests
T A B L E1 2 . 9 HOTEL
Contingency Table of PRIMARY
REASON Golden Palm Palm
Primary Reasonfor Not
FORNOTRETURNING Palm Princess
R e t u r n i n ga n d H o t e l
Price 7 )l
Location 39 IJ 8
Roomaccommodation l3 5 l3
Other l3 8 8
Total 88 JJ 66
(f"-f"f
u2= T
LLf
all cells r e
where
: observed frequency in a particular cell ofthe r x c
/u
contingency table
where
row total: sumofall the frequencies
in the row
columntotal: sumof all the frequencies
in the column
r : overallsamplesize
and for the lower-right-corner cell (other reason for the Palm Princess),
E'.,z.10 HOTEL
Tableof PRIMARY
REASON Golden Palm Palm
Frequencies FORNOTRETURNING Palm Royale Princess Total
Reason
Returning Price 31.53 I 1.82 23.65 67
Hotel [ocation 28.24 10.59 2t.t8 60
Roomaccommodation 14.59 5.47 10.94 3l
Other 13.65 5.12 r0.24 29
Total 88.00 33.00 66.00 r87
480 CHAPTERTWELVE Chi-Square
TestsandNonparametric
Tests
To perform the test ofindependence, you use the y2 test statistic shown in
on page 464. Here, the test statisticapproximatelyfollows a chi-squaredistribu
degreesof freedomequal to the number of rows in the contingencytable minusl,
numberof columnsin the tableminus 1:
Degrees
offreedom trrl:_
_rrr^ i] = ,
Using the level of significanceo( : 0.05, the upper-tailcritical value from the
squaredistributionwith 6 degreesof freedomis 12.592(seeTableE.4). Becausether
putedtest statistic12 -- Zl .+l > 12.592,you rejectthe null hypothesisof independence
Figure 12.9). Similarly, you can use the Microsoft Excel worksheetin iigure l2
Becausethep-value:0.0001 < 0.05,you rejectthe null hypothesisof indepenlence.'
p-valueindicatesthat there is virtually no chanceof having a relationshipthis
lart
largerbetweenhotelsand primary reasonsfor not returningin a sample,if th-eprimary
sonsfor not returningare independentofthe specific hotelsin the entirepopuiation.i
thereis strongevidenceof a relationshipbetweenprimary reasonfor noireturning
the hotel.
FIGURE12.9
Regionsof rejection
and nonrejection
when testingfor
independence in the
hotel guestsatisfaction
surveyexampteat
the 0.05 levelof .05
significance,
with
6 degreesof freedom
Regionof
Rejection
12.3:Chi-SquareTestoflndependence
481
the4x3
>lefor
for not
hotel
to create
not shown
rows3
similar
IN
-Fa3 .0 . lB2{ - t}
-fi[nvFz2, B25l
-St|IGttdtS)
-ctllDlsT€z9, B25l
-lFlEl < &4, 'Rrlccalftc n.rll lporh.d.',
1)o ||ot r.r.rt lh. nlll iypolh.d.'l
-lF{OR{Bl5
< I, Cti < t, Dt5< 1, BtO< I, Cl6< t,016 < I,
Bl7 < I, c|7 < 't, Dt7< I, Blt < I, CtE< t, 018< tl
- hvlolercd.-,' b m.r.)
12.12 (GROUP)
coNDrTroN 2
(GROUP)
coNDrTroN 1 Yes No Totals
the
r Test Yes A B A+B
No C D C+D
Totals A+C B+D
where
The sampleproportionsare
A+C
P2= = proportionof respondents
in the samplewho answeryesto condition2
ft
The populationproportionsare
McNEMAR TEST
, _ B-C
(t2.6)
"[Ei
wherethe teststatisticZ is approximately
normallydistributed.
You use the McNemar test for these data becauseyou have repeatedmeasurements fro{
the sarneset of panelists.Each panelist gave a responseabout whether he or she preferr
Sprint or Verizon before exposureto the intensivemarketing carnpaignand then againaftd
exposureto the campaign.
To determinewhetherthe intensivemarketing carnpaignwas effective,you want to inve
tigate whether there is a difference between the population proportion who favor Verizo
before the campaign,7rl, versusthe proportion who favor Verizon after the carnpaign,nr. Th
null and alternativehypothesesare
H , r "n r - n ,
H l. nt+ Tc2
U s i n g a 0 . 0 5 l e v e l o f s i g n i f i c a n c et,h e c r i t r c a lv a l u e sa r e - 1 . 9 6 a n d + 1 . 9 6 ( s e eF i g u r e1 2 . 1
a n d t h e d e c i s i o nr u l e i s
FIGURE12.11
T w o - t a i lM c N e m a r
T e s ta t t h e 0 . 0 5 l e v e l
of significance
12.4: McNemar Test for the Difference Between Two Proportions (Related Samples) 485
sothat
Using Equation(12.6),
B_C
Z- =+ = 3.4641
4B+C {48
12.12
Excelresults
test
loyaltyof
providers
-xOilSmrpllzl
-||onrsilYtl -HiA
-2' (l - xoil$)lsr(AssFlflll
-F@1 < B1l.'Rql.d|i. illl byp.{*',
'Dc
f,ot r.l.d |i. ed l!|'ct*')
GROUP
2
Applying the Concepts
Yes No Total
12.28 A market researcherwantedto determinewhether
46 25 7l
the proportion of coffee drinkers who preferred Brand I
l6 59 75
increasedas the result of an advertisingcampaign.A ran-
62 84 A6
dom sampleof 200 coffeedrinkerswas selected.The results
486 CHAPTER
TWELVEChi-Square
Tests
andNonparametric
Tests
CHECKINGTHE RANKINGS
n{n + l)
T1+7, = (r2.7)
The Wilcoxon rank sum test can be either a two-tail test or a one-tail test, dependingon
whetheryou are testing whetherthe two populationmediansarc dffirent or whetherone
medianis greater than the othermedian:
where
M, : medianof population I
Mr: medianof population2
Whenboth samplesn, andn2are< 10,you useTableE.8 to find the criticalvaluesof the
test statistic2,. For a two-tail test,you reject the null hypothesis(seePanelA of Figure 12.13)
if the computedvalue of 7, equalsor is greaterthan the upper critical value,or if Z, is less
than or equalto the lower critical value.For one-tailtestshavingthe alternativehypothesis
Ht: M, < M2, yotrrejectthe null hypothesisif the observedvalue of Z, is lessthan or equal
to the lower critical value(seePanelB of Figure 12.13).For one-tailtestshavingthe alterna-
tive hypothesis11,: M t> M2, you rejectthe null hypothesisif the observedvalueof Z, equals
or is greaterthan the uppercritical value (seePanelC of Figure 12.13).
+6d CHAPTER TWELVE Chi-SouareTestsand Nonoarametric Tests
til
Tr, Tr,
ti
T.,
1t\ Fr., Pt
- Regionof Rejection
' Regionof Nonrejection PanelA PanelB PanelC
H . , :M . , * M , Hr:Mr< M, Hr:.M,,> M,
lrr,=\!
nrnz(n+ l)
04=
t2
Therefore,Equation(12.8)definesthe standardized
Z teststatistic.
LARGESAMPLEWILCOXONRANKsUMTEST
T -
n{n+l)
tl
z=-+ ^
(12.8)
+ l)
ln1n2(n
1n
wherethe test statisticZ approximatelyfollows a standardizednormal distribution.
12.',,4 Sales
theCombined Normal Display Combined End-Aisle Display Combined
(n, :10) Ranking (n, = l0) Ranking
22 1.0 52 5.5
34 3.0 7l 14.0
52 5.5 76 15.0
62 r0.0 54 7.0
30 2.0 67 13.0
40 4.0 83 17.0
64 I1.0 66 12.0
84 18.5 90 20.0
56 8.0 77 16.0
59 9.0 84 18.5
Source:Data are takenJiom Table10.1 onpage 372.
The next stepis to computeTr, the sum of the ranksassignedto the smaller sample.When
the samplesizesare equal,as in this example,you can identify either sampleasthe group from
which to compute2,. Choosingthe normal display asthe first sample,
T , : | + 3 + 5 . 5+ 1 0 + 2 + 4 + 1 1+ 1 8 . 5+ 8 + 9 : 7 2
As a checkon the ranking procedure,you computeTrfrom
T 2 : 5 . 5+ 1 4 + 1 5+ 7 + 1 3+ 1 7+ 1 2 + 2 0 + 1 6 + 1 8 . 5: 1 3 8
andthenuseEquation(12.7) onpage487to showthat the sum of the first n : 20 integersin the
combinedranking is equalto T, 1-Tr:
n(nj l)
T1+T2 -
z
i2+138-20(21) =2lo
2
210 = 210
To test the null hypothesisthat there is no differencebetweenthe median salesof the two
populations,you useTable8.8 to determinethe lower- and upper-tailcritical valuesfor the test
statistic2,. FromTable12.15,a portionof TableE.8,observethat for a levelof significanceof
0.05,the criticalvaluesare78 and l32.The decisionrule is
RejectHoif Tt < 78 or if Tr2132;
otherwise,do not rejectHn.
FIGURE',,2.14
MicrosoftExcel
Wilcoxonranksum test
worksheetfor the BLK
colasalesexamole
-!loRxslllv(8rz)
-lloRlslllv(l - Btlz)
-2' fl - tloRIs0lsT{ABs{81cD}
-lF(8Zl < Bl, 'RcJcafta null Ilypothrcb',
-llo noi rdc{*th! nrll hypothc'*J
TableE.8 showsthe lower and uppercritical valuesof the Wilcoxon rank sumtest
7n,,but only for situationsin which both n, andn) arelessthanor equalto 10.If either
both of the samplesizesare greaterthan10,yotmust usethe large-sampleZapproxi
formula[Equation(12.8)on page488]. However,you canalsousethis approximation
for small samplesizes.To demonstratethe large-sampleZ approximationformula,consi
BLK cola salesdata.Using Equation(12.8),
T -
n{n+l)
ll ^
11 (1ox2t)
lL--
(1Oxl0x2i)
t2
-
- 72 lo5 = -2.4946
13.2288
a. o : 0 . 1 0 ,n r : 6 , n r : 8
Learningthe Basics
b. c r: 0 . 0 5n, r : 6 ,n r : 8
',2.33 Using Table8.8, determinethe lower- c. o : 0 . 0 1n, r : 6 , n r : 8
and upper-tailcritical valuesfor the Wilcoxon d. Given your resultsin (a) through (c), what do
rank sum test statistic,2,, in eachof the follow- clude regardingthe width of the region of
ing two-tailtests: asthe selectedlevelofsignificancecx,gets
12.5: WilcoxonRankSumTest:Nonparametric
Analysisfor TwoIndependent
Populations 491
UsingTable E.8, determinethe lower-tail critical graduatesspendsix months on the job, the vice president
for theWilcoxon rank sumtest statistic,2,, in eachof ranks them on the basis of their performance,from I
ineone-tailtests: (worst)to 20 (best),with the following results(storedin the
= 0 . 0 5n, r : 6 , n r : 8 fileS@@:
= 0 . 0 2 5n,r : 6 , n r : 8
= 0 . 0 1n,r : 6 , n r : 8 T | 2 3 59l0t2t31415
= 0 . 0 0 5n,r : 6 , n r : 8 E 467 8ll16t7l81920
a. Is there evidenceof a differencein the median surface @ffiE (extractedfrom B. D. Olin andW. Q.
hardnessbetweenuntreatedand treated steel plates? "Applicationsof StatisticalMethodsto
(Usecr: 0.05.) Evaluation,"Technometrics,38, 1996,p.
b. What assumptionsmust you makein (a)?
Unflawed
c. Comparethe resultsof (a) with thoseof Problem10.18
(a) on page380. 0.0030.0040.0120.0140.02r 0.0230.0240.030
0.0410.0410.0420.0430.04s0.0s70.0630.074
12.44 Managementof a hotel was concerned
with increasingthe return rate for hotel guests. Flawed
One aspectof first impressionsby guestsrelates 0.0220.0260.0260.0300.0310.0340.0420.043
to the time it takesto delivera guest'sluggageto the room
0.0460.0460.0520.0s50.0s80.0600.0600.070
aftercheck-into the hotel.A randomsampleof 20 deliver-
ieson a particulardaywereselectedin Wing I of the hotel, 0.0730.0730.0780.0790.0790.0830.0900.095
and a randomsampleof 20 deliverieswere selectedin 0.0960.1000 . r 0 20 . r 0 30 . r 0 50 . 1l 4 0 . 1 1 90 . 1 200.
Wing B. The resultsare storedin the file [ElEEEf,lE. 0 . 1 6 00 . 3 0 60.3280.440
a. Is thereevidenceof a differencein the mediandelivery
time in the two wings of the hotel?(Usecr: 0.05.) a. Usinga 0.05levelof significance,is there
b. Comparethe resultsof (a) with thoseof Problem10.74 the median crack size is less for unflawed
on page409. than for flawed components?
b. What assumptions mustyou makein (a)?
12.45 The directorof trainingfor an electronicequip- c. Comparethe resultsof Problem10.21(a) on page
ment manufacturerwantsto determinewhetherdifferent with the resultsof (a) in this problem.Discuss.
training methodshave an effect on the productivityof
12.47 A bank with a branchlocatedin a commercial
assembly-lineemployees.She randomly assigns 42
trict of a city has developedan improvedprocessfor
recentlyhired employeesto two groupsof 21. The first
ing customersduring the noon-to-1 p.m. lunch period.
group receivesa computer-assisted, individual-based
waiting time (defined as the time elapsed from when
training program,and the other group receivesa team-
basedtrainingprogram.Upon completionof the training, customerentersthe line until he or shereachesthe
the employeesare evaluatedon the time (in seconds)it window)of all customersduringthis hour is recorded
takesto assemblea part.The resultsare in the datafile a period of 1 week.A randomsampleof 15
selected(andstoredin the file[l@!), andthe
@ minutes)areasfollows:
a. Using a 0.05level of significance,is there evidenceof a
difference in the median assemblytimes (in seconds) 4.21 5.5s 3.02 5.13 4.77 2.34 3.54 3.20
between employeestrained in a computer-assisted,indi-
4.s0 6.10 0.38 s.t2 6.46 6.19 3.19
vidual-basedprogram and those trained in a team-based
program? Another branch, located in a residential area, is also
b. What assumptions must you make in orderto do (a) of cerned with the noon-to-l p.m. lunch period. A
this problem? sample of l5 customers is selected (and stored in the
c. Comparethe resultsof Problem10.20(a) on page380 EEEEEE), and the results (in minutes) are as follows:
with the resultsof (a) in this oroblem.Discuss.
9.66 5.90 8.02 5.79 8.73 3.82 8.01 8.35
12.46 Nondestructive evaluationis a methodthat is used 10.49 6.68 5.64 4.08 6.t7 9.9t 5.47
to describethe propertiesof componentsor materials
a. ls thereevidenceof a differencein the medianwaiti
without causingany permanentphysicalchangeto the
time betweenthe two branches? (Usecr: 0.05.)
units.It includesthe determinationof propertiesof materi-
b. What assumptions mustyou makein (a)?
als andthe classificationof flawsby size,shape,type,and
c . Comparethe resultsof Probleml0.la (a) on page
location.This methodis most effectivefor detectingsur-
with the resultsof (a) in this problem.Discuss.
faceflaws and characterizing surfacepropertiesof electri-
cally conductivematerials.Recently,datawere collected 12.48 A problemwith a telephoneline that
that classifiedeachcomponentas havinga flaw or not, customerfrom receivingor making calls is
basedon manualinspectionand operatorjudgment, and both the customer and the telephone company.The dah
also reportedthe size of the crack in the material.Do the the file trElffr representsamplesof 20
componentsclassifiedas unflawedhavea smallermedian reported to two different offices of a telephone
crack size than componentsclassifiedas flawed?The and the time to cleartheseproblems(in minutes)from
resultsin termsof cracksize(in inches)arein the datafile customers' lines:
12.6: Kruskal-WallisRank Test:NonparametricAnalysis for the One-WayANOVA 493
:er, CentralOfficeI Time to Clear Problems (Minutes) a. ls there evidence of a difference in the rnedian time
ive 1 . 4 81 . 7 50 . 7 8 2 . 8 5 0 . 5 2 1 . 6 0 4 . 1 5 3 . 9 7 1 . 4 8 3 . 1 0 to clear these problems betweenthe two offices? (Use
cx:0.05.)
1.020.53 0.93 1.60 0.80 1.05 6.32 3.93 5.45 0.97
b. What assumptionsmust you make in (a)?
CentralOfficell Time to Clear Problems(Minutes) c. Compare the resultsof Problem 10.16 (a) on page 379
34 7 . 5 53 . 7 50 . 1 0 l . r 0 0 . 6 0 0 . s 2 3 . 3 0 2 . r 0 0 . 5 8 4 . 0 2 with the resultsof (a) in this problem. Discuss.
76 3 . 7 50 . 6 5 t . 9 2 0 . 6 0 1 . 5 3 4 . 2 3 0 . 0 8 1 . 4 8 1 . 6 5 0 . 7 2
t44
)7 r 12.6 KRUSKAL-WALLISRANK TEST:NONPARAMETRIC
ANALYSIS
)95 FOR THE ONE.WAY ANOVA
130 If the normality assumptionof the one-wayANOVA F test is not met, you can usethe Kruskal-
Wallis rank test.The Kruskal-Wallisrank test for differencesamong c'medians(wherec'> 2) is
an extension of the Wilcoxon rank sum test for two independentpopulations, discussedin
: that Section 12.5.Thus, the Kruskal-Wallistesthas the samepower relativeto the one-wayANOVA
rents F test that the Wilcoxon rank sum test has relative to the / test.
You use the Kruskal-Wallis rank test to test whether c independentgroups have equal
medians.The null hypothesisis
: 380
H u :M r : M z - - M,
As the sample sizes in each group get large (that is, greater than 5), you can
the test statistic,H,by the chi-squaredistributionwith c - I degreesof freedom.
reject the null hypothesisif the computedvalue of 11is greaterthan the y[ upper-
value (seeFigure 12.15).Therefore,the decisionrule is
FTGURE12.15
Determining the
rejectionregion for
the Kruskal-Wallistest
o t yilx,
Begion of / Regionof
Nonreiection/ Rejection
Critical
Value
H o :M r : M z : M t : M q
11,:Not all M, areequal(wherej : 1,2,3, 4).
TABLE 12.16
TensileStrengthand
Ranksof Parachutes
Woven from Synthetic Amount Rank
Fibersfrom Four
Suppliers 18.5 4 26.3 20 20.6 8 2s.4
24.0 l3.s 2s.3 18 25.2 t7 19.9
t7.2 I 24.0 13.5 20.8 9 22.6
19.9 5.5 21.2 l0 24.7 t6 t7.5
18.0 3 24.5 l5 22.9 t2 20.4
As a checkon the rankings,recall from Equation(12.7) onpage 487 that for any integern, the
4T!' Therefore
sum of the first n consecutiveintegemi,
L
Tr+Tr+T3+fo=4\!
27+76.5+62+44.5=ry
210 = 210
r. " ,21
H =l__.t!-t,'/ l- 31n + r;
l n ( n+ l ) - i = r n iI
; 't"2 r ^ - . 2 ( 4 4 . s ) '_z3- le] t \
=I l e t f + e 6 . r 2+ r , a 2+
[(20x215 )L s s s li
2.17 Upper-TailArea
the Degreesof
Value
Critical Freedom .995 .05
-Wallis
atthe0.05 l- 0.001 0.004 0.016 0.t02 1 . 3 2 32.706 s.024
2 0.010 0.020 0 . 0 5 1 0.103 0.211 0.575 2.773 4.605 7.378
9.348
4 0.207 0.29',70.484 0.71l 1.064 r.923 5.385 7.779 9.488 lt.t43
) 0.412 0.s54 0 . 8 3 1 1 . 1 4 5 1 . 6 1 02 . 6 7 5 6.626 9.236 il.071 12.833
Source:Extractedfrom TableE.4.
Figure 12.17presentsa roadmapfor this chapter.First, you different with rcspectto the proportion of guestswhoare
used hypothesistesting for analyzingcategoricalrcsponse likely to return; and that the reasonsgiven for not returning
data from two sarnples(independentand relatcd)and from to a hotel are dependenton the hotel the guestsvisited.
more than two independentsanples. In addition, the rules T h e s e i n f e r e n c e sw i l l a l l o w T . C . R e s o r t P r o p e r t i et so
o f p r o b a b i l i t y f r o m S e c t i o n 4 . 2 w e r e e x t e n d e dt o t h e inrprovethe quality of serviceit provides.
hypothesisof independencein the -;oint responsesto two I n a d d i t i o n t o t h e c h i - s q u a r et e s t s ,y o u a l s o s t u d i e d
categoricalvariables.You applicd thesen-rethods to the sur- t w o n o n p a r a m e t r i c t e s t s .Y o u u s c d t h e W i l c o x o n r a n k
veys conductedby T.C. Resort Properties. You concluded sum test when the assumptionsof the t test for two inde-
that a greaterproportion of gucstsare willing to rcturn to p e n d e n t s a r n p l e sw e r e v i o l a t c d a n d t h e K r u s k a l - W a l l i s
t h e B e a c h c o n , b eH r o t e l t h a n t o t h e W i n d s u r f e r :t h a t t h e tcst when the assumptionsof the one-way ANOVA were
Golden Palrn, Palm Royale,and Palm Princesshotels are violated.
Wilcoxon Kruskal-Wallis
RankSum Test Test C o n t ni g e n c y
Tables
Marascuilo
Procedure"*
Independent
No Samples
McNemar
Test
Yes
ZTest for
,(' lest lor
(see Section10.3) "1- "2
having a large two-topping pizza deliveredto their resi- (1) Do you believe that quality is a profession?
dencethat evening.The studentshad to decidebetween
orderingfromPizza Hut at a reducedprice of $8.49 (the Manufacturing Service Health
regularprice for a largetwo-toppingpizzafromthe Oxford
Yes 108 88 49
PizzaHtfi at this time was $11.49)and orderinga pizza
No 72 r32 50
from a differentpizzeia. The resultsfrom this questionare
summarizedin the following contingencytable: (2) Doesyour company measurethe impad of pt
improvement initiatives?
PIZZERIA
Manufacturing Service Health
GENDER PizzaHut Other Total
4 13 l7 Yes 132 t29 54
Female
l8 No 48 9t 46
Male 6 12
Total 10 2s 35 Source:Adaptedfrom G. Weiler "Ilhat Do CEOs ThinkAbout
Quality? " Quality Progress,May 2004, 37(5), pp. 52-56.
The surveyalso evaluatedpurchasedecisionsat other a. Is there a significant differenceamongthe threei
prices.Theseresultsare summarizedin the following con- tries with respectto the proportion of top
tingencytable: who believequality is a profession?(Usea:0.05.)
b. If appropriate,apply the Marascuiloprocedureto (
usingcr:0.05.
PRICE
c. Is there a significant differenceamong the di
PIZZERIA $8.49 $11.49 $14.49 Total industrieswith respectto the proportionof
PizzaHut l0 5 2 t7 that measurethe impact of processimprovementiniti
Other 25 23 27 75 tives?(Usecr: 0.05.)
Total 35 28 29 92 d. If appropriate,apply the Marascuiloprocedureto
usingo: 0.05.
a. Using a 0.05 level of significanceand usingthe datain 12.66 A company is considering an organizat
the first contingencytable,is there evidenceofa signif- changeby adopting the use of self-managedwork
icant relationshipbetweena student'sgenderand his or To assessthe attitudesof employeesof the company
her pizzeriaselection? this change,a sampleof 400 employeesis selected
b. What is your answerto (a) if nine of the male students askedwhether they favor the institution of self.
selectedPizzaHrfi and nine selectedother? work teams in the organization.Three responsesare
c. Using a 0.05 level of significanceand usingthe datain mitted: favor,neutral,or oppose.The resultsof the
the secondcontingencytable, is thereevidenceofa dif- cross-classifiedby type ofjob and attitudetoward
ferenceinpizzeia selectionbasedon price? managedwork teams,are summarizedas follows:
d. Determinethep-value in (c) and interpretits meaning.
e. If appropriate,use the Marascuiloprocedureand cr : SELF-MANAGEDWORKTEAMS
0.05 to determinewhich prices are different in terms of TYPEOFJOB Favor Neutral Oppose
pizzeia preference.
Hourlyworker 108 46 7l
12.65 A2004 study by the American Societyfor Quality Supervisor 18 12 30
investigatedexecutives'views toward quality. Top execu- Middlemanagement 35 14 26
tives were askedwhetherthey view quality as a profession Uppermanagement 24 7 9
in the way law, medicine,engineering,and accountingare Total 185 79 136
viewed,or whetherthey seepracticing quality more as the
ability to understandand use a variety oftools and tech- a. At the 0.05 level of significance,is thereevidence
niquesto producea result.Table(l) providesthe responses relationshipbetweenattitudetowardself-managed
to this question,cross-classifiedby the type of industry teamsandtype ofjob?
with which the executiveis involved.A secondquestion
askedwhetherthe executives'companiesactually measure The survey also askedrespondentsabouttheir
the impact of processimprovementinitiatives designedto towardinstitutinga policy wherebyan employeecould
raisethe quality oftheir productsor services.Table(2) pro- one additional vacation day per month without pay.
vides the resultsto this question. by typeofjob, areshownon page
results,cross-classified
('hapterRcvicwProblems 501
VACATIONTIMEWITHOUTPAY SALES
APPROACH
TYPE
OFJOB Favor Neutral 0ppose Total Personal
-are
Hourfyworker t 35 23
ACTION VideotapeSalesCall Telephone
Total
67 225
Supervisor 39 7 14 60 Purchase l9 27 t4 60
M i d d l em a n a g e m e n t 47 6 22 75 Don't purchase g6
_81 73 240
Uppermanagement 26 6 g 40 Total 100
)ce5s 100 100 300
Total U7 A I lt 400
c. At the 0.05 level of stgnificance,is there evidenceof a
lare b. At the 0.05 levcl of significance,is there evidenceof a difference in the proportion of tapes purchasedon the
relationshipbetweenattitudetoward vacationtirne with- basisofthe salesstrategyused?
outpay and type ofjob'/ d. If appropriatc,usethe Marascuiloprocedureand u:0.05
to deterrninewhich salesapproachesare clifferent.
12,67 A companythat producesanclmarketsvideotaped e . O n t h e b a s i so f t h e r e s u l t so f ( c ) a n d ( d ) . w h i c h s a l e s
continuing education programs for the financial industry approachdo you think a representative shoulduse in the
hastraditionallyrrailed sampletapesthat containpreviews future'?Explain the rationalefbr your decision.
rdus- of theprograrnsto prospectivecustomers.Custornersthen
tives agree to purchasethe program tapes or return the sar.nple 12.68 A markct rescarcherinvestigatedconsumerprefer-
) t a p e sA. g r o u p o f s a l e s r e p r e s e n t a t i v e s t L r d i e dh o w t o encesfor Coca-Colaand Pepsibefore a tastetest and after
r (a), increase sales and found that many prospective custorners a t a s t e t e s t . T h e f o l l o w i n g t a b l e s u m m a r i z e st h e r e s u l t s
believed it was difficult to tell from a sample tape alone from a sar.nple of 200 respondents:
)rent whether the educationalprogramswould meet their needs.
tnies Thesalesrepresentatives performed an experimentto test PREFERENCE
AFTER
TASTE
TEST
titia- whether sendingthe completeprogram tapesfor review by PREFERENCE
BEFORE
c u s t o m e rws o u l d i n c r e a s es a l e s .T h e y s e l e c t e d8 0 c u s - TASTE
TEST Coca-Cola Pepsi Total
,(c), tomersfrom the rnailing list and randomly assigned40 to
Coca-Cola 104 6 ltO
receive the sampletapesand 40 to receivethe full-program
Pepsi t4 76 90
onal tapes for review.They then determinedthe nurnberof tapes
Total l 18 82 200
ams. thatwerepurchasedand returnedin eachgroup.The results
vard oftheexperimentare as follows:
a . I s t h e r e e v i d e n c eo f a d i f f e r e n c ei n t h e p r o p o r t i o n o f
and
respondentswho prefer Coca-Colabefore and after the
rged
TYPEOFVIDEOTAPE
RECEIVED tastetests?(Use cr : 0. 10.)
per-
Sample b. Compute the p-value and interpretits meaning.
vey,
c. Show how the following table was derived from the
;elf- Purchased 6 t4 20 table above:
Returned 34 zo 60
Iotal 40 40 80
ts SOFTDRINK
tal PREFERENCE Coca-Cola Pepsi
a.At the 0.05 level of significance,is there evidenceof a
Beforetaste test l l0 90 200
* differencein the proportion of tapes purchasedon the
t0 After taste test 118 82 200
basisof the type of tape sentto the custorner?
t5 Total 228 172 400
b .O n t h e b a s i s o f t h e r e s u l t so f ( a ) , w h i c h t a p e d o y o u
a
)0
thinka representative should sendin the future? Explain
d. Using the secondtable, is there evidenceof a difference
therationalefor your decision.
The sales representativea s lso wanted to determine in preferencefor Coca-Cola before and after the taste
rfa whrchof three initial salesapproachesresult in the most t e s t ' (?U s eo : 0 . 0 5 . )
ork sales: ( I ) a video sales-informationtape rnailedto prospec- e. Determinethe p-value and interpretits meaning.
tivecustomers,(2) a personalsalescall to prospectivecus- f. Explainthe differencein the resulrsof (a) and (d). Which
tomers, and (3) a telephonecall to prospectivecustomers. method of analyzingthe data shouldyou use?Why,?
Jes A r a n d o r n s a m p l e o f 3 0 0 p r o s p e c t i v ec u s t o m e r s w a s 12.69 A market researcherwas interestedin studyingthe
*e selected, and 100 were randomly assignedto each of the effect of advertisementsol1 brand preferenceof new car
'he
threesalesapproaches.The results,in terms of purchases buyers.Prospectivepurchasersofnew cars were first asked
ll. ofthefull-programtapes,are as follows: whether they preferred Toyota or GM and then watched
502 Tests
TWELVEChi-Square
CHAPTER andNonparametric
Tests
Expense ratio-Ratio of expensesto net assetsin For these data, construct contingency tables, using gender,
L
percentage major, plans to go to graduateschool, and employment sta-
!- 2005return-Twelve-monthreturnin 2005 tus. (You need to constructsix tables,taking two variables
Three-yearreturn-Annualized return,2003-2005 at a time.) Analyzethe dataat the 0.05 levelof significance
Five-yearreturn-Annualized return,200I -2005 to determine whether any significant relationships exist
Risk-Risk-of-lossfactor of the mutual fund (low. among these variables.
I average,or high)
) 12,71a. Constructa 2 x 2 contingencytable,using fees 12.76 Problem 1.27 on page 15 describesa survey of 50
astherow variableandobjectiveasthe columnvariable. undergraduatestudents(see the file @E@E@IS&)
L a. Select a sample of 50 undergraduatestudentsat your
b.At the0.05level of significance,is thereevidenceof a
J significantrelationshipbetweenthe objectiveof a school and conducta similar survey for thosestudents.
mutualfundandwhetherthereis a fee? b. For the data collectedin (a), repeatProblem 12.75.
8
12.72a. Constructa 2 x 3 contingencytable,using fees c. Compare the results of (b) to those of Problem 12.75.
I astherow variableandrisk asthe columnvariable.
9
b.At the0.05level of significance,is thereevidenceof a 12.77 Problem 1.28 on page l5 describesa survey of 50
'L relationshipbetweenthe perceivedrisk of a
significant MBA students (see the file [[!!@l!ls). For these data,
J mutualfund and whetherthere is a fee? construct contingency tables, using gender, undergraduate
12,73a. Constructa 3 x 2 contingencytable,using risk majoq graduate major, and employment status. (You need
2 to construct six tables, taking two variables at a time.)
astherow variableandobjectiveasthe columnvariable.
7 b.At the0.05level of significance,is thereevidenceof a Analyze the data at the 0.05 level of significance to deter-
.9
significantrelationshipbetweenthe objective of a mine whether any significant relationships exist among
mutualfund andits perceivedrisk? thesevariables.
12,74a. Constructa 3 x 3 contingencytable,using risk
12.78 Problem 1.28 on page l5 describesa survey of 50
astherow variableandcategoryasthe columnvariable.
MBA students (see the file [[[@!s[E)
b.At the0.05level of significance,is thereevidenceof a
a. Select a sample of 50 graduatestudentsin your MBA
her significantrelationshipbetweenthe categoryof a
program and conduct a similar survey for those students.
een mutualfund andits perceivedrisk?
b. For the data collected in (a), repeat Problem 12.77.
rof Student Survey Database c. Comparethe resultsof (b) to thoseof Problem 12.77.
12.75Probleml.2l on pagel5 describesa surveyof 50
o'
'D
students(seethe file EEEEtrEIEE).
undergraduate
rfa
for
'rDg' ^
rfa
lha
&
pfanaging the Springville Herald
rg.
ofa "l
Phase
eci-
Rwiewingthe resultsof its research,the marketing department 3. Substantial discount for the newspaper. Subscribers
concluded that a segmentof Springville householdsmight be would pay $3.00 per week for the newspaperduring the
rg.
interestedin a discountedtrial home subscriptiontothe Herald. 90-day trial period.
Theteamdecidedto test various discountsbefore determining 4. Discount restaurantcard. Subscriberswould be given a
ard- flretypeof discountto offer during the trial period. It decidedto card providing a discountof l5oh at selectedrestaurants
The conduct an experiment using three types of discounts plus a in Springville during the trial period.
planthatoffered no discount during the trial period:
und l. No discount for the newspaper.Subscriberswould pay Each participant in the experiment was randomly
$4.50per week for the newspaperduring the 90-day trial assignedto a discount plan. A random sampleof 100 sub-
lual period. scribers to each plan during the trial period was tracked to
2,Moderate discount for the newspaper. Subscribers determine how many would continue to subscribeto the
wouldpay $4.00 per week for the newspaperduring the Herald after the trial period. Table SH 12.I summarizesthe
90-davtrial oeriod. results.
504 andNonparametricTests
CHAPTERTWELVEChi-SquareTests
TABLE 5H1 2.1 Number of SubscribersWho Continue SubscriptionsAfter Trial Period with Four Discount
DISCOUNT
PLANS
CONTINUE
SUBSCRIPTIONS No Moderate Substantial Restaurant
AFTER
TRIALPERIOD Discount Discount Discount Card
Yes 34 37 38 6l
No 66 63 62 39
Total 100 100 100 100
EXERCISE
SUNDAY
PURCHASE
BEHAVIOR
SHl2.l Analyzethe results of the experiment.Write a
reportto the teamthat includesyour recommenda- INTEREST No More
tion for which discountplan to use.Be preparedto
INTRIAL Every 2-3 Times/ThanOnce/
SUBSCRIPTION
Sunday Month Month
discussthe limitations and assumptionsof the
experiment. Yes 35 l0 I
No 103 44 23
DO NOT CONTINUE LINTIL THE PHASE 1 EXERCISE Total t38 54 24
HAS BEEN COMPLETED.
INTEREST
INTRIAI
Phase2 SUBSCRIPTION
The marketingdepartmentteamdiscussedthe resultsof the
WHERE
surveypresentedin Chapter8, on pages320-32l.The
PURCHASED Yes No
team realizedthat the evaluationof individual questions
was providing only limited information. In order to further Conveniencestore t2 62
understandthe marketfor home-deliverysubscriptions,the Newsstand/candystore l5 80
datawere organizedin the following cross-classification Vendingmachine l0 ll
tables: Supermarket 5 8
Otherlocations 4 9
READOTHER
NEWSPAPER Total 46 170
HOME
DELIVERY Yes No Total MONDAY-SATURDAY
PURCHASE
BEHAVIOR
Yes 6l 75 136
No 77 139 216 SUNDAY
Total 138 214 352 PURCHASE Every Most Occasionally
BEHAVIOR Day Days or Never
RESTAURANT
CARD EverySunday 55 65 l8
HOME 2-3 times/month 19 23 t2
DELIVERY Yes No Total Once/month 47 l3
Total 78 95 43
Yes 26 ll0 136
No 40 r76 216
EXERCISE
Total 66 286 352
5H12.2 Analyze the results of the cross-classifi
MONDAY-SATURDAY tables.Write a report for the marketing
PURCHASE
BEHAVIOR ment team and discussthe marketingimpli
of the resultsfor the SpringvilleHerald.
INTEREST
INTRIAL Every Most Occasionally
SUBSCRIPTION Day Days or Never Total
Yes 29 l4 J 46
No 49 8l 40 170
Total 78 95 43 216
Rcferences505
Web Case
al Applltyour knou,ledge of'testing./br the clffirenc'e betv'een file in the Web Case folder on the StudentCD-ROM), and
0 tu'oproportions in this Weh Case, n'hich ertends the T.C. examinethe survey data.Then answerthe following:
ResortPropertie,s Using Statistic'.;sc'enario of'this chapter.
I A s T . C . R e s o r t P r o p e r t i e ss e e k st o i m p r o v e i t s c u s -
l. Are the claims made by Sunlow valid'l
r0
tomerservice,the company faces new competition fronl 2. What analysesof the survey data would lead to a more
SunLowResorts.SunLow l.rasrecer.rtly openedresorthotels favorableimpressionaboutT.C. ResortProperties?
o n t h e i s l a n d sw h e r e T . C . R e s o r t P r o p e r t i e sh a s i t s f i v e
3. Perforrnone of the analysesidentified in your answerto
hotels.SunLow is currently advertisingthat a randornsur-
step2.
vey of 300 customersrevealedthat about 600l percentof
thecustomerspreferred its "Concierge Class" travel reward 4. Review the data about the T.C. ResortsPropertiescus-
tal programover the T.C. Resorts "TCPass Plus" program. tomerspresentedin this chapter.Are thereany other fac-
Visit the SunLow Web site, www.prenhall.com/ tors that you might include in a future survey of travel
+6 S p r i n g v i l l e / S u n L o w H o m e . h t m( o r o p e n t h e W e b p a g e rewardprograms'?Explain.
70
16
tal
l8
4
,.1
6
ton
rrt-
)ns
506 EXCELcoMPANIoN to chapter12
E12.1 USINGTHE CHI.SOUART Before you enter contingency table data, many
#DIV/0!. This is not anerror.
cellsdisplaythe message
TESTFORTHE DIFFERENCE
BETWEENTWO PROPORTIONS
UsingChi-Square.xls
You conducta chi-squaretest for the differencebetween
two proportionsby eitherselectingthe PHStat2Chi-Square Opento the ChiSquare2Pworksheetof the
Testfor Differencesin Two Proportionsprocedureor by workbook.This worksheet(seeFigure 12.3on page
making entriesin the![ft![[f[! workbook. usesthe function CHIINV(/evel of signfficance,
of freedom) to computethe critical value and the functi
CHIDIST(Xz test statistic, degreesof freedom)to
Using PHStat2 Chi-SquareTest pute the p-valuefor the Section12.1hotel guestsati
for Differences in Two Proportions tion example.To adaptthis worksheetto other
SelectPHStat ) Two-SampleTests) Chi-SquareTest enterthe problem'scontingencytabledatainto therows
for Differencesin Two Proportions. In the procedure's through 7 Observed Frequenciesarea, edit the title in
dialogbox (shownbelow),enterthe Level of Significance, Al, and changethe level of significancein cell Bl8,i
entera title asthe Title, andclick OK. necessary.
Figure El2.l shows the cell formulas for rows
\1 -qq( sttqNu is Fiqute 12.3. Formulasin
\Krqlr:s-(\
Bl1, A12, Al3, A14, B12, andC12 displaytheronr
column labels enteredin the ObservedFrequencies
\
)owo!t,!!,):,yM
-Wz.aZZAAr,rzatzynrtrk_
themessage
#DM}!. This is not an
K\rSNs\N\)\.iNsulsrsr
\s.u t,ss\ur\ u\eS u'*t r'='s\:N\Nsu\x
qnere\ohtiheets.xls PHStat2t, *
"-u, or bvmakingentries
in
rhefilNFFFTnE*"rff::l:""edure
E12.3 USINGTHECHI.SOUARE
TEST
OF INDEPENDENCE
Adapt the instructionsof the previous sectionto the
createsa worksheetin which vou enter vour chi-squaretest of independence. If you usePHStat2,enter
tabledata,suchasTable12.6on page477, your numberof rowsasthe Number of Rows.If you usethe
Frequencies areathat beginsin row 4. (You @workbook, opento eitherthe chi-
entercustomrow and column labels for your squareworksheets for 3 x 4, 4 x 3, 7 x 3, or 8 x 3, or open
youentercontingencytabledata,manywork- oneof the 2 x c worksheetsmentionedin Section812.2.
displaythe message#DIV/O!.This is not an
E12.4 USINGTHE McNEMAR TEST
You conduct a McNemar test by either selectingthe
PHStat2McNemarTestprocedureor by makingentriesin
Worksheets.xls
Chi-Square
theEEIE@ workbook.
@workbook to the work-
containsthe appropriate 2 x c observedfre-
hblefor your problem.For example,for the guest
Using PHStat2McNemar Test
iondatain Table 12.6on page 471 that requiresa SelectPHStat ) Tivo-SampleTests) McNemar Test.In
table,open to the ChiSquare2x3 worksheet. the procedure's dialogbox (seetop ofpage 508),enterthe
with empty observedfrequenciestablesdis- Level of Significance,click a test option, entera title as
message #DIV/01in many cells.This is not an the Title, andclick OK.
508 EXCELcoMPANIoN to chaoter12
=1{ORMSTNV(B11}
=1{ORMSDTST{8161
=lF{826< 81l, "Rejeclthe null hypothesis", PHStatcreatesa worksheetsimilar to the Fisure12.
"Do nql rejeci the null hypolhesis'J
worksheeton page490.However,PHStat2usesthe
=NORMS|l{V(t -8111 COUNTIF and SUMIF in formulasin cells87, 88, B
=1 - ilORMSDt5T{816}
=lF(B3l < Bll, "Rejedrhe trull hypothesis", andBl I to computethe samplesizeandsumof theranks
"Do not rejecl the null hypothesis"!
eachpopulation.To learn more aboutthesefunctions,
FIGUREE12.2 McNemar All worksheetone-tailtests "Countingand SummingRanks,"laterin this section.
E12.6:UsinstheKruskal-Wallis
RankTest 509
El2.3 Wilcoxonone-tailtests
FIGURE