Professional Documents
Culture Documents
Division of Trauma and Critical Care, Department of Surgery, Medical College of Wisconsin, Milwaukee,
Wisconsin
Journal of Surgical Education • © 2007 by the Association of Program Directors in Surgery 1931-7204/07/$30.00 93
Published by Elsevier Inc. doi:10.1016/j.jsurg.2007.02.001
patients greater than 65 years of age. H1: One of these groups is
different than the other groups. Table 1 shows the data from
such a study.
The F-statistic for this data set is 2.34. As with the t- and
z-statistics, the F-statistic is compared with a table to determine
whether it is greater than the critical value. In interpreting the
F-statistic, the degrees of freedom for both the numerator and
the denominator are required. The degrees of freedom in the
numerator is the number of groups minus 1 (2 in this example),
and the degrees of freedom in the denominator is the number of
data points minus the number of groups (197 in this example).
The critical value for the F-statistic in this case is 3.04; there-
fore, the null hypothesis cannot be rejected for this data set. The
groups are not statistically different.
This comparison is a basic ANOVA; more and more vari-
ables can be added, which increases the mathematical complex-
ity, although the concepts remain the same.
FIGURE 1. Framingham study data— diastolic versus systolic blood pres- NONPARAMETRIC TESTS
sure of individual patients.
If the data do not meet the criteria for a parametric test (nor-
mally distributed, equal variance, and continuous), it must be
mean height is 168.6 cm with a standard deviation of 5.7 6cm. analyzed with a nonparametric test. If a nonparametric test is
The t-value is 1.88; therefore, the mean heights are not statis- required, more data will be needed to make the same conclu-
tically different. sion. For this reason, categorical data are often converted to
continuous data before analysis. Many nonparametric tests and
The z-Test multiple variations of each of those specific tests exist. Discus-
The next test, which is very similar to the Student t-test, is the sion will be limited to the few that are most used and corre-
z-test. However, with the z-test, the variance of the standard spond to the previously discussed parametric tests.
population, rather than the standard deviation of the study
groups, is used to obtain the z-test statistic. Chi-Squared
Using the z-chart, like the t-table, we see what percentage of
the standard population is outside the mean of the sample pop- The chi-squared test is usually used to compare multiple groups
ulation. If, like the t-test, greater than 95% of the standard where the input variable and the output variable are binary.
population is on one side of the mean, the p-value is less than This test is very well illustrated with an example of a 2 ⫻ 2 table
0.05 and statistical significance is achieved. of data comparing 2 groups, one receiving a treatment and the
As some assumption of sample size exists in the calculation of other not, and 2 outcomes of cure versus continued disease
the z-test, it should not be used if sample size is less than 30. If (Table 2).
both the n and the standard deviation of both groups are The expected data table was constructed by taking the totals
known, a two sample t-test is best. of the rows, multiplying them by the column totals, and then
dividing by the grand total of subjects. Next a critical value is
ANOVA determined. The degrees of freedom for a chi-square table are the
number of rows minus one times the number of columns minus
Analysis of variance (ANOVA) is a test that incorporates means one. For a 2 ⫻ 2 table, the degrees of freedom is always 1. For this
and variances to determine the test statistic. The test statistic is example, the calculated chi-squared statistic is 27.5, and the
then used to determine whether groups of data are the same or
different. When hypothesis testing is being performed with
ANOVA, the null hypothesis is stated such that all groups are
the same. The test statistic for ANOVA is called the F-ratio. TABLE 1. Example ANOVA. Pulse rate in elderly trauma patients.
As an example, ANOVA is used to compare values for pulse All
rate in 3 groups of trauma patients over 65, one group without Pulse MI ⴙ Ø MI ⴚ Ø Control groups
comorbidities (control), a second group status post-heart attack Mean 80.25 85.77 88.93 87.11
without beta blocker treatment, and the third group status post- Standard 16.81 16.59 19.96 18.97
heart attack on beta blocker therapy. H0: Myocardial infarction deviation
Patients 25 48 127 200
and beta blocker therapy do not affect pulse rate in trauma
Treatment ⴙ ⴚ Total
Observed data
⫹ 45 20 65
⫺ 5 30 35
total 50 50 100
Expected Data
⫹ 32.5 32.5 65
⫺ 17.5 17.5 35
total 50 50 100
critical value is 3.841, which allows us to reject the null hypoth- FIGURE 2. The test statistic for this data set is ⫺0.71. It seems as though
age negatively correlates with spontaneous tidal volume.
esis; the treatment positively affects outcome.
Along the same lines as the chi-squared test is Fisher exact
test. This test is complicated to calculate but is used if the table
of data are ranked together. Once the 2 test statistics are
of expected outcomes has any single value of less than 2 or if
calculated, the smaller one is used to determine significance.
more than 20% of the values are less than 5. The test then
Unlike the previous tests, the null hypothesis is rejected if
compares the probability that the study data could have ran-
the test statistic is less than the critical value. The U-value
domly been acquired and compares it with the probabilities of
table is not as widely available as the previous tables, but
all expected data tables that could be generated with n subjects.
most statistic software will give a p-value and state whether
statistical difference exists. Two valuable websites that pro-
Spearman Rank Coefficient
vide this information are (http://fsweb.berry.edu/academic/
Like the Pearson product correlation coefficient, the Spearman education/vbissonnette/tables/mwu.pdf) and (http://www.
rank coefficient is calculated to determine how well 2 variables physicalgeography.net/fundamentals/3g.html).
for individual data points can predict each other. The difference The following example will compare different pain medications
is that the data need not be linear. To start, it is easiest to graph and their effectiveness in controlling pain after inguinal hernia
all the data points and find the x and y values. Then rank each x repair. There are 24 patients, and each report their pain score on
and y value in order of occurrence. Similar to the Pearson correla- postoperative day 1 before going to bed. The first group receives
tion coefficient, the test statistic is from ⫺1 to 1, with ⫺1 being a oxycodone/acetaminophen, and the other group receives ibupro-
perfect negative correlation and 1 a perfect positive correlation. fen/acetaminophen, each with a dosage of 1 to 2 tablets every 4
Table 3 and Fig. 2 show spontaneous tidal volume from a ran- hours. Table 4 illustrates their pain scores and relative ranks.
dom sample of patients on any given day at a family practice clinic. The test statistic is less than the critical value. The null hy-
pothesis that these 2 pain control regimens are equivalent
Mann-Whitney U Test should be rejected.
This test, sometimes referred to as Wilcoxon rank test, uses rank
just as the previous test did. It is analogous to the t-test for
continuous variable but can be used for ordinal data. This TABLE 4. Example Mann-Whitney U test (Wilcoxon Rank test).
test compares 2 independent populations to determine Effectiveness of pain medication regimens.
whether they are different. The sample values from both sets Oxycodone Ibuprofen APAP
APAP score Ranks score Ranks
1 1 4 6
TABLE 3. Example Spearman Rank Coefficient. Spontaneous 2 2.5 5 9
tidal volume and age. 2 2.5 6 13
Rank (age) Age Tidal volume (ml) Rank (TV) 3 4 6 13
4 6 6 13
1 12 954 8 4 6 7 17.5
2 15 785 5 5 9 7 17.5
3 24 806 7 5 9 7 17.5
4 45 458 3 6 13 7 17.5
5 54 785 6 6 13 8 21.5
6 60 325 2 8 21.5 8 21.5
7 78 154 1 8 21.5 9 24
8 86 621 4 Sum ranks 109 Sum ranks 191