Professional Documents
Culture Documents
Test if sample representative of the population Test if sample representative of the population Test differences Test differences Test relationship Test relationship Test if observed frequencies fit expected frequencies Test relationship
One Sample Case Z-test ( and known) One Sample Case t-test ( unknown) Independent (or dependent) t-test One-Way ANOVA Pearson Correlation Point Biserial Chi-Square Test of Goodness-of-Fit Chi-Square Test of Independence/Association 1
A One-Way ANOVA is used when comparing two or more group means on a continuous dependent variable. The independent t-test is a special case of the OneWay ANOVA for situations where there are only two group means.
Therefore, if there are only two groups, an Independent t-test is a shortcut to using a One-Way ANOVA, but provides the same results. It is an extension of Independent t-test. For situations involving more than two groups, a OneWay ANOVA must be used.
The IV is categorical. This categorical IV can be dichotomous (i.e., two groups) or it can have more than two groups (i.e., three or more groups). The DV is continuous Data are collected on both variables for each person in the study.
Is there a significant difference among freshmen, sophomores, and juniors on college GPA? OR Is there a significant difference in GPA by class level? Note that GPA is continuous and class standing is categorical (freshmen, sophomores, or juniors)
Is there a significant difference in student attitudes toward the course between students who pass or fail a course? Note that student attitude is continuous and passing a course is dichotomous (pass/fail). Because the IV has only two groups, an independent t-test or rpb could also be used here.
6
Does student satisfaction significantly differ by location of institution (rural, urban, suburban)? Note that student satisfaction is continuous and institution location is categorical.
Is there a significant difference in weight loss when dieting, exercising, and dieting and exercising? Is there a significant difference in childs selfesteem by parenting style (authoritative, permissive, and authoritarian)? Is there a significant difference among those who condition, blow-dry, or only shampoo in hair frizziness?
8
Any experiment with k groups has k X (k-1)/2 different pairs available for testing. k=3 k X (3-1) / 2 = k X 3 (2/2) = 3(1) = 3 different pairs available for testing
10
= number of t-tests
11
The populations from which the samples were obtained must be normally or approximately normally distributed. The samples must be independent. The variances of the populations must be equal.
13
The One-Way ANOVA separates the total variance (hence the term analysis of variance) in the continuous dependent variable into two components: variability between the groups and variability within the groups.
Variability between the groups is calculated by first obtaining the sums of squares between groups (SSb), or the sum of the squared differences between each individual group mean from the grand mean.
Variability within the groups is calculated by first obtaining the sums of squares within groups (SSw) or the sum of the squared differences between each individual score and that individuals group mean.
14
The linear model, conceptually, is: SSt = SSb + SSw, where SSt is the total sums of squares. Note the simplicity of statistical models: They are linear depictions of phenomena, splitting variability into (1) what we can measure systematically (differences between groups) and (2) what we cant explain or account for systematically (differences within groups). Variability within the groups is random variance or noise; it reflects individual differences from the group mean and sometimes prevents us from seeing difference between groups. Variability between the groups is systematic variance, it reflects differences among the groups due to the experimental treatment or characteristics of group membership.
15
N 1 We can see that the numerator is a sum of squared values (or a sums of squares), and the denominator is the degrees of freedom. Thus, the formula can be rewritten as:
(x - x) 2
SS
df
16
The Analysis of Variance (ANOVA) analyzes the ratio of the variance between groups (i.e., how far apart the group means are from one another) to the variance within the groups (i.e., how much variability there is among the scores within a single group). In ANOVA, these variances, formerly known to us as 2, are referred as mean squares (MS) or the average of the sums of squares (SS/df). Thus, a mean square between is simply the variance between groups obtained by a sums of squares divided by degrees of freedom (SSb/dfb). Likewise, a mean square within is simply the variance within the groups also obtained by a sums of squares divided by degrees of freedom (SSw/dfw). Both are shown below:
MSb
SSb
dfb
MSw
SSw
df w
17
Table 1. Scores and means of the comparison groups and the total group
Between Method
1 1 4 3 2 5 1 6
2 8 6 7 4 3 5
Within
3 7 6 4 9 8 5 7 5 8
Within
18
Within n mean
3.14
5.50
6.38
Within Between
19
The MSb and the MSw are then divided to obtain the F ratio for hypothesis testing, i.e., F is the ratio of MSb and the MSw MS
MSw
As in all statistical tests, the larger the numerator (for example, the larger the difference between groups), the larger the test statistic (whether it be r, t, or F), and the more likely we are to reject the null hypothesis.
Also, the smaller the denominator (the less variability among people in a group), the larger the test statistic, and the more likely we are to reject the null hypothesis.
20
The diagrams below show the impact of increasing the numerator of the test statistic. Note that the within group variability (the denominator of the equation) is the same in situations A and B. However, the between group variability is greater in A than it is in B. This means that the F ratio for A will be larger than for B, and thus is more likely to be significant.
21
The diagrams below show the impact of decreasing the denominator of the test statistic. Note that the between group variability (the difference between group means) is the same in situations C and D. However, the within group variability is greater in D than it is in C. This means that the F ratio for C will be larger than for D, and thus is more likely to be significant.
22
Major Concepts: t2 = F
Diagrams A through D on the previous slides could also be used to illustrate the t statistic. As two groups differ more in their means (increase in numerator and effect size), the larger the value of t and the more likely we are to find significant results. The lower the within-group variability (decrease in noise), the smaller the denominator and the more likely we are to find significance. This analogy can be made to ANOVA because t is just a special case of ANOVA when only two groups comprise the independent variable. We are familiar with the t distribution as normally distributed (for large df), with positive and negative values. The F statistic, on the other hand, is positively skewed, and is comprised of squared values. Thus, for any two group situation, t2 = F.
23
Distribution of F ratio
F distribution is positively skewed. If F statistic falls near 1.0, then most likely the null is true. If F statistic is large, expect null is false. Thus, significant F ratios will be in the tail of the F distribution.
24
25
Summary of Calculations
dfb groups = k 1 where k is # of groups dfw groups = N k where N is total # of individuals in groups
27
28
Within
(N K)
29
30
3. Calculate SST: Square each individual score. Sum the squared scores across all groups and subtract correction factor.
31
N-K
SS MS df
B
B B
K-1
MS
W
MS
32
33
34
Interpreting Tables
Treatment A
M SE 2.5 .43
N-K
Treatment B
6.0 .58
K-1
Treatment C
0.5 0.22
ANOVA Summary SSb/dfb
SS 93 17 110
df 2 15 17
MS 46.5 1.13
N-1
F 41.02
MSb/MSw
SSw/dfw
35
36