You are on page 1of 4

ANOVA One Way Part 1 of 2

Purpose
To compare the means of two or more populations on a continuous CT characteristic. Since we don't know the population means, an analysis of data samples is required. ANOVA (Analysis of Variance) is usually used to determine if there is a statistically significant change in the mean of a CT characteristic under two or more conditions introduced by one factor (see concept Factor and Levels).

Anatomy

ANOVA - One Way - Part 1 of 2


H0 : Ha : =
j

A C

1 i

= .. =
+ +

for at least one pair (i, j)


i ij B

Yij =
One-Way Analysis of Variance
Analysis of Variance Source DF Factor 2 Error 42 Total 44 F Level 1 2 3 Pooled StDev N 15 15 15 = Mean 66.038 79.300 74.584 3.233 D Six Sigma - Tools & Concepts SS 1355.7 438.9 1794.6

MS 677.9 10.4

F 64.87

P 0.000

Individual 95% CIs For Mean Based on Pooled StDev StDev 2.760 3.217 3.658

(--*--) (---*--) (--*---)


65.0 70.0 75.0 E
ANOVA112_001

80.0

Reference:

Juran's Quality Control Handbook - Ch. 26, P. 10-16, Design and Analysis of Experiments by D. C. Montgomery Ch. 3, P. 67-79

Terminology
A. Null (H0) and alternative (Ha) hypotheses where the means ( 1, 2, , g)of the g levels of the factor are compared. There is only one alternative hypotheses: which is at least the means of two levels are significantly different. (Note: these hypotheses correspond to a fixed effects model, for more information see reference above.) B. Model where yij is the (ij)th observation of the CT characteristic (i = 1, 2, , g and j = 1, 2, , n, for g levels of size n), is the overall mean, i is the ith level effect and ij is an error component. C. Minitab session window output. D. Descriptive Statistics Sample sizes, means, standard deviations (StDev), and the Pooled Standard Deviation, which is a combination of the standard deviations of the g groups. E. Confidence Intervals around the mean for each individual level. F. ANOVA Table (see part 2).

Major Considerations
The assumptions for using this tool is that the data comes from independent random samples taken from normally distributed populations, with the same variance. When using ANOVA, we are using a model where its adequacy has to be verified using residual analysis (see tool Residual Plots).

Application Cookbook
1. 2. 3. 4. 5. 6. Define problem and state the objective of the study. State Null and Alternative Hypothesis. Establish sample size (see tool Sample Size Continuous Data One Way ANOVA). Select random samples. Measure the CT characteristic. Analyze data with Minitab (part 1 of 2): In order to fully use the Minitab functions associated with ANOVA it is recommended that the data be stacked into one column and a second column to contain the group codes. Using the function under Manip > Stack/Unstack > Stack Columns. See Part 2 for continuation of the application cookbook.

ANOVA One Way Part 2 of 2 (ANOVA Table)


Purpose
To summarize the results of an analysis of variance calculation in a table.

Anatomy

ANOVA - One Way - Part 2 of 2

One-Way Analysis of Variance


Analysis of Variance Source DF Factor 2 Error 42 Total 44
B

SS 1355.7 438.9 1794.6


C

MS 677.9 10.4
D

F 64.87

P 0.000

Six Sigma - Tools & Concepts

Anova122_001

Reference:

Juran's Quality Control Handbook Ch. 26, P. 10-16, Design and Analysis of Experiments by D. C. Montgomery Ch. 3, P. 67-79

Terminology
A. Source Indicates the different variation sources decomposed in the ANOVA table. "Factor" represents the variation introduced between the factor levels. The "Error" is the variation within each of the factor levels. The "Total" is the total variation in the CT characteristic. B. DF The number of degrees of freedom related to each sum of square (SS). They are the denominators of the estimate of variance. C. SS The sums of squares measure the variability associated with each source. They are the variance estimate's numerators. "Factor" SS is due to the change in factor levels. The larger the difference between the means of a factor levels, the larger the factor sum of squares will be. The "Error" SS is due to the variation within each factor level. The "Total" SS is the sum of the Factor and Error sum of squares (see tool Sum of Squares). D. MS Mean Square is the estimate of the variance for the factor and error sources. Computed by MS = SS/DF. E. F The ratio of the mean square for the "Factor" and the mean square for the "Error". F. P-Value This value has to be compared with the alpha level ( ) and the following decision rule is used: if P < , reject H0 and accept Ha with (1-P)100% confidence; if P , don't reject H0.

Major Considerations
The assumptions for using this tool is that the data comes from independent random samples taken from normally distributed populations, with the same variance. When using ANOVA we are using a model where its adequacy has to be verified using residual analysis (see tool Residual Plots).

Application Cookbook
1. Analyze the data with Minitab (part 2 of 2): Verify the assumption of equality of variance for all the levels with the function under Stat > ANOVA > Homogeneity of Variance (to interpret this analysis, see tool Homogeneity of Variance Tests). Use the function under Stat > ANOVA > One Way. Input the name of the column which contains the measurement of the CT characteristic into the 'Response' field, and the name of the column that contains the level codes into the 'Factor' field. In order to verify the assumption of the model, select the options 'Store Residuals' and 'Store Fits'. Select the graphs option and highlight all the available residual Plots (to interpret these plots see tool Residual Plots). In the event of non-compliance with either of these assumptions, the results of the analysis of variance may be distorted. In this case, the use of graphical tools such as a Box Plot can be used depict the location of the means and the variation associated with each factor level. In the case of outliers, these should be investigated. 2. Make a Statistical decision from the session window output of Minitab. Either accept or reject H0. When H0 is rejected we can conclude that there is a significant difference between the means of the levels. 3. Translate statistical conclusion into practical decision about the CT characteristic.

You might also like