Professional Documents
Culture Documents
0. Table of contents
1. Introduction
6. Conclusions
1. Introduction
The purpose of this report is to explore the statistical tools ANOVA (one way and two way)
and MANOVA (one way and two way), compare and apply them to a test case coming from
the field of interest of the two authors.
1.1 ANOVA
If the objective is to compare two groups or populations on a single factor, the t-Test is
sufficient. On the other hand "Analysis of Variance" (ANOVA) tests three or more groups for
mean differences based on a continuous (i.e. scale or interval) response variable (a.k.a.
independent variable). The term "factor" refers to the variable that distinguishes this group
membership. The lathe machine that produced a mechanical component is an example of
factor.
1.2 MANOVA
MANOVA stands for Multivariate Analysis of Variance". In basic terms, a MANOVA is an
ANOVA with two or more continuous response variables. Like ANOVA, MANOVA has both a
one-way flavor and a two-way flavor. The number of factor variables involved distinguishes
a one-way MANOVA from a two-way MANOVA:
Hypotheses of ANOVA
Following two hypotheses are considered in ANOVA.
Ho: The (population) means of all groups under consideration are equal.
Ha: Means are not all equal. (Note: This is different than saying they are all
unequal)
Alpha is chosen ahead of time (in this case set to 0.05) and represents the
willingness to falsely reject the null hypothesis if it is true. In practical terms It is the
area of the right
F-critical corresponds to the x coordinate of the distribution that guarantees that
the area on the right of F-critical is (5%) of the total.
The F ratio is computed from the data (see Sect. 2)
The p value is the area on the right of the F ratio.
F ratio
A large F ratio means that the variation among group means is more than you'd expect
to see by chance. A large F ratio is encountered both when the null hypothesis is wrong
(the data are not sampled from populations with the same mean) and when random
sampling happened to end up with large values in some groups and small values in
others.
P value
The P value tests the null hypothesis that data from all groups are drawn from
populations with identical means. Therefore, the P value answers this question:
If all the populations really have the same mean (the fact that the rods come from
different lathe machines is ineffective), what is the chance that random sampling would
result in means as far apart (or more so) as observed in this experiment?
If the overall P value is large (i.e. > 0.05), the data do not give you any reason to
conclude that the means differ. Even if the population means were equal. This is not the
same as saying that If the overall P value is small, then it is unlikely that the differences
you observed are due to random sampling. It is possible to reject the idea that all the
populations have identical means.
Decision criteria
2.1 Data
Data
no. of df= degrees of freedom
samples lathe 1 lathe 2 lathe 3
SS= Sum of squares
1 25.45 25.71 25.64
MS= Mean Square
2 25.20 25.50 25.73
2.2 Solution
The excel Data Analysis tool has been used and cross-checked with step-by-step calculation
ANOVA: Single Factor (Alpha=0.05)
SUMMARY
Groups Count Sum Average Variance
lathe 1 7 177.23 25.319 0.044681
lathe 2 7 179.71 25.673 0.00839
lathe 3 7 180.14 25.734 0.013362
ANOVA
Source of Variation SS df MS F P-value F crit
Between Groups (BG) 0.704924 2 0.3525 15.91649 0.000105 3.554557
Within
Groups(WG)/Error (E) 0.3986 18 0.0221
= ( )2
=1
where nj is the size of sample j (always 7 in this case)
3 7
= = ( )2
=1 =1
3 7
= ( )2
=1 =1
/
=
/
= +
the null hypothesis H0 is rejected, in other words Ha is accepted. The data provide
sufficient evidence to conclude that the means of the rod diameter from the 3-lathe
machines are not all the same.
3.1 Data
Data
3.2 Solution
We are in this case using Matlabs built-in function anova2. The results have been cross checked by
Excels calculations
3.3 Relevant formulas
7
= ( )2
=1
where nk is the size of sample k (always 3 in this case)
/
=
/
/
=
/
= + +
HoB: All means of rod diameters from 7 different operators are equal
4.1 Data
Data Output Diameter
4.2 Solution
Using Matlabs built-in functions:
[d,p,stats] = manova1([Diameter,Roughness],Groups,0.05);
The first output, d, is an estimate of the dimension of the group means. If the means
were all the same, the dimension would be 0, indicating that the means are at the same
point. In the example the dimension is 1, indicating that it is not possible to reject the
hypothesis that the group means fall along a line.
The largest possible dimension for the means of three groups is 2, which would indicate
that the means fall on a plane but not along a line.
The second output, p, is a vector of p-values for a sequence of tests. The first p value
tests whether the dimension is 0, the next whether the dimension is 1, and so on. In this
case the first p-values is small (6.6510e-06), while the second one is large (0.56). That's
why the estimated dimension is 2.
Matlabs function MBox confirms the assumptions on which MANOVA is based:
[MBox] = MBoxtest([Groups,Diameter,Roughness],0.05);
5. Solved Exercise: Two Way MANOVA
In the same above mentioned example, we introduce one more dependent variable or
response: we are now looking at surface roughness as well as the diameter and testing for
two independent factors (operator and machine)
5.1 Data
Data Output Diameter
5.2 Solution
Using Matlabs function maov2:
maov2([Factor1,Factor2,Diam,Roughness],0.05)
It appears that no interaction occurs between factors, therefore two separate multivariate analysis
of variance have been conducted for Factor 1 (Lathe machines) and Factor 2 (Operators).
As observed for the two-way ANOVA, there exists a difference in the components due to the lathe
machines but not due to the operators.
6. Conclusions
It can therefore be concluded that, in the case observed, the lack of uniformity between the
components is not due to the operators training but to the lathe machines. At least one of the lathe
machines tool is not positioned properly but this does not affect the performance of the operators
on it.