Professional Documents
Culture Documents
There are variables in nature that are related in such a way that if we know one of
them, the others can be estimated. For example, bright parents will most likely have
bright children. So if we know the IQ of the parents, we can make an educated guess of
their children’s IQ. The farther you travel in a vehicle, the more gasoline you consume.
The higher the sun in the horizon, the shorter is the shadow of the objects.
CORRELATION
-1 -0.5 0 +0.5 +1
n n
The term n – 1 is used for samples, while n is used when dealing with
populations. The location of n – 1 or n in the denominator makes rxy dependent on the
size of the sample.
n n n
n ∑ (xi ) 2 – (∑ xi ) 2 n ∑ (yi ) 2
- (∑ yi ) 2
6∑ (xi - yi) ²
i=1
rs = 1 - ------------------
n ( n² – 1)
PHI COEFFICIENT
Variable x
1 2
1 a b a+b
Variable y
2 c d c+ d
a+c b+d
ad – bc
rφ = ---------------------------------------------------------------------------------------
------------------------------------------------------------------------
(a + b)(c + d)(a + c)(b +d)
This formula was first derived by Carl Pearson in 1901.
POINT-BISERAL CORRELATION
There is another correlation that is a special case of the Pearson product moment
correlation. It is called the Point-Biseral Correlation rpb . It correlates a real dichotomous
variable with an interval variable. For example, the score x in a test correlated with
gender y categorized as male (1) or female (0).
xx̅1 - xx̅0 n1 no
rpb = ----------- ---------
sx n(n-1)
where xx̅1 = the mean of those which are labeled 1 in the real dichotomous, y
xx̅0 = the mean of those which are labeled 0 in the real dichotomous, y