Professional Documents
Culture Documents
Introduction
In lecture 1 we
1
Economics 513
Consider
To make progress we start with the assumption that all economic relations
are essentially deterministic, if we include all variables
y = f( x1 , , xW )
2
Economics 513
yi = f( xi1 , , xiW ) , i = 1, , n
yi = 0 + 1 ( xi1 x1 ) + + W ( xiW xW ) +
+ 1 ( xi1 x1 ) 2 + + ( xiW xW ) 2 +
+ 1 ( xi1 x1 )( xi 2 x2 ) +
3
Economics 513
4
Economics 513
To keep the relation simple we concentrate on the linear part. Hence the
observations satisfy
yi = 0 + 1 xi1 + + K 1 xi , K 1 + i
with
K =1
0 = 0 jxj
j =1
The remainder term contains all the omitted terms
i = K ( xiK x K ) + + V ( xiV xV ) +
+ 1 ( xi1 x1 ) 2 + + V ( xiV xV ) 2 +
+ 1 ( xi1 x1 )( xi 2 x2 ) +
5
Economics 513
Note that
f
j = ( x1 , , xW ) , j = 1, , K 1
xj
K 1
0 = f( x1 , , xW ) jxj
j =1
Conclusions:
6
Economics 513
Assume that we have observed xi1 , , xi , K 1 . This does not tell us anything
about the disturbance i that depends on (many) variables beside
x1 , , x K 1 . Hence, even if we know the coefficients 0 , , K 1, we cannot
predict with certainty what yi is.
7
Economics 513
i = yi 0 1 xi1 K 1 xi , K 1
y1 = 0 + 1 x11 + + K 1 x1, K 1 + 1
y n = 0 + 1 xn1 + + K 1 xn, K 1 + n
The distribution of the disturbance reflects the fact that we do not know
the value of the disturbance if we know x1 , , x K 1 .
8
Economics 513
Assumption 1: E( i | xi1 , , xi , K 1 ) = 0
Consider
(1) i = ( xi1 x1 ) + i , i = 1, , n
Substitution gives
yi = 0 x1 + ( 1 + ) xi1 + + K 1 xi , K 1 + i
9
Economics 513
Remember that
f
1 = ( x1 , , xW )
x1
Conclusion: If (1) holds then the coefficient of x1 is not equal to the partial
effect of x1 on y .
10
Economics 513
11
Economics 513
This is the multiple linear regression model. The relation (2) is linear in
x1 , , x K and also linear in 1 , , K . The latter is essential. By re-
interpreting x1 , , x K we can deal with relations that are non-linear in these
variables.
12
Economics 513
Examples:
Polynomial in x
y = 1 + 2 x + 3 x 2 + 4 x 3 +
Define: x1 1, x2 = x, x3 = x 2 , x4 = x 3
13
Economics 513
Log-linear relation
ln y = 1 + 2 ln x +
Define: x1 1, x2 = ln x
Note
ln y
2 =
ln x
14
Economics 513
Semi-log relation
ln y = 1 + 2 x +
Note: no re-definition is needed.
Also
y
ln y y
2 = =
x x
This is a semi-elasticity.
15
Economics 513
16
Economics 513
y = X +
and
E( i | xi ) = 0 , i = 1, , n
with
x1
(4) X =
xn
17
Economics 513
18
Economics 513
CLR model
y = X +
E( | X ) = 0
E( | X ) = 2 I
rank( X ) = K
19
Economics 513
E( i | X ) = 0 , i = 1, , n
E( i | x1 , , xn ) = 0 , i = 1, , n
E( i | xi ) = 0 , i = 1, , n
20
Economics 513
The current assumption states that not only xi but also x j , j i is not
related to i . This is not stronger than the previous assumption if x1 , , x n
are independent as in a random sample from a population. If these are not
independent, as e.g. in time-series data, then this additional assumption
may be too strong.
21
Economics 513
Note
12 1 2 1 n
' = 2 1
n1 n2
Hence Assumption 2 implies that
E( i2 | X ) = 2 , i = 1, , n
22
Economics 513
E( i j | X ) = 0
23
Economics 513
y = 0 + ( 1 + u ) x +
population variation
in coefficient
= 0 + 1 x + + ux
E( + ux | x) = 0
but
Var( + ux | x) = 2 + u x + u2 x 2
with u = E(u ) , u2 = E(u 2 ) . Hence the composite error is heteroskedastic.
24
Economics 513
i = i 1 + ui
Applies in time-series.
Finally, consider Assumption 3.
25
Economics 513
yi = 1 xi1 + 2 xi 2 + 3 xi 3 + 4 xi 4 + i , i = 1, , n
with
x2 = schooling (years)
x3 = age
x4 = potential experience
(age-years in school-6)
26
Economics 513
Hence
x4 = x3 x2 6
27
Economics 513
Then also
~ ~ ~ ~
yi = 1 xi1 + 2 xi 2 + 3 xi 3 + 4 xi 4 + i , i = 1, , n
~ ~
Conclusion: We cannot distinguish between 1 , , 4 and 1 , , 4 . For all
c these parameters are observationally equivalent.
yi = ( 1 6 4 ) xi1 + ( 2 4 ) xi 2 + ( 3 + 4 ) xi 3 + i
28
Economics 513
| X ~ N (0, 2 I ) .
Assumption 4
| X ~ N (0, 2 I )
29
Economics 513
f
j = ( x1 , , xW )
xj
For this result Assumption 1 was necessary.
30
Economics 513
31
Economics 513
32
Economics 513
From the joint frequency in the sample we obtain the sample conditional
frequency distribution (see table 1.2)
(y, x)
f
f(y | x) =
f(x)
33
Economics 513
If not, we can consider the average of y for every value of x . This is the
conditional mean function.
In population
m( x) = E( y | x) = y f( y | x)
y
with estimate
m ( x) = y f( y | x)
y
34
Economics 513
35
Economics 513
Why is m( x) = E( y | x) interesting?
( ) (
+ E (m( x) h( x)) 2 E ( y m( x)) 2 )
and this lower bound is achieved if h( x) = m( x) .
36
Economics 513
h( x) = a + bx
(
min E ( y a bx) 2
a,b
)
First-order conditions with u = y a bx
2 E(u ) = 0 E( y ) = a + b E( x)
2 E(ux) = 0 E( xy ) = a + b E( x 2 )
37
Economics 513
Cov( x, y )
b=
Var( x)
a = E( y ) b E( x)
n
( xi x )( yi y )
b = i =1 n
( xi x ) 2
i =1
a = y bx
38
Economics 513
If E( y | x) = a + bx , we have for u = y a bx
E(u | x) = E( y a bx | x) = E( y E( y | x) | x ) = 0
39
Economics 513
This is weaker:
E(u | x) = 0 E(ux) = 0
but
Hence in that case Assumption 1 of the CLR model is not satisfied but a
weaker uncorrelatedness assumption.
40
Economics 513
1 x12 x1K
X =
1 x xnK
n2
The observations are a sample from a population and we assume that the
joint distribution of y, X is such that the CLR model is the appropriate
statistical model, i.e. y, X satisfy
y = X +
41
Economics 513
E ( | X ) = 0
E ( | X ) = 2 I
42
Economics 513
y1 = x1 + 1
43
Economics 513
Estimation of and 2
b = ( X ' X ) 1 X ' y
44
Economics 513
ys = X s + s , s = 1, , S
45
Economics 513
Consider
From this we can, using the CLR assumptions, find the conditional average
of b given X (in the sampling distribution)
E (b | X ) = + ( X ' X ) 1 X ' E ( | X ) =
E (b) = E X ( E (b | X )) =
In words: under CLR assumptions the OLS estimator is unbiased for .
46
Economics 513
We have
b = ( X ' X ) 1 X '
Upon substitution
(
Var (b) = E ( X ' X ) 1 X ' ' X ( X ' X ) 1 )
47
Economics 513
We have
( )
E ( X ' X ) 1 X ' ' X ( X ' X ) 1 | X =
= ( X ' X ) 1 X ' E ( ' | X ) X ( X ' X ) 1 = 2 ( X ' X ) 1
and hence
(
Var (b) = E X ( 2 ( X ' X ) 1 ) = 2 E ( X ' X ) 1 )
Note that 2 ( X ' X ) 1 is an unbiased estimator of this variance.
48
Economics 513
In special case of constant and one regressor we have the unbiased variance
estimator
2
Var (b2 ) = n
( xi x ) 2
i =1
Note that this decreases with 2 and with the variation in x .
49
Economics 513
bL = Cy
50