Professional Documents
Culture Documents
Lecture 7
Heteroskedasticity and
some further diagnostic testing
INEMET [U13783]
Topics to be covered
Heteroskedasticity
Some further diagnostic testing
Normality of the disturbances
Multicollinearity
INEMET [U13783]
Econometric problems
INEMET [U13783]
Heteroskedasticity
What does it mean? The variance of the error term is not
constant
What are its consequences? The least squares results
are no longer efficient and t tests and F tests results may be
misleading
How can you detect the problem? Plot the residuals against
each of the regressors or use one of the more formal tests
How can I remedy the problem? Respecify the model look for
other missing variables; perhaps take logs or choose some
other appropriate functional form; or make sure relevant
variables are expressed per capita
INEMET [U13783]
400
300
200
100
0
0
100
200
300
400
Income
INEMET [U13783]
INEMET [U13783]
INEMET [U13783]
INEMET [U13783]
Detecting heteroskedasticity
Visual inspection of scatter diagram or
the residuals
Goldfeld-Quandt test
suitable for a simple form of
heteroskedasticity
Breusch-Pagan test
a test of more general forms of
heteroskedastcity
INEMET [U13783]
Residual plots
Plot residuals against one variable at a time
INEMET [U13783]
INEMET [U13783]
Breusch-Pagan test
Regress the squared residuals on a constant,
the original regressors, the original regressors
squared and, if enough data, the cross-products
of the Xs
The null hypothesis of no heteroskedasticity will
be rejected if the value of the test statistic is too
high (P-value too low)
Both c2 and F forms are available in PcGive
INEMET [U13783]
Remedies
Respecification of the model
Include relevant omitted variable(s)
Express model in log-linear form or some other
appropriate functional form
Express variables in per capita form
ARCH
Note: with time series data, particularly highfrequency data (for example daily or hourly
financial data) a special form of
heteroskedasticity called Autoregressive
Conditional Heteroskedasticty (ARCH) may be
present
We can see it graphically as excessive volatility
of the time series in certain short bursts
I will say more about this when we look in more
detail at dynamic models
INEMET [U13783]
INEMET [U13783]
Reset test
INEMET [U13783]
Multicollinearity
What does it mean? A high degree of correlation amongst the
explanatory variables
What are its consequences? It may be difficult to separate out
the effects of the individual regressors. Standard errors may
be overestimated and t-values depressed.
Note: a symptom may be high R2 but low t-values
How can you detect the problem? Examine the correlation
matrix of regressors - also carry out auxiliary regressions
amongst the regressors.
Look at the Variance Inflation Factors
NOTE:
be careful not to apply t tests mechanically without checking for multicollinearity
multicollinearity is a data problem, not a misspecification problem
INEMET [U13783]
INEMET [U13783]