Professional Documents
Culture Documents
Two-Stage Least-Squares Regression (2SLS): Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
2 Key Concepts ***** Two-Stage Least-Squares Regression (2SLS) In a Nonrecursive Path Analysis
Recursisve and nonrecursive path models Causal loops Problems in OLS regression when a predictor variable is related to an error term Regression coefficient (b) will be biased Confidence interval of b will not be efficient There may be a spurious effect present b may be overestimated or underestimated The value of b may be suppressed or have the wrong sign How to test whether the error terms in a path model are independent The logic of 2SLS regression The concept of an instrumental variable Criteria of a good instrumental variable The concept of a lagged variable Construction of the equations in 2SLS DITS analysis in 2SLS SPSS 2SLS
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
Lecture Outline
The problem of non-independent error terms in path analysis Recursive and nonrecursive path models The concept of two-stage least-squares regression The concept of an instrumental variable 2SLS regression in phases SPSS 2SLS in one step DITS analysis of results
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
The Problem
In linear regression, if a predictor variable Xi is related to the error term ej bij will be a biased estimate of Fij and The confidence interval of bij will not be efficient
In addition, a spurious relationship may be present with the result that: bij may be overestimated or bij may be underestimated with the result that The spurious effect acts as a suppressor on bij or bij will have the wrong algebraic sign
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
X3
X1
I1
X2
X4
I4
Nonrecursive Model
Causal relationships are bi-directional creating causal loops
X3
X1
I1
X2
X4
I4
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
Nonrecursive Model
Correlated Error Terms
X3
X1
I1
X2 Interpretation of relationships
X4
I4
X1 is nonrecursively & causally related to X4 X1 may be indirectly related to I4 via X4 X4 may be indirectly related to I1 via X1
I1 may be correlated I4
If these conditions exist
The assumptions of OLS will be violated. OLS regression assumes unidirectional relationships & independence of error terms
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
An Example
Let us assume that we are interested in predicting a state's correctional population (X4 state), the total number of people in prison and on parole. Assume further that The index crime rate (X3 crm_indx), The number of arrests (X2 arrests) and The total correctional population (X1 com_corr) in the state's largest county are causes of the state's population Finally, assume that the state's population is partly a cause of the county's population Since parolees who are arrested become part of the county's population
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
crime index
com_corr
X3
X1
I1
X2
arrests
X4
state
I4
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
9 An Example (cont.)
Interpretation The county's correctional population effects the state population and The state's population effects the county's population through parole releases.
Therefore The county's population may be indirectly related to I4 via the state's population The state's population may be indirectly related to the county's population And I1 may be correlated I4
I1 via
If this is the case OLS assumptions will be violated in estimating the model
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
10
The Problem
Equation #1 may be solved without bias using OLS regression Equations #2 & #3 may not be solvable without the potential for bias using OLS regression
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
11
Interpretation
Correlation between I1 and I4 (r = 0.198) Slight correlation between I1 and X4 (r = -0.058) Slight correlation between I4 and X1 (-0.084)
X3
X1
I1
-0.058 +0.198
X2
X4
I4
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
12
I4
Step 4 Use the predictions (pre_1) of X1, instead of X1 , to predict X4, Since they are that part of X1 which is not caused by X4 and are independent of I4
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
13
An alternative method involves creating an instrumental variable (Ij) by lagging the troublesome variable (X1). This assumes that the nonrecursive relationship between X1 & X4 is contemporaneous and will disappear when X1 is lagged.
The instrumental variable Ij Ij = f(X1 lag = +1) Shifting the cases of variable X1 by one case
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
14
Case Number
1 2 3 4 N N+1
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
15
state 110331.66 160400.00 110331.66 140374.17 130360.56 170412.31 140374.17 150387.30 120346.41 180424.26 . .
com_corr 20141.42 18134.16 22148.32 21144.91 27164.32 26161.25 25158.11 27164.32 30173.21 28167.33 . .
arrests crm_indx 810900.00 840916.52 850921.95 850921.95 880938.08 900948.68 890943.40 940969.54 950974.68 990994.99 2101449.14 2901702.94 1751322.88 2301516.58 2051431.78 2801673.32 2401549.19 2501581.14 1951396.42 2601612.45
lag_com . . 20141.00 18134.00 22148.00 21145.00 27164.00 26161.00 25158.00 27164.00 30173.00 28167.00
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
16
X5
crime index com_corr
X3
X1
I1
X2
arrests
X4
state
I4
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
18
Model 1
Variables Removed .
Method Enter
Mo el Su
Model 1
R .919 a
R Square .844
Model 1
df 3 5 8
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
ry Adjusted R Square .750 Std. Error of the Estimate 1917.5058 Mean Square 33088717.21 3676828.498 F 8.999 Sig. .019 a
ffici
Predi ted Val e Resid al Std. Resid al Std. Predi ted Val e
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
Minim m 18794.91
Maxim m 30026.32
Resi
Model 1
a ls t tisti s
Unstandardized Coeffi ients B Std. Error (Constant) -8567. 73 20947.224 LAG_COM .418 .418 ARRESTS 3.761E-02 .030 CRM_INDX -4.37E-03 .002
N 9 9 9 9
20
Pr
ict
-1779.79
8. 84E-13
1515.9214 1. .791
t . Pr t . .
ict
-1.775
-.928
ri
l :
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
$$$
11 D4 A 5 5 @ 6 CB CB & ) )( &' & & ) )0 ' 2&1 ' 3 &0 ) ( '& '& ' 3 )0 ' 2&1 &0 )( '& '&
i im m 18794.91
xim m 26.32
% 5 8 % 7&6 ) '3
$ 0 $)4
Resi
a als tatisti s
"
t . i ti 3 22. 373
0 54
9 9 9 9
21
Predictions: pre_1
state 110331.66 160400.00 110331.66 140374.17 130360.56 170412.31 140374.17 150387.30 120346.41 180424.26 . .
com_corr 20141.42 18134.16 22148.32 21144.91 27164.32 26161.25 25158.11 27164.32 30173.21 28167.33 . .
arrests crm_indx 810900.00 840916.52 850921.95 850921.95 880938.08 900948.68 890943.40 940969.54 950974.68 990994.99
lag_com
pre_1
2101449.14 2901702.94 1751322.88 2301516.58 2051431.78 2801673.32 2401549.19 2501581.14 1951396.42 2601612.45
. . 20141.00 18134.00 22148.00 21145.00 27164.00 26161.00 25158.00 27164.00 30173.00 28167.00
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
22
X2 Arrests
X4 State
(1.0)
I4 Error
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
23
Model 1
Variables Removed
Method
Enter
Model
Model 1
R .953 a
R Square .909
a. redi tors: (Constant), CRM_INDX, ARRESTS, Unstandardized redi ted Value b. Dependent Variable: STATE
Model 1
df 3 5 8
a. redi tors: (Constant), CRM_INDX, ARRESTS, Unstandardized redi ted Value b. Dependent Variable: STATE
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
U Q V
R IIH G QP
ar
S S
F 16.586
Sig. .005 a
a oefficients
Unstandardi ed Coeffi ients Model 1 (Constant) Unstandardi d Predi ted Val e ARRESTS CRM_INDX B -103846
Beta
a Residuals tatistics
Predi ted Val e Residual Std. Residual Std. Predi ted Value
-1.747 -1.110
Regression model
X4 = -103846 - 0.215 (pre_1) + 0.147 (X2) + 0.05149 (X3)
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
W a YX ` a ` `
Standardi ed Coeffi ien ts t -.799 -.047 .487 3.097 Sig. .460 .964 .647 .027
N 9 9 9 9
25
lag_com X5 +0.418 Crime index -0.004 X3 +0.038 +0.027 -0.215 0.052 X2 Arrests +0.147 X4 State (1.0) I4 Error r = 0.0
pre_1
Regression equations X2 = 827219 + 0.027 X3 X1 = - 8567.07 + 0.418(X5) - 0.0376(X2) -0.00437(X3) X4 = -103846 - 0.215 (pre_1) + 0.147 (X2) + 0.05149 (X3)
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
26
When the proxy variable for the current years county corrections
population (pre_1) increases by one offender, the states correctional population (X4) decreases by -0.215 offenders. A ratio of +1000 to 215 The proxy variable for the current years county corrections population (pre_1) Declines by -0.004 for a unit increase in the crime index (X3) (+1000 to 4) Increases by +0.038 for a unit increase in arrests (X2) (+1000 to +38)
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
27
Correlation of pre_1 and X4 (state) r = 0.0 In the original path model the correlation between X1 (com_corr) and e4 was r = 0.231
lag_com X5 +0.418 Crime index -0.004 X3 +0.038 +0.027 -0.215 0.052 X2 Arrests +0.147 X4 State (1.0) I4 Error r = 0.0
pre_1
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
28
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
29
Multiple R .95406 R Square .91023 Adjusted R Square .85636 Standard Error 8710.97783
Analysis of Variance: DF Sum of Squares 3 5 Mean Square 3846809975.3 379405674.1 Signif F = 1282269991.8 75881134.8 .0048
Regression Residuals F =
16.89840
------------------ Variables in the Equation -----------------Variable B SE B Beta -.035774 .868543 .330126 T -.047 3.126 .492 -.807 Sig T .9641 .0261 .6436 .4564
COM_CORR -.214591 4.535006 CRM_INDX .051487 .016472 ARRESTS .147136 .299121 (Constant)-103845.71777 128702.8891
Correlation Matrix of Parameter Estimates
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
30
Comparison of Results
The table below compares the results of the SPSS 2SLS in one step with the previous analysis involving multiple steps.
Variable
SPSS 2SLS +0.0515 s0.0165 +0.1471 s0.2992 -0.2146 s4.5350 0.9102 0.0898 s8710.98
Previous Analysis +0.0515 s0.0166 +0.1472 s0.3019 -0.2149 s4.5780 0.9090 0.0913 s8792.72
Crm_indx (X3) Arrests (X2) Com_corr Pre (X1) Goodness Of Fit Indicators
There are slight differences in the error terms since SPSS adjusts the error for the accuracy with which the instrumental variable predicts com_corr (X1). Otherwise the results are identical within rounding error
31
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
32
Total Effects: T = (D + I)
T41 = D41 = -0.215 T42 = D12 + I412= (0.147 0.00817) = +0.13883 T43 = D43 + I413 + I423 + I4123 = (0.052 + 0.00086 + 0.00397 + 0.00022) = +0.05705 T45 = I415 = +0. 08987 T12 = D12 = +0.038 T13 = D13 = -0.004 T15 = D15 = +0.418 T23 = D23 = +0.027
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
33
Variable D
Crm_indx (X3)
T +5.7
S 0.00
+5.2
+14.7 -21.5
-0.817 (1) NA
+13.9 -21.5
0.00 0.00
Variable D
Crm_indx (X3) Arrests (X2) Lag_com (X5)
Effects on County Corrections (X1) I via (Xj) +0.10 (2) NA NA T -0.4 +3.8 +41.8 S 0.00 0.00 0.00
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University
34
Phase 2
Two-Stage Least-Squares Regression: Charles M. Friel Ph.D., Criminal Justice Center, Sam Houston State University