Professional Documents
Culture Documents
Regression
The Data
EVAC
0
0
0
1
1
0
0
0
0
0
0
0
1
PETS
1
1
1
1
0
0
0
1
1
0
0
1
1
Rented
0
0
1
1
0
0
0
0
0
0
0
0
1
TENURE
16
26
11
1
5
34
3
3
10
2
2
25
20
EDUC
16
12
13
10
12
12
14
16
12
18
12
16
12
OLS Results
Variable
(Constant)
PETS
MOBLHOME
TENURE
EDUC
R2
F-stat
B
0.296
-0.139
0.357
-0.004
0.010
0.086
12.422
t-value
2.058
-3.308
5.865
-2.520
1.039
Problems:
Predicted Values outside the 0,1
range
Residuals Statistics
Minimum
Std. Predicted
Value
-2.12109
Maximum
3.251766
Mean
Std.
Devia
tion
0
More:
The logistic distribution constrains the estimated
probabilities to lie between 0 and 1.
The estimated probability is:
p = 1/[1 + exp(- - X)]
if you let + X =0, then p = .50
as + X gets really big, p approaches 1
as + X gets really small, p approaches 0
Interpreting Coefficients
Since:
ln[p/(1-p)] = + X + e
The slope coefficient () is interpreted as the rate
of change in the "log odds" as X changes not very
useful.
Since:
[p/(1-p)] = exp( + X)
S.E.
-0.6593
1.5583
-0.0198
0.0501
0.916
0.2012
0.2874
0.008
0.0468
0.69
Wald
10.7323
29.3895
6.1238
1.1483
1.7624
Exp(B)
1/Exp(B)
-0.6593
1.5583
-0.0198
0.0501
-0.916
0.5172
4.7508
0.9804
1.0514
1.933
1.020
Model Chi-Square
The model likelihood ratio (LR), statistic is
LLR[i] = -2[LL() - LL(, ) ]
{Or, as you are reading SPSS printout:
LLR[i] = [-2LL (of beginning model)] - [-2LL (of ending model)]}
An Example:
Observed
0
1
Predicted
0
1
328
24
139
44
Overall
% Correct
93.18%
24.04%
69.53%
Pseudo-R2
One psuedo-R2 statistic is the McFadden's-R2 statistic:
McFadden's-R2 = 1 - [LL(,)/LL()]
{= 1 - [-2LL(, )/-2LL()] (from SPSS printout)}
where the R2 is a scalar measure which varies between
0 and (somewhat close to) 1 much like the R2 in a LP
model.
An Example:
Beginning -2 LL
Ending -2 LL
Ending/Beginning
2
McF. R = 1 - E./B.
687.36
641.84
0.9338
0.0662
McFaddens
LL( B )
1
LL(0)
2
Cox and Snell is also based on loglikelihood but it takes the sample size
into account:
2
CS
HL result (Sample)
Hosmer - Lemeshow Test
Chi-square
4.361
Df
Sig.
.823
Observed
27
Expected
26.991
25
Expected
.009
27
24.977
.023
25
23
22.948
.052
23
28
27.879
.121
28
25
25.717
.283
26
27
26.420
.580
27
24
23.815
1.185
25
22
22.933
3.067
26
20
17.814
8.186
26
5.505
22
20.495
26
10
Observed
Total
Variable Description
Diagnostics
Cook's. The logistic regression analog of Cook's
influence statistic. A measure of how much the
residuals of all cases would change if a particular case
were excluded from the calculation of the regression
coefficients.
Cook's distance measures the effect of deleting a
given observation. Data points with large residuals
(outliers) and/or high leverage may distort the
outcome and accuracy of a regression. Points with a
large Cook's distance are considered to merit closer
examination in the analysis.
Reccomendation of Thickol
Corp
A lamentable aspect of this disaster was that the problem with the Orings was already understood by some engineers prior to the
Challenger launch. In February of 1984, the Marshall Configuration
Control Board sent a memo about the O-ring erosion that occurred on
STS 41-B (the 10th space shuttle flight and the 4th mission for the
Challenger shuttle). These messages continued to increase in
intensity as evidenced by a 1985 internal memo from Thiokol Corp.,
the company that designed the O-ring. Employees from Thiokol wrote
the following to their Vice President of Engineering:
Data Description
n=23 Space Shuttle Lift-offs prior to
Challenger
Response: Presence/Absence of erosion or
blow-by on at least one O-Ring field joint
Y=1 if occurred, 0 if not
Data
O-Ring Problem
50
55
60
65
70
Temperature
Flight#
1
2
3
4
5
6
7
8
9
10
11
12
Temp
66
70
69
68
67
72
73
70
57
63
70
78
O-Ring Problem
0
1
0
0
0
0
0
0
1
1
1
0
Flight#
13
14
15
16
17
18
19
20
21
22
23
Temp
67
53
67
75
70
81
76
79
75
76
58
O-Ring Problem
0
1
0
0
0
0
0
0
1
0
1
75
80
85
( X ) X PY 1 X
(X )
0 1 X
g ( ) ln
1 ( X )
0 1 X
e
(X )
0 1 X
1 e
Model Estimation/Inference
Coefficients:
Estimate Std. Error z value Pr(>|z|)
(Intercept) 15.0429
7.3786 2.039
0.0415 *
degrees
-0.2322
0.1082 -2.145
0.0320 *
--Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Null deviance: 28.267 on 22 degrees of freedom
Residual deviance: 20.315 on 21 degrees of freedom
H0: No association between incidence of O-Ring Failure and Temperature (1 = 0)
HA: Association between incidence of O-Ring Failure and Temperature (1 0)
Reject H0 (zobs = -2.145, P=0.032), Conclude a negative association exists
15.04 0.23 X
e
X
15.04 0.23 X
1 e
^
0.9
0.8
0.7
0.6
O-Ring Problem
P(O-Ring Prob)
0.5
0.4
0.3
0.2
0.1
0
50
55
60
65
70
75
80
85
90
Odds Ratio
e 0 1 X
e 0 1 X
e 0 1 X
1 e 0 1 X
1 e 0 1 X
1 e 0 1 X
X
odds X
1
1 X
e 0 1 X 1 e 0 1 X e 0 1 X
1
1 e 0 1 X
1 e 0 1 X
1 e 0 1 X
odds( X 1) e 0 1 ( X 1)
e 0 1 X
odds( X 1) e 0 1 ( X 1) e 0 1 X e 1
Odds Ratio : OR
0 1 X 0 1 X e 1
odds( X )
e
e
^
^ 1 1.96 SE ^ 1 ^ 1 1.96 SE ^ 1
95% CI for Population Odds Ratio : e
,e
Challenger Data :
^
OR e
e 0.23 0.795
^ 1 1.96 SE ^ 1 ^ 1 1.96 SE ^ 1
e
,e
e 0.231.96(.1082) , e 0.231.96(.1082) e .4421 , e .0179 .6427,.9823