You are on page 1of 17

Advanced Data Analytics

Pricing of IPL Players

Submitted by
Ajitesh Agarwal E003
Abhishek Batra C011
Prabhjit Singh Walia B063
Ishan Katyal E024
Anusha Aggarwal I003

Page 1
SUMMARY

The current case talks about the pricing of the players in IPL. IPL being the Indian premiere league is based
on giving exposure to the Indian cricket players. This is T20 format game where players are classified into
baskets where each basket has a minimum base price. Players are not only weighed on their cricket skills
but also marketing skills. The case also talks about the player being good in ODI/Test format doesnt
guarantee that he will be good in T20 too.

Approach

We first modified the modified raw data in the case to Include following more columns.

1) BOW*SR
2) BAT*ECO
3) BAT*SRBL
4) BOW*RUNS-S
5) ALL*RUNS-S
6) BATS*WKI
7) ALL*WKI
8) BOW*TRUNS
9) ALL*TRUNS
10) BOW*ODI RUNS
11) ALL*ODI RUNS
12) BATS*Wk-O
13) ALL*WK-O

This was done in order to incorporate the dummy variables Cricketer, Bowler and All-Rounder in our data.
Our data is taking into account the contribution made by Batsman, bowler and the all-rounder.

1. Linear Regression

The data was entered into SPSS where we performed first regression analysis, taking majority of
the variables as independent variables and SQRT(S-B) as dependent variable

Model Summary

Adjusted R Std. Error of the


Model R R Square Square Estimate

1 .781a .610 .429 260.395

a. Predictors: (Constant), Year, BOW*SR, ODI-SR-BL, MTS, B25-35,


All*T-runs, AUSTRALIA, BATS*ECO, SR -B, BATS*WKT, CAPTAINCY
EXP, ODI-SR-B, T-WKTS, BATS*WKI, AVE-BL, ALL*WKI, L25,
BOW*RUNS, bow*ODIR, INDIA, SIXERS, BAT*T-RUNS, ALL*WKI,
BATS*SR-B, AVE, ALL*RUNS, BOW*WK-I, ALL, HS , BOW*SR-BL,
All*ODI, BOW*T-RUNS, BOW*ECO, BAT*ODI-RUNS, BOW*WK-O,
ALL*ECON, BAT*RUN-S, RUNS-C, ALL*SR-B, ALL*SR-BL, BOW

Page 2
The model has a R-Square of 0.610. This is average R-Square and can be improved by applying various
other statistical techniques.

2. Linear Regression without dummy variables

Model Summaryb

Adjusted R Std. Error of the


Model R R Square Square Estimate

1 .762a .580 .430 260.170

a. Predictors: (Constant), Year, BOW*SR, ODI-SR-BL, MTS, SR -B,


All*T-runs, ECON, BATS*SR-B, BAT*T-RUNS, BOW*T-RUNS, ODI-
SR-B, ALL*WKI, BOW*RUNS, WKTS, ODI-WKTS, BATS*ECO,
SIXERS, AVE, ALL*RUNS, BOW*SR-BL, BOW*WK-I, ALL*WKI, T-
WKTS, HS , ALL*SR-B, ODI-RUNS, ALL*SR-BL, All*ODI, bow*ODIR,
BAT*RUN-S, BOW*ECO, RUNS-C, AVE-BL, BOW*WK-O
b. Dependent Variable: SQRT(S-B)

In this we removed all the dummy variables like batsman, all-rounder, bowler, country etc.
Though the R-Square got reduced but it laid stone for outlier analysis. We selected Cook
distance in regression analysis and eliminated outliers using Outlier Analysis.
3. Outlier Analysis

Model Summaryb

Adjusted R Std. Error of the


Model R R Square Square Estimate

1 .827a .684 .557 224.173

a. Predictors: (Constant), Year, BAT*RUN-S, ODI-SR-BL, ALL*RUNS,


BOW*T-RUNS, WKTS, ODI-SR-B, ALL*WKI, ECON, SR -B, BATS*SR-
B, T-RUNS, BOW*RUNS, BOW*SR-BL, ALL*WKI, T-WKTS,
BATS*ECO, AVE, All*T-runs, bow*ODIR, HS , ALL*SR-B, All*ODI,
SIXERS, MTS, BOW*ECO, BAT*ODI-RUNS, BOW*SR, BOW*WK-O,
AVE-BL, RUNS-C, ODI-WKTS, ALL*SR-BL, BOW*WK-I
b. Dependent Variable: SQRT(S-B)

Model Summaryb

Adjusted R Std. Error of the


Model R R Square Square Estimate
1 .854a .729 .613 205.792

Page 3
a. Predictors: (Constant), Year, BAT*RUN-S, ODI-SR-BL, ALL*RUNS,
BOW*T-RUNS, All*T-runs, SR -B, WKTS, BATS*SR-B, ODI-SR-B, T-
RUNS, BOW*RUNS, ECON, BOW*SR-BL, ALL*WKI, T-WKTS, AVE,
BATS*ECO, All*ODI, bow*ODIR, ALL*WKI, SIXERS, ALL*ECON, HS ,
MTS, BAT*ODI-RUNS, ALL*SR-B, BOW*SR, BOW*WK-O, AVE-BL,
RUNS-C, ALL*SR-BL, ODI-WKTS, BOW*WK-I
b. Dependent Variable: SQRT(S-B)

Model Summaryb

Adjusted R Std. Error of the


Model R R Square Square Estimate

1 .913a .834 .750 158.922

a. Predictors: (Constant), Year, MTS, BOW*SR, ODI-SR-BL, SR -B,


BATS*ECO, All*T-runs, ODI-SR-B, ALL*RUNS, BOW*T-RUNS, T-
RUNS, BOW*RUNS, BATS*SR-B, ALL*WKI, All*ODI, AVE-BL,
bow*ODIR, BOW*WK-I, AVE, T-WKTS, SIXERS, ALL*ECON, HS ,
ALL*WKI, BOW*ECO, ODI-RUNS, ALL*SR-B, BOW*WK-O, RUNS-S,
RUNS-C, BOW*SR-BL, ALL*SR-BL, WKTS, ODI-WKTS
b. Dependent Variable: SQRT(S-B)

Model Summaryb

Adjusted R Std. Error of the


Model R R Square Square Estimate

1 .922a .850 .771 153.969

a. Predictors: (Constant), Year, ALL*RUNS, ODI-SR-BL, BATS*ECO,


bow*ODIR, All*T-runs, SR -B, MTS, ODI-SR-B, T-RUNS, T-WKTS,
BOW*RUNS, AVE-BL, ALL*WKI, All*ODI, BATS*SR-B, BOW*WK-I,
AVE, ALL*ECON, SIXERS, BOW*T-RUNS, HS , BOW*ECO, ALL*WKI,
BOW*WK-O, BAT*SR, ODI-RUNS, RUNS-S, ALL*SR-B, RUNS-C,
BOW*SR-BL, ALL*SR-BL, WKTS, ODI-WKTS
b. Dependent Variable: SQRT(S-B)

ANOVAa

Model Sum of Squares Df Mean Square F Sig.

1 Regression 8723821.023 34 256582.971 10.823 .000b

Residual 1540916.597 65 23706.409


Total 10264737.621 99

Page 4
a. Dependent Variable: SQRT(S-B)
b. Predictors: (Constant), Year, ALL*RUNS, ODI-SR-BL, BATS*ECO, bow*ODIR, All*T-runs, SR -
B, MTS, ODI-SR-B, T-RUNS, T-WKTS, BOW*RUNS, AVE-BL, ALL*WKI, All*ODI, BATS*SR-B,
BOW*WK-I, AVE, ALL*ECON, SIXERS, BOW*T-RUNS, HS , BOW*ECO, ALL*WKI, BOW*WK-O,
BAT*SR, ODI-RUNS, RUNS-S, ALL*SR-B, RUNS-C, BOW*SR-BL, ALL*SR-BL, WKTS, ODI-
WKTS

The anova is rejected here suggesting that the null hypothesis is rejected. We now check for
tolerance/VIF. If VIF>5, there exists collinearity
Coefficientsa

Unstandardized Standardized
Coefficients Coefficients Collinearity Statistics
Model B Std. Error Beta t Sig. Tolerance VIF
1 (Constant)
######## 30544.661 -4.000 .000

BAT*SR -1.169 1.479 -.219 -.791 .432 .030 33.313


BOW*ECO
-8.813 16.278 -.107 -.541 .590 .059 16.865

BOW*SR-
BL -2.550 9.350 -.096 -.273 .786 .019 53.442

BOW*WK-I
11.489 11.198 .778 1.026 .309 .004 249.183

BOW*WK-O
5.536 6.029 1.408 .918 .362 .001 1017.570

T-RUNS
-.014 .023 -.124 -.608 .545 .055 18.159

T-WKTS
-2.420 .647 -.766 -3.740 .000 .055 18.171

ODI-RUNS
.027 .024 .243 1.116 .269 .049 20.546

ODI-SR-B
1.267 .809 .099 1.566 .122 .583 1.716

ODI-WKTS
-3.499 6.031 -1.053 -.580 .564 .001 1426.374

ODI-SR-BL
-.937 .995 -.070 -.941 .350 .414 2.416

MTS 12.109 2.986 .895 4.055 .000 .047 21.096


ALL*SR-B -4.145 1.589 -.685 -2.608 .011 .034 29.837
ALL*SR-BL
-13.427 13.527 -.446 -.993 .325 .011 87.290

ALL*ECON
57.357 37.892 .612 1.514 .135 .014 70.886

RUNS-S -.228 .163 -.401 -1.394 .168 .028 35.836


HS -5.733 1.619 -.607 -3.541 .001 .079 12.725

Page 5
AVE 16.339 3.874 .547 4.218 .000 .137 7.284
SR B -.668 .634 -.076 -1.054 .296 .449 2.225
SIXERS 5.136 2.241 .331 2.292 .025 .111 9.005
RUNS-C .416 .187 .720 2.220 .030 .022 45.482
WKTS -21.750 11.593 -1.493 -1.876 .065 .004 274.142
AVE-BL -.028 6.340 -.002 -.004 .997 .019 52.986
BATS*ECO
18.530 8.830 .162 2.098 .040 .389 2.572

BATS*SR-B
-.464 11.293 -.009 -.041 .967 .050 20.086

BOW*RUNS
-1.613 .644 -.231 -2.506 .015 .273 3.665

ALL*RUNS
.540 .211 .371 2.556 .013 .110 9.125

ALL*WKI 2.754 10.815 .094 .255 .800 .017 59.239


BOW*T-
RUNS .759 .200 .577 3.786 .000 .099 10.062

All*T-runs .191 .060 .471 3.200 .002 .107 9.368


bow*ODIR
-.702 .246 -.450 -2.853 .006 .093 10.752

All*ODI -.026 .046 -.093 -.556 .580 .083 12.013


ALL*WKI 6.233 6.060 1.314 1.029 .307 .001 706.397
Year 60.861 15.195 .263 4.005 .000 .538 1.860
a. Dependent Variable: SQRT(S-B)

The variables highlighted in RED do not exhibit multi-collinearity. Rest all variables are passed
through PCA

Page 6
4. Principal Component Analysis (1st time)
Communalities

Initial Extraction

BAT*SR 1.000 .908


BOW*ECO 1.000 .841
BOW*SR-BL 1.000 .752
BOW*WK-I 1.000 .894
BOW*WK-O 1.000 .940
T-RUNS 1.000 .766
T-WKTS 1.000 .753
ODI-RUNS 1.000 .769
ODI-WKTS 1.000 .941
MTS 1.000 .885
ALL*SR-B 1.000 .940
ALL*SR-BL 1.000 .913
ALL*ECON 1.000 .916
RUNS-S 1.000 .906
HS 1.000 .865
AVE 1.000 .786
SIXERS 1.000 .857
RUNS-C 1.000 .952
WKTS 1.000 .950
AVE-BL 1.000 .747
BATS*SR-B 1.000 .677
ALL*RUNS 1.000 .823
ALL*WKI 1.000 .710
BOW*T-RUNS 1.000 .863
All*T-runs 1.000 .827
bow*ODIR 1.000 .744
All*ODI 1.000 .769
ALL*WKI 1.000 .911

Extraction Method: Principal Component


Analysis.

Here the communalities > 0.5. So there is no issue in this table.

Page 7
Total Variance Explained

Initial Eigenvalues Extraction Sums of Squared Loadings

Component Total % of Variance Cumulative % Total % of Variance Cumulative %

1 8.457 30.203 30.203 8.457 30.203 30.203


2 6.392 22.828 53.031 6.392 22.828 53.031
3 3.207 11.454 64.486 3.207 11.454 64.486
4 2.584 9.227 73.713 2.584 9.227 73.713
5 1.668 5.958 79.671 1.668 5.958 79.671
6 1.296 4.630 84.301 1.296 4.630 84.301
7 .862 3.077 87.377
8 .665 2.375 89.752
9 .578 2.066 91.818
10 .518 1.851 93.670
11 .401 1.433 95.102
12 .307 1.097 96.199
13 .242 .864 97.063
14 .185 .659 97.722
15 .142 .507 98.229
16 .111 .397 98.626
17 .091 .324 98.950
18 .082 .291 99.241
19 .074 .265 99.506
20 .031 .111 99.617
21 .028 .101 99.718
22 .024 .086 99.803
23 .019 .068 99.872
24 .017 .061 99.933
25 .010 .034 99.967
26 .007 .025 99.992
27 .002 .007 99.999
28 .000 .001 100.000

Extraction Method: Principal Component Analysis.

This table shows that total variance is explained by mostly 6 variables

Page 8
KMO and Bartlett's Test

Kaiser-Meyer-Olkin Measure of Sampling Adequacy. .690


Bartlett's Test of Sphericity Approx. Chi-Square 4825.925

Df 378

Sig. .000

KMO Test of 0.690 suggests that there exists multi-collinearity

Anti Image Correlation

Page 9
Anti-image BAT*SR a
.812 .586 .348 .163 -.074 .082 -.072 -.059 .088 -.105 .535 .067 .210 .163 -.264 -.287 -.156 -.102 -.068 -.190 .213 .232 .100 .014 .303 -.036 -.036 -.112
Correlatio BOW*ECO
n .586 .860a .143 -.032 -.122 -.054 .057 .161 .111 .275 .391 .073 .208 -.203 .058 .009 -.140 -.161 .031 -.213 .215 .070 -.017 .119 .177 -.123 -.072 -.136
BOW*SR-
.348 .143 .668a .302 .009 .006 -.241 .009 .003 -.043 -.029 .560 .277 .091 -.083 -.128 -.087 -.102 -.269 -.919 .869 .094 .291 .073 .295 -.034 -.023 -.017
BL
BOW*WK-I a
.163 -.032 .302 .604 -.191 -.009 -.058 -.083 .192 -.453 .041 .160 .075 .426 -.275 -.035 .112 .198 -.950 -.289 .496 -.003 .942 -.007 .111 .035 .017 -.195
BOW*WK-
-.074 -.122 .009 -.191 .673a .111 -.039 -.005 -.991 -.123 -.081 .028 -.015 .064 -.024 -.001 .012 .172 .152 -.042 .040 .006 -.211 -.026 -.058 -.045 -.013 .988
O
T-RUNS .082 -.054 .006 -.009 .111 .609a -.060 -.932 -.104 -.082 .108 -.018 -.012 .158 -.324 -.121 .326 .071 .008 .017 -.054 -.059 -.038 -.094 -.287 .076 .473 .097

less than 0.5.


T-WKTS a
-.072 .057 -.241 -.058 -.039 -.060 .614 .051 -.066 .013 .240 -.251 -.020 -.016 .124 -.064 -.035 -.019 .076 .193 -.176 -.049 -.059 -.716 -.490 .769 .415 -.022
ODI-
-.059 .161 .009 -.083 -.005 -.932 .051 .626a .000 .219 -.053 .017 .036 -.285 .347 .133 -.301 -.153 .086 -.022 .073 .066 -.045 .102 .291 -.106 -.508 .003
RUNS
ODI-WKTS
.088 .111 .003 .192 -.991 -.104 -.066 .000 .644a .127 .065 -.013 .016 -.065 .014 .005 -.011 -.161 -.156 .038 -.037 -.002 .210 .071 .110 -.059 -.034 -.988
MTS -.105 .275 -.043 -.453 -.123 -.082 .013 .219 .127 .529a -.003 .009 .029 -.858 .351 .085 .019 -.497 .403 .011 -.051 -.049 -.337 .064 .017 -.088 -.033 -.133
ALL*SR-B .535 .391 -.029 .041 -.081 .108 .240 -.053 .065 -.003 .854a -.025 -.304 .095 -.247 -.072 -.060 -.040 .008 .111 -.071 -.142 .010 -.189 -.095 .156 .147 -.117
ALL*SR-
.067 .073 .560 .160 .028 -.018 -.251 .017 -.013 .009 -.025 .724a -.515 .035 -.095 .057 -.077 .002 -.177 -.603 .557 -.499 .300 .170 -.081 -.125 .162 -.004
BL
ALL*ECO a
.210 .208 .277 .075 -.015 -.012 -.020 .036 .016 .029 -.304 -.515 .780 -.040 .124 -.168 -.009 -.031 -.069 -.261 .248 .617 -.081 .005 .389 -.012 -.285 .004
N
RUNS-S a
.163 -.203 .091 .426 .064 .158 -.016 -.285 -.065 -.858 .095 .035 -.040 .710 -.440 -.154 -.327 .436 -.382 -.051 .088 .028 .327 -.077 -.022 .100 .107 .064
HS a
-.264 .058 -.083 -.275 -.024 -.324 .124 .347 .014 .351 -.247 -.095 .124 -.440 .812 -.388 -.174 -.222 .256 .075 -.094 -.031 -.216 -.036 .030 -.005 -.212 -.005
AVE a
-.287 .009 -.128 -.035 -.001 -.121 -.064 .133 .005 .085 -.072 .057 -.168 -.154 -.388 .917 .091 .023 .007 .029 -.046 -.122 -.006 .138 -.122 -.139 -.031 .012
SIXERS -.156 -.140 -.087 .112 .012 .326 -.035 -.301 -.011 .019 -.060 -.077 -.009 -.327 -.174 .091 .893a -.033 -.105 .082 -.098 -.112 .102 -.001 -.080 .016 .135 .014
RUNS-C -.102 -.161 -.102 .198 .172 .071 -.019 -.153 -.161 -.497 -.040 .002 -.031 .436 -.222 .023 -.033 .810a -.428 -.077 .038 -.081 .093 -.029 -.106 -.125 .054 .168
WKTS -.068 .031 -.269 -.950 .152 .008 .076 .086 -.156 .403 .008 -.177 -.069 -.382 .256 .007 -.105 -.428 .570a .331 -.509 .047 -.893 -.021 -.077 .038 -.024 .157
AVE-BL a
-.190 -.213 -.919 -.289 -.042 .017 .193 -.022 .038 .011 .111 -.603 -.261 -.051 .075 .029 .082 -.077 .331 .519 -.923 -.039 -.279 -.112 -.220 .055 -.018 -.022
BATS*SR-
.213 .215 .869 .496 .040 -.054 -.176 .073 -.037 -.051 -.071 .557 .248 .088 -.094 -.046 -.098 .038 -.509 -.923 .271a .061 .486 .102 .235 -.040 -.015 .021
B
ALL*RUN
.232 .070 .094 -.003 .006 -.059 -.049 .066 -.002 -.049 -.142 -.499 .617 .028 -.031 -.122 -.112 -.081 .047 -.039 .061 .738a -.219 .029 .339 -.008 -.386 .046
S
ALL*WKI .100 -.017 .291 .942 -.211 -.038 -.059 -.045 .210 -.337 .010 .300 -.081 .327 -.216 -.006 .102 .093 -.893 -.279 .486 -.219 .503a .019 .079 .029 .125 -.231
BOW*T-
.014 .119 .073 -.007 -.026 -.094 -.716 .102 .071 .064 -.189 .170 .005 -.077 -.036 .138 -.001 -.029 -.021 -.112 .102 .029 .019 .739a .379 -.686 -.370 -.004
RUNS
All*T-runs a
.303 .177 .295 .111 -.058 -.287 -.490 .291 .110 .017 -.095 -.081 .389 -.022 .030 -.122 -.080 -.106 -.077 -.220 .235 .339 .079 .379 .661 -.394 -.412 -.113
bow*ODIR a
-.036 -.123 -.034 .035 -.045 .076 .769 -.106 -.059 -.088 .156 -.125 -.012 .100 -.005 -.139 .016 -.125 .038 .055 -.040 -.008 .029 -.686 -.394 .671 .416 -.015
All*ODI -.036 -.072 -.023 .017 -.013 .473 .415 -.508 -.034 -.033 .147 .162 -.285 .107 -.212 -.031 .135 .054 -.024 -.018 -.015 -.386 .125 -.370 -.412 .416 .693a -.073
ALL*WKI a
-.112 -.136 -.017 -.195 .988 .097 -.022 .003 -.988 -.133 -.117 -.004 .004 .064 -.005 .012 .014 .168 .157 -.022 .021 .046 -.231 -.004 -.113 -.015 -.073 .627

The above image has the anti-correlation matrix. In this variables are removed whose values is

Page 10
5. Principal Component Analysis (n times)
Again PCA is repeated with the variables removed several times.
Rotated Component Matrixa

Component
1 2 3 4 5 6
BAT*SR .619 -.389 -.261 -.340 -.175 -.398
BOW*ECO
-.493 -.332 .389 .411 -.171 .373
BOW*SR-BL
-.407 -.327 .375 .380 -.117 .429
BOW*WK-I
-.274 -.260 .165 .831 -.077 .162
BOW*WK-O
-.200 -.135 .919 .108 -.111 .117
T-RUNS .079 -.094 -.050 -.164 .074 -.899
T-WKTS -.152 -.063 .715 .074 .462 .041
ODI-RUNS
.190 -.026 -.087 -.172 .138 -.859
ODI-WKTS
-.183 .120 .777 .093 .530 .034
MTS .738 -.066 -.043 .587 -.095 .084
ALL*SR-B -.040 .852 -.122 -.114 .416 .102
ALL*SR-BL
-.024 .827 -.108 -.147 .414 .149
ALL*ECON
-.059 .846 -.128 -.145 .378 .138
RUNS-S .938 -.024 -.147 -.079 -.069 -.115
HS .855 .122 -.188 -.205 -.024 -.218
AVE .794 .131 -.191 -.284 .028 -.172
SIXERS .915 .067 -.121 -.056 -.057 -.038
RUNS-C -.111 .182 .194 .912 .003 .216
WKTS -.152 .122 .140 .937 -.010 .156
AVE-BL -.182 .313 .170 .132 .219 .645
ALL*RUNS
.192 .878 -.039 .089 .028 .081
ALL*WKI .111 .800 -.029 .240 .129 -.053
BOW*T-RUNS
-.148 -.122 .882 .172 -.098 .105
All*T-runs -.055 .235 -.010 -.062 .885 -.043
bow*ODIR
-.126 -.062 .826 .106 -.169 .084
All*ODI -.014 .398 -.032 -.064 .782 -.025
ALL*WKI -.029 .340 .006 .007 .895 -.074
Extraction Method: Principal Component Analysis.
Rotation Method: Varimax with Kaiser Normalization.
a. Rotation converged in 7 iterations.

Page 11
After applying varimax rotation, we say that a variable is loading on more than two factors as
there are multiple values >0.4. We remove these variables.
Again PCA is applied after removing variables.

KMO and Bartlett's Test

Kaiser-Meyer-Olkin Measure of Sampling Adequacy. .713


Bartlett's Test of Sphericity Approx. Chi-Square 4579.410

df 351

Sig. .000

Again PCA is applied and we check

Component Score Coefficient Matrix

Component

1 2 3 4 5

BAT*SR -.091 .103 -.023 -.024 .101


BOW*WK-I -.062 -.031 .336 -.050 .124
BOW*WK-O .025 .051 -.066 .383 .005
T-RUNS .069 -.030 .086 .036 .408
ODI-RUNS .092 .004 .081 .050 .390
ALL*ECON .180 .012 -.079 -.021 -.136
RUNS-S -.022 .262 .055 .055 -.014
HS .015 .244 .030 .054 .008
AVE .024 .218 -.019 .052 -.017
SIXERS -.010 .275 .058 .062 -.052
RUNS-C .021 .067 .372 -.020 .052
WKTS .002 .041 .400 -.062 .085
AVE-BL .077 .013 -.075 .069 -.235
ALL*RUNS .127 .146 .047 .008 -.154
ALL*WKI .137 .108 .134 -.032 -.056
BOW*T-RUNS .032 .067 -.027 .379 .026
All*T-runs .220 -.068 -.021 .033 .131
bow*ODIR .040 .082 -.059 .401 .015
All*ODI .232 -.027 -.026 .054 .091
ALL*WKI .240 -.047 .014 .035 .138

Extraction Method: Principal Component Analysis.


Rotation Method: Varimax with Kaiser Normalization.
Component Scores.

Page 12
After doing PCA, all the variables which were removed during factor analysis are added back.
The list of independent variables include
I. Variables excluded during PCA
II. Variables removed after PCA
III. Components generated in PCA
IV. Dummy variables

The dependent variables include SQRT(S-B)

The Final Model after linear aggression is

Model Summaryb

Adjusted R Std. Error of the


Model R R Square Square Estimate Durbin-Watson

1 .903a .815 .714 172.095 1.990

a. Predictors: (Constant), BATS*WKI, MTS, L25, REGR factor score 4 for analysis 7,
ODI-SR-B, AUSTRALIA, ECON, SR -B, ALL*WKI, BOW*RUNS, REGR factor score 2 for
analysis 7, A35, ODI-SR-BL, BATS*SR-B, Year, CAPTAINCY EXP, T-WKTS,
BATS*WKT, BOW*SR-BL, REGR factor score 5 for analysis 7, INDIA, T-RUNS, REGR
factor score 3 for analysis 7, BATS*ECO, REGR factor score 1 for analysis 7, ALL*SR-
B, BOW*SR, BAT*ODI-RUNS, ODI-WKTS, BAT*RUN-S, BOW*ECO, ALL*SR-BL, ALL,
BOW, BAT*T-RUNS
b. Dependent Variable: SQRT(S-B)

R-Square of 0.815

ANOVAa

Model Sum of Squares df Mean Square F Sig.

1 Regression 8369266.427 35 239121.898 8.074 .000b

Residual 1895471.194 64 29616.737

Total 10264737.621 99

a. Dependent Variable: SQRT(S-B)

Page 13
b. Predictors: (Constant), BATS*WKI, MTS, L25, REGR factor score 4 for analysis 7, ODI-SR-B,
AUSTRALIA, ECON, SR -B, ALL*WKI, BOW*RUNS, REGR factor score 2 for analysis 7, A35,
ODI-SR-BL, BATS*SR-B, Year, CAPTAINCY EXP, T-WKTS, BATS*WKT, BOW*SR-BL, REGR
factor score 5 for analysis 7, INDIA, T-RUNS, REGR factor score 3 for analysis 7, BATS*ECO,
REGR factor score 1 for analysis 7, ALL*SR-B, BOW*SR, BAT*ODI-RUNS, ODI-WKTS,
BAT*RUN-S, BOW*ECO, ALL*SR-BL, ALL, BOW, BAT*T-RUNS

Anova is rejected

On close observation we found that the tolerance level for some variables is still more than 5. So
we applied Step wise regression

Model Summaryh

Adjusted R Std. Error of the


Model R R Square Square Estimate Durbin-Watson

1 .691a .478 .472 233.879


2 .758b .574 .565 212.265
3 .794c .630 .618 198.966
4 .810d .656 .641 192.876
5 .820e .672 .655 189.210
f
6 .829 .687 .667 185.935
7 .837g .700 .677 182.970 1.852

a. Predictors: (Constant), MTS


b. Predictors: (Constant), MTS, REGR factor score 2 for analysis 7
c. Predictors: (Constant), MTS, REGR factor score 2 for analysis 7, L25
d. Predictors: (Constant), MTS, REGR factor score 2 for analysis 7, L25, Year
e. Predictors: (Constant), MTS, REGR factor score 2 for analysis 7, L25, Year,
BATS*ECO
f. Predictors: (Constant), MTS, REGR factor score 2 for analysis 7, L25, Year,
BATS*ECO, OTHERS
g. Predictors: (Constant), MTS, REGR factor score 2 for analysis 7, L25, Year,
BATS*ECO, OTHERS, ODI-SR-BL
h. Dependent Variable: SQRT(S-B)

Page 14
ANOVAa

Model Sum of Squares df Mean Square F Sig.

1 Regression 4904200.344 1 4904200.344 89.657 .000b

Residual 5360537.277 98 54699.360

Total 10264737.621 99
2 Regression 5894270.584 2 2947135.292 65.410 .000c
Residual 4370467.037 97 45056.361
Total 10264737.621 99
3 Regression 6464348.355 3 2154782.785 54.431 .000d
Residual 3800389.265 96 39587.388
Total 10264737.621 99
4 Regression 6730623.874 4 1682655.969 45.231 .000e
Residual 3534113.746 95 37201.197
Total 10264737.621 99
5 Regression 6899498.050 5 1379899.610 38.544 .000f
Residual 3365239.570 94 35800.421
Total 10264737.621 99
6 Regression 7049546.454 6 1174924.409 33.985 .000g
Residual 3215191.166 93 34571.948
Total 10264737.621 99
7 Regression 7184763.344 7 1026394.763 30.659 .000h

Residual 3079974.276 92 33477.981

Total 10264737.621 99

a. Dependent Variable: SQRT(S-B)


b. Predictors: (Constant), MTS
c. Predictors: (Constant), MTS, REGR factor score 2 for analysis 7
d. Predictors: (Constant), MTS, REGR factor score 2 for analysis 7, L25
e. Predictors: (Constant), MTS, REGR factor score 2 for analysis 7, L25, Year
f. Predictors: (Constant), MTS, REGR factor score 2 for analysis 7, L25, Year, BATS*ECO
g. Predictors: (Constant), MTS, REGR factor score 2 for analysis 7, L25, Year, BATS*ECO,
OTHERS
h. Predictors: (Constant), MTS, REGR factor score 2 for analysis 7, L25, Year, BATS*ECO,
OTHERS, ODI-SR-BL

Page 15
Page 16
Page 17

You might also like