Professional Documents
Culture Documents
What Is Regression
Regression is a measure of relation between a dependent variable and a set of independent variables which affect the value of the dependent variable.
Types of Regression
Linear :
Assumes linear relationship between variables Simple When on independent variable is used to predict the value of the dependent variable. When there are many independent variables used to predict the value of the dependant variable.
Multiple
Non linear :
When the relationship is non linear
Drop variables that are unlikely to affect value of dependent variable. Several models are available for eliminating variables from a regression analysis.
Eliminating independent variables having a low correlation to the dependent variable. Stepwise regression
Starting with the independent variable with the highest predictive value. And entering variables one by one examining at each stage, the improvement
At each stage all variables in the equation are examined to check if they are
needed. And if at any stage they are found superfluous they are dropped.
Forward selection Similar to stepwise regression except that no variable is dropped once it is entered into the equation. Backward elimination Using all independent variables and eliminating variables that contribute the least, one by one.
4
Model summary
R2 0.693 Adjusted R2 0.694
Interpretation
The F statistics is a measure of whether any relationship exists between the dependent and independent variable.
Interpretation
Constant to be used in the regression equation There is a B value for each independent variable. It is the coefficient of each independent variable in the equation A unit change in the independent variable can cause B units of change in the dependent variable, if all other independent variables are constant It is the standard error of the coefficient B. It is the normalised value of B. And removes the effect of the scale differences in the independent variables. It is a measure of relative importance because it indicates the expected change in the dependent variable per unit change in the independent variable.
Standard error
Significance of t
If t is not significant, then the independent variable is not a good predictor. And should be removed from the analysis.
Applications Of Regression
0.593 0.794
43% 57%
Both cleanliness and duration of billing are important contributors to overall satisfaction with the store. Duration of billing is a relatively more important contributor.
9
Forecasting
The regression equation can be used to predict the value of the dependent variable when the independent variable values are known. Y = a+b1x1+b2x2+b3x3+ Data available
Awareness for brand A during the period of a campaign. GRPs in TV for the ad campaign. What are the likely levels of awareness of brand A during the next campaign, for which estimates of GRP are available.
Can predict
10
If the regression equation was obtained for the awareness of a brand vis--vis GRPs for a market leader, it cannot be extrapolated for a minority brand.
11
DISCRIMINANT ANALYSIS
13
Selection Process for a job, Admission process of an educational program Dividing a group in potential buyer & non- buyer high risk low risk Y = a + k1x1+ k2x2 K1 and K2 should maximise the separation between two groups
14
15
Percent Correct/ Wrong Column 94.44% Model has correctly classified 94.44% of the cases Level of accuracy may not hold true for future predictions.. But is a good pointer towards model being a Good One
16
17
Standardized Coefficients indicates relative importance of the variables Means of Canonical Variables Computed based on Raw co-efficient table Right side of Mid Point is Group 2 Left Side of Mid Point is Group 1
18
Case Study
A Business School selects its students every year through a written test, interview and group discussion. It then tracks the performance of students during the two year program by means of GPA. A GPA above 2.75 /4.0 is defined as Successful and below as Unsuccessful students.
Can you develop a model that predicts whether a student would be potentially successful or not.
19
How good is the model? Statistical Significance of the model Predictors Classification of new Student
20