Professional Documents
Culture Documents
REG RESSIO N
Subtitle
CO RRELATIO N
CORRELATION COEFFICIENT
It is a mathematical index that describes the direction & magnitude
of a relationship.
Positive
High scores on X associated with High scores on Y
Negative
High scores on X associated with Low scores on Y
No Correlation
High or Low scores are not associated
SCATTERPLO TS
REG RESSIO N
Using correlation to assess the magnitude and direction of a
relationship
Used to make predictions about scores on one variable from the
knowledge of scores on another variable
Term first used in 1885 by Sir Francis Galton
Observed Regression toward Mediocrity or Regression toward the
mean meaning scores tend to regress towards the mean on repeated
occasions
Karl Pearson developed first statistical models of correlation and
regression (late 19th century)
O TH ER CO N CEPTS
Residual
Standard Error of Estimate (SEE)
Coefficient of Determination
Coefficient of Alienation
Shrinkage
Cross Validation
The Correlation-Causation Problem
Third Variable Explanation
Restricted Range
Multivariate Analysis
RESID U AL
Definition: the difference between the predicted and the
observed values
Symbolically defined as Y - Y
The SUM of the residuals must always equals 0
The SUM of the squared residuals should be the
smallest value according to the principle of least squares.
SH RIN KAG E
Defined as the amount of decrease observed when a
regression equation is created for one population and
then applied to another
The consequence of using a prediction created from
group A and then applied to group B
A tendency to overestimate the relationship between
variables, particularly if the sample subjects are small
RESTRICTED RAN G E
Sometimes it is extremely difficult to demonstrate the
relationship between two things even though a true
underlying relationship may exist.
Correlations require variability, if variability is restricted,
then significant correlations are difficult to find.
M U TLIVARIATE AN ALYSIS
Considers relationship among combinations of three of
more variables
Multiple regression (interval data)
Discriminant Analysis (nominal/categorical data)
Factor Analysis (study of interrelationships among set of
variable without reference to a criterion; involves
reduction of large variable to small number)
CORRELATIONAL
STATISTICS
DESCRIBING
RELATIONSHIP BETWEEN
TWO VARIABLES
INFERENTIAL
STATISTICS
PARAMETRIC AND NON
PARAMETRIC TESTS