
Machine Learning: Logistic Regression

Jeff Howbert

Introduction to Machine Learning

Winter 2012

Logistic regression
The name is somewhat misleading: logistic regression is really a technique for classification, not regression. "Regression" comes from the fact that we fit a linear model to the feature space. It involves a more probabilistic view of classification.


Different ways of expressing probability

Consider a two-outcome probability space, where p( O1 ) = p and p( O2 ) = 1 - p = q. The probability of O1 can be expressed in three equivalent notations:

                         notation         range equivalents
  standard probability   p                0      0.5    1
  odds                   p / q            0      1      +∞
  log odds (logit)       log( p / q )     −∞     0      +∞
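A minimal MATLAB sketch of the three notations for a single value of p (the value 0.8 is an arbitrary example, not from the slides):

    % Express the probability of O1 in all three notations.
    p = 0.8;               % standard probability of O1
    q = 1 - p;             % probability of O2
    odds    = p / q;       % odds = 4
    logodds = log(p / q);  % log odds (logit), ~1.386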


Log odds
Numeric treatment of outcomes O1 and O2 is equivalent: if neither outcome is favored over the other, the log odds = 0; if one outcome is favored with log odds = x, then the other outcome is disfavored with log odds = -x. Log odds are especially useful in domains where relative probabilities can be minuscule. Example: multiple sequence alignment in computational biology.


From probability to log odds (and back again)

z = log( p / (1 - p) )                          (logit function)

p / (1 - p) = e^z

p = e^z / (1 + e^z) = 1 / (1 + e^(-z))          (logistic function)
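A minimal MATLAB sketch of the round trip between the two functions (the probability 0.75 is an arbitrary example):

    % logit maps a probability in (0,1) to a real number;
    % the logistic function inverts the mapping.
    logit    = @(p) log(p ./ (1 - p));
    logistic = @(z) 1 ./ (1 + exp(-z));

    p = 0.75;          % arbitrary example probability
    z = logit(p);      % ~1.0986
    logistic(z)        % recovers 0.75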


Standard logistic function

[figure: plot of the standard logistic function p = 1 / (1 + e^(-z))]


Logistic regression

Scenario: a multidimensional feature space (features can be categorical or continuous), where the outcome is discrete, not continuous. We'll focus on the case of two classes. It seems plausible that a linear decision boundary (hyperplane) will give good predictive accuracy.


Using a logistic regression model


The model consists of a vector β in d-dimensional feature space, plus an intercept α. For a point x in feature space, project it onto β to convert it into a real number z in the range −∞ to +∞:

z = α + β · x = α + β1 x1 + ... + βd xd

Map z to the range 0 to 1 using the logistic function:

p = 1 / (1 + e^(-z))

Overall, logistic regression maps a point x in d-dimensional feature space to a value p in the range 0 to 1.

Using a logistic regression model


The prediction from a logistic regression model can be interpreted as a probability of class membership, or as a class assignment obtained by applying a threshold to the probability. The threshold corresponds to a decision boundary in feature space.
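Putting the last two slides together, a minimal sketch of scoring one point; the values of α, β, and x below are made up for illustration, not taken from the course:

    % Score a point with a fitted model (alpha, beta), then threshold.
    alpha = -1.0;            % hypothetical intercept
    beta  = [2.0; -0.5];     % hypothetical coefficient vector
    x     = [1.2; 0.7];      % a point in 2-d feature space

    z = alpha + beta' * x;   % project onto beta: a real number
    p = 1 / (1 + exp(-z));   % map to (0,1) with the logistic function
    label = (p >= 0.5) + 1;  % threshold at 0.5 -> class 1 or 2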


Training a logistic regression model


We need to optimize α and β so that the model gives the best possible reproduction of the training-set labels. This is usually done by numerical approximation of maximum likelihood; on really large datasets, stochastic gradient descent may be used instead.
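In MATLAB, one common route to the maximum-likelihood fit is glmfit from the Statistics Toolbox. This sketch on synthetic data is an assumption about how such a fit proceeds, not the course's actual demo code:

    % Fit alpha, beta by maximum likelihood on synthetic 2-d data.
    rng(0);
    X = [randn(50,2) - 1; randn(50,2) + 1];        % two Gaussian blobs
    y = [zeros(50,1); ones(50,1)];                 % 0/1 class labels
    B = glmfit(X, y, 'binomial', 'link', 'logit');
    % B(1) is the intercept alpha; B(2:end) is beta
    phat = glmval(B, X, 'logit');                  % fitted probabilities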


Logistic regression in one dimension

[figures: fitted sigmoid curves on one-dimensional data]

The parameters control the shape and location of the sigmoid curve: α controls the location of the midpoint, and β controls the slope of the rise.
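A minimal sketch of the two parameters' effects (parameter values chosen arbitrarily): the midpoint (p = 0.5) sits at x = −α/β, and larger |β| gives a steeper rise.

    % Plot sigmoids with different alpha (location) and beta (slope).
    x   = linspace(-10, 10, 200);
    sig = @(a, b) 1 ./ (1 + exp(-(a + b .* x)));
    plot(x, sig(0, 1), x, sig(-2, 1), x, sig(0, 3));
    legend('\alpha = 0, \beta = 1', '\alpha = -2, \beta = 1', '\alpha = 0, \beta = 3');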


Logistic regression in two dimensions


Example: a subset of the Fisher iris dataset, restricted to two classes and the first two feature columns (sepal length SL, sepal width SW). A figure shows the data together with the fitted decision boundary.


Logistic regression in two dimensions


Interpreting the model's vector of coefficients:

From MATLAB: B = [ 13.0460  -1.9024  -0.4047 ]

α = B( 1 ),  β = [ β1 β2 ] = B( 2 : 3 )

α and β define the location and orientation of the decision boundary: −α / ||β|| is the signed distance of the decision boundary from the origin, and the boundary is perpendicular to β. The magnitude of β determines how steeply the probabilities rise from 0 to 1 across the boundary.
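A hedged reconstruction of this example (fisheriris ships with the Statistics Toolbox; the slides don't say which two classes were used, so setosa vs. versicolor is assumed here and the coefficients may not reproduce the B above):

    % Fit the first two iris features for a two-class problem.
    load fisheriris                               % defines meas, species
    X = meas(1:100, 1:2);                         % SL, SW; setosa + versicolor
    y = double(strcmp(species(1:100), 'versicolor'));  % 0/1 labels
    B = glmfit(X, y, 'binomial', 'link', 'logit');
    alpha = B(1); beta = B(2:3);
    % Boundary alpha + beta' * x = 0 is perpendicular to beta;
    % its signed distance from the origin is -alpha / norm(beta).
    d = -alpha / norm(beta);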


Heart disease dataset


13 attributes (see heart.docx for details): 2 demographic (age, gender) and 11 clinical measures of cardiovascular status and performance. 2 classes: absence ( 1 ) or presence ( 2 ) of heart disease. 270 samples. Dataset taken from the UC Irvine Machine Learning Repository: http://archive.ics.uci.edu/ml/datasets/Statlog+(Heart). Preformatted for MATLAB as heart.mat.
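A sketch of fitting this dataset; the variable names inside heart.mat (X, y) are assumptions, since the file's contents aren't shown here:

    % Load the preformatted data and fit a logistic regression model.
    load heart.mat                        % assumed: X (270x13), y (270x1, values 1/2)
    B = glmfit(X, double(y == 2), 'binomial', 'link', 'logit');
    phat = glmval(B, X, 'logit');         % P(class 2) for each sample
    pred = (phat >= 0.5) + 1;             % back to {1,2} labels
    acc  = mean(pred == y);               % training-set accuracy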


MATLAB interlude

matlab_demo_05.m


Logistic regression

Advantages:
- makes no assumptions about the distributions of the classes in feature space
- easily extended to multiple classes (multinomial regression; see the sketch after this list)
- natural probabilistic view of class predictions
- quick to train
- very fast at classifying unknown records
- good accuracy for many simple data sets
- resistant to overfitting
- model coefficients can be interpreted as indicators of feature importance

Disadvantages:
- linear decision boundary
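For the multinomial extension noted above, the Statistics Toolbox provides mnrfit; a minimal sketch on the full three-class iris data (not part of the course demos):

    % Multinomial logistic regression on all three iris classes.
    load fisheriris
    [~, ~, yc] = unique(species);   % class labels 1..3
    B = mnrfit(meas, yc);           % (d+1)-by-(k-1) coefficient matrix
    P = mnrval(B, meas);            % n-by-3 matrix of class probabilities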

