Professional Documents
Culture Documents
Kanade
Kanade
-
-
Lucas
Lucas
-
-
Tomasi
Tomasi
(KLT)
(KLT)
Feature Tracker
Feature Tracker
Computer Vision Lab.
Computer Vision Lab.
Jae Kyu Suhr
Jae Kyu Suhr
Computer Vision (EEE6503) Fall 2009, Yonsei Univ.
Introduction
Introduction
Computer Vision (EEE6503) Fall 2009, Yonsei Univ.
Motivation Motivation
Image sequence
Computer Vision (EEE6503) Fall 2009, Yonsei Univ.
Motivation Motivation
Feature point detection
Computer Vision (EEE6503) Fall 2009, Yonsei Univ.
Motivation Motivation
Feature point tracking
Computer Vision (EEE6503) Fall 2009, Yonsei Univ.
Motivation Motivation
Tracking result
Computer Vision (EEE6503) Fall 2009, Yonsei Univ.
3D Reconstruction results 3D Reconstruction results
Computer Vision (EEE6503) Fall 2009, Yonsei Univ.
Problem Statement
Problem Statement
Computer Vision (EEE6503) Fall 2009, Yonsei Univ.
Problem Statement Problem Statement
I, J : 1
st
and 2
nd
grayscaledimages
I(x,y) : gray value of I at [x y]
T
Let u=[u
x
u
y
]
T
be a point on the 1
st
image I
The goal is to find v on J , where I(u) and J (v) are
similar. (v= u+d = [u
x
+d
x
u
y
+d
y
]
T
)
d=[d
x
d
y
]
T
is the image velocity at u
or the optical flow at u.
Computer Vision (EEE6503) Fall 2009, Yonsei Univ.
Problem Statement Problem Statement
Definition of the residual function (d).
x
and
y
integration window size parameter
integration window size (2
x
+1)(2
y
+1).
(u
x
, u
y
)
w
x
w
y
I J
( - )
2
Computer Vision (EEE6503) Fall 2009, Yonsei Univ.
Pyramid Implementation
Pyramid Implementation
Computer Vision (EEE6503) Fall 2009, Yonsei Univ.
Why pyramid representation? Why pyramid representation?
Standard KLT algorithm can deal with small pixel displacement.
Solution for this is a pyramidal implementation.
Let I
0
=I be the 0
th
level image
The pyramid representation is built recursively.
I
0
I
1
I
2
I
3
I
Lm
(Lm: 2~4)
I
L-1
: the image at level L-1
I
L
: the image at level L
Computer Vision (EEE6503) Fall 2009, Yonsei Univ.
Pyramidal feature tracking Pyramidal feature tracking
A given point u in I, find its corresponding location v=u+d.
Corresponding point of u(u
0
) on the pyramidal image I
L
is u
L
Simple overall pyramid tracking algorithm
d
Lm
is computed at the pyramid level L
m
d
Lm-1
is computed
with an initial guess of d
Lm
at L
m-1
This continues up to the level 0
Computer Vision (EEE6503) Fall 2009, Yonsei Univ.
Let g
L
=[g
L
x
g
L
y
]
T
be an initial guess at level L
( g
L
is available from level L
m
to level L+1 )
Residual pixel displacement vector d
L
=[d
L
x
d
L
y
]
T
that minimizes the new image matching error function
L
Window size (2
x
+1)(2
y
+1) is constant for all pyramid
g
L
is used to pre-translate the image patch in 2
nd
image J
d
L
is small and therefore easy to compute using KLT algorithm
Pyramidal feature tracking Pyramidal feature tracking
Computer Vision (EEE6503) Fall 2009, Yonsei Univ.
Pyramidal feature tracking Pyramidal feature tracking
Assume that d
L
is computed, new initial guess g
L-1
at level L-1
Initial guess for the deepest level L
m
The final optical flow solution d
Computer Vision (EEE6503) Fall 2009, Yonsei Univ.
Standard KLT algorithm
Standard KLT algorithm
Computer Vision (EEE6503) Fall 2009, Yonsei Univ.
Standard KLT optical flow computation Standard KLT optical flow computation
Goal is to find d
L
that minimizes the matching function
L
.
Same operation is performed for all levels L, drop the superscript L
and define the new images A and B
Let us change the displacement vector and the point vector
Computer Vision (EEE6503) Fall 2009, Yonsei Univ.
Standard KLT optical flow computation Standard KLT optical flow computation
Goal is to find that minimizes the matching function
To find the optimum
Let us substitute by its 1
st
order Taylor expansion
about the point
frame difference Image gradient
Computer Vision (EEE6503) Fall 2009, Yonsei Univ.
Standard KLT optical flow computation Standard KLT optical flow computation
Computer Vision (EEE6503) Fall 2009, Yonsei Univ.
Standard KLT optical flow computation Standard KLT optical flow computation
( G must be invertible).
Standard LK is valid only the pixel displacement is small.
(Because of the first order Taylor approximation)
Therefore, iterative version of LK is necessary.
Computer Vision (EEE6503) Fall 2009, Yonsei Univ.
Iterative KLT algorithm
Iterative KLT algorithm
Computer Vision (EEE6503) Fall 2009, Yonsei Univ.
Iterative KLT optical flow computation Iterative KLT optical flow computation
k: iteration index