You are on page 1of 4

A Study of the Sensitivity of Automatic Image

Registration Algorithms to Initial Conditions


Jacqueline Le Moigne Arlene Cole-Rhodes Roger Eastman
Applied Information Sciences Branch Kisha Johnson Loyola College
NASA/GSFC-Code 935 Electrical & Comp. Eng. Dept Baltimore, MD 21210
Greenbelt, MD 20771 Morgan State University reastman@loyola.edu
Tel:301-286-8723 Baltimore, MD 21251
Fax:301-286-1776 acrhodes&kjohnson@eng.morgan.edu Harold Stone, Ilya Zavorin
Jacqueline.J.LeMoigne- UMBC/GEST Center at GSFC
Stewart@nasa.gov Nathan S. Netanyahu Greenbelt, MD 20771
Dept Math. & Comp.Sc. Bar-Ilan hstone@alumni.princeton.edu
Jeffrey Morisette Univ., Ramat-Gan, 52900, Israel, zavorin@backserv.gsfc.nasa.gov
Laboratory for Terrestrial Physics and Center For Automation
NASA/GSFC-Code 923 Research, Univ. of Maryland, Peyush Jain
Greenbelt, MD 2071 College Park, MD 20742 NASA/GSFC-Code 586
Jeffrey.T.Morisette@nasa.gov nathan@cfar.umd.edu Greenbelt, MD 20771
Peyush.Jain@nasa.gov

Abstract- While automatic image registration algorithms initial geolocation information may be accurate from within
are usually being evaluated with regards to their one to several pixels. Larger errors may occur when model
accuracy, it is often useful to relate this accuracy to the parameters, such as position and orientation of the spacecraft,
"initial conditions", i.e., the distance between the initial are inaccurate. Furthermore, additional errors in the predicted
navigation geolocation and the correct result. In this values can be introduced, e.g. during or after a maneuver of
paper, we describe a modular framework that was built the satellite, and the initial geolocation errors may grow
to describe registration algorithms, and we utilize this progressively during several weeks until finally discovered.
framework to attempt to classify different registration In this paper, we describe a modular registration framework
components and algorithms in terms of their responses to built at NASA that is being utilized to attempt to classify
the initial conditions. Performances will be evaluated on different registration algorithms in terms of their responses to
synthetic data, multi-temporal and multi-sensor data. All different parameters, especially initial conditions. For
results of the study will be presented at the conference example, some algorithms might always be able to reach an
and will be useful for two different purposes: (1) provide approximate solution even when starting far from it, although
they might be less accurate than another technique that
automatic quality assessment of the geo-location of
reaches very high sub-pixel accuracy but needs to start
remote sensing data by performing inter-algorithm
relatively close from the “true” solution. These performances
consistency studies; and (2) be the foundations for the
will be evaluated on several test data and results will be
design of future on-board applications including presented at the conference. This study will be useful for 2
planetary exploration. different purposes: (1) provide automatic quality assessment
I. INTRODUCTION
of the geo-location of remote sensing data by performing
Future decision support systems, intelligent sensors and inter-algorithm consistency studies; (2) be the foundations for
adaptive constellations will rely on real- or near-real-time the design of future on-board applications and potentially
interpretation of Earth observation data, both on-board and planetary exploration. This paper will address the general
from ground-based Direct Readout stations. The more expert problem of image registration, review some of the previous
the system and far-reaching the application, the more studies performed by the authors, and will describe our
important will it be to obtain accurate data. Utilized for modular registration framework that enable to independently
global change analysis, validation of new instruments or for test multiple registration components.
new data analysis, accurate image registration is also very
II. A MODULAR IMAGE REGISTRATION FRAMEWORK
useful for real-time applications such as those related to
direct readout data, or on-board navigation and processing The goal of our project is to develop and assess image
systems. While often, automatic image registration registration methodologies that will enable accurate multi-
algorithms have been evaluated with regards to their source integration. Our work focuses on precision correction
accuracy, it is also useful to relate this accuracy to the "initial or automatic image registration. While navigation often
conditions", i.e., the distance between the initial navigation refers to “systematic correction”, image registration refers to
geolocation and the correct result. Depending on the quality “precision correction,” and while systematic correction is
of the navigation model and of the ephemeris data, such model-based, while precision correction is feature-based. For

0-7803-8742-2/04/$20.00 (C) 2004 IEEE 1390

Authorized licensed use limited to: M Th Koelen. Downloaded on June 12, 2009 at 10:00 from IEEE Xplore. Restrictions apply.
many applications, it is very important to reach the sub-pixel Simoncelli/low-pass features have a better radius of
accuracy that can be achieved by precision correction. In our convergence, while Simoncelli/band-pass features are the
experiments, we assume that the data has already been best in terms of accuracy and consistency, but than when they
corrected according to a navigation model and is at a level converge, the Spline features present the best accuracy.
equivalent to the EOS-level 1B. Assuming that the results of
(B) Similarity Measures Experiments
the systematic correction are accurate within a few or a few
Using Simoncelli band-pass features as registration
ten’s of pixels, our precision-correction algorithms utilize
features, exhaustive search as well as a stochastic gradient
selected image features or control points to refine this geo- optimization matching strategy (Spall’s algorithm [10]) were
location accuracy within one pixel or a sub-pixel. Currently, utilized to compare 2 similarity measures: correlation and
there is a large quantity of potential image registration mutual information [9]. The results show that similarity
methods that have been developed for aerial or medical peaks obtained with mutual information are sharper than
images and that are applicable to remote sensing images those obtained with correlation; this can be important when
[1,2]. But there is no systematic study that enables to select we need to reach sub-pixel registration accuracy.
the most appropriate method for a remote sensing application Furthermore, when using mutual information with a
and predict its accuracy. stochastic gradient, results show an accuracy of 0.01 pixel on
synthetic test data, 0.64 pixel on multi-temporal (cloudy)
As a general definition, image registration is described as data, and 0.34 pixel on multi-sensor data.
the process that aligns one image to another image of the
same area that was acquired at the same or at different times Following these preliminary experiments focusing on one
or another of the components of the registration process, the
by different or identical sensors. Image registration can be
next step is then to perform systematic registration
defined by three main steps: (1) extraction of features to be components comparisons, where all these components can be
used in the matching process; (2) feature matching strategy combined in different ways and assessed on well-chosen test
and metrics; (3) resampling or indexing of the data. The data. In order to perform these systematic studies and to
intent of our work is to survey, design, and develop different enable new components to be tested in a rigorous fashion, we
components of the registration process and to evaluate their started building a modular image registration framework. The
performance on well-chosen multiple source data. concept guiding this framework is that various components
of the registration process can be combined in several ways
Many choices are available for each of the previous three in order to reach optimum registration on a given type of data
steps. In earlier work, we investigated step (1), first focusing and under given circumstances. Thereby, the purpose of this
on correlation-based methods, then looking at optimization- framework is double-fold:
based methods. 1. it represents a testing framework to:
a. Assess various combinations of components as
(A) Features Experiments a function of the applications,
Our first experiments focused on features' assessment. b. Assess a new registration component compared
Using correlation as a similarity metrics, we showed that: to other known ones.
• band-pass (e.g., edge or edge-like) features are more 2. it is be the basis of a registration tool where a user can
robust, but less accurate than low-pass (e.g., intensity) “schedule” a combination of components as a function of
features. the application at hand, the available computational
• when comparing gradient-edges and wavelets, we observed resources and the required registration accuracy.
that orthogonal wavelet-based registration was usually
faster although not always as accurate than a full-resolution Figure 3 illustrates this concept, where a registration
edge-based registration [4,5]. This was obtained by algorithm is defined as the combination of a set of features, a
exploiting the multi-resolution nature of wavelets. similarity measure, and a matching strategy. In our current
• among wavelet-like representations, which can be framework:
computed in real or near-real time, the “Steerable - features can be either gray levels, low-Pass features from
Pyramid,” or Simoncelli representation [6,7], is more Simoncelli steerable filters decomposition or from a Spline
accurate and more robust than Daubechies’ filters. decomposition, or Simoncelli band-Pass features,
- similarity metrics can be either cross-correlation, the L2
Subsequent experiments were based on a gradient descent Norm, Mutual Information or an Hausdorff distance,
optimization and used an L2 norm as similarity metrics. - matching strategies are either based on a Fast Fourier
Using this matching methodology, we compared features Correlation, three different types of optimization - pure
obtained from two different multi-resolution decompositions, gradient descent, a Marquard-Levenberg approach
the Simoncelli steerable pyramid and the Spline (developed by Thevenaz et al [10], denoted TRU in the
decompositon [8]. While the Simoncelli steerable pyramid remaining of this paper) or a stocchastic gradient approach
produces low-pass and band-pass features, the Spline (developed by Spall et al) [9,12], and a Robust Feature
pyramid only produces low-pass features. Results then Matching approach [11].
showed than for a gradient optimization matching,

0-7803-8742-2/04/$20.00 (C) 2004 IEEE 1391

Authorized licensed use limited to: M Th Koelen. Downloaded on June 12, 2009 at 10:00 from IEEE Xplore. Restrictions apply.
Spline or
Simoncelli
Features Gray Levels Simoncelli
Band-Pass
Low-Pass

Similarity Correlation L2 Norm Mutual Information Hausdorff Distance


Measure

Thevenaz,
Gradient Rutimann, Spall’s Robust
Strategy FFT
Descent Unser Optimization Feature
Optimization Matching

Figure 3
Modular Approach to Image Registration Combining Various Choices
for Feature Extraction, Similarity Metrics and Matching Strategy

III. EXPERIMENTS AND TEST DATASETS reference chip of that area are computed and extracted, and
III.1. TEST
DATASETS local chip/window registrations are performed. Then, for each
Three different datasets are utilized for this study. pair of scenes, a global registration is being computed with a
Previous experiments were reported in [13,14,15] that generalized Least Mean Squares method that combines all
utilized subsets of these datasets while testing limited previous local registrations. Manual registration is available
numbers of component combinations shown in Figure 3. for this dataset to compute algorithms' accuracies.
These early experiments showed accuracies in the range of
[0.019,0.5] for synthetic images, [0.21,0.64] for multi- Perform Comparison of Different Combinations of Registration Components
temporal or multi-sensor data, and inter-algorithm on the Same Pair of Images
(Definition: Algorithm = "Features + Similarity_Measure + Strategy")
consistencies of about 1/3 pixel. The experiments described
below are now underway, and complete results will be READ Reference, Input, Masks, and Sizes
shown at the conference.
CHOOSE "RST" type and Interpolation Method
(A) Synthethic Dataset Experiments
The first dataset is synthetically created using a 512x512 LOOP on Feature methods
section of a Landsat-TM (band 4) of a Pacific Northwest
DECOMPOSE Reference +
scene. This reference image is geometrically transformed DECOMPOSE Input
using the combination of a scaling, a rotation and a translation, Record Timing
and radiometrically transformed using different amounts of
LOOP on Similarity Measures
Gaussian noise and by convolution with a simulated Point-
Spread Function (PSF). In these experiments, rotations vary LOOP on Strategy Methods
between -20 and +20 degrees, translations between -20 and
+20 pixels, scales between 0.8 and 1.2, and noise variance CALL "image_register" subroutine
between -15 and +20. Both rotation and scaling are done with
respect to the center of the image. After the transformation is OUTPUT and STORE: Deformation
applied, the 256x256 centers of the transformed images are Parameters,timing, Simil-Measure_Value,
extracted and registered to the 256x256 center of the original and Overlap for that Algorithm
reference image. Figure 4 shows a flow chart of the testing of
the multiple registration components of Figure 3 applied to Figure 4
Experiments Setup
register synthetic images transformed with the various
geometric and radiometric transformations defined above. (C) Multi-Sensor Dataset Experiments
The third dataset used in this study represents multi-sensor
(B) Multi-Temporal Dataset Experiments data acquired by four different sensors over four of the
The multi-temporal dataset has been acquired over two MODIS Validation Core Sites. The four sites represent four
areas, Central Virginia and the Washington DC/Baltimore different types of terrain in the United States:
area. For each area, one reference scene is chosen and 6 to 8
• Coast Reserve area with the Virgina site, data acquired
reference chips of size (256x256) are extracted. Also for each
in October 2001.
area, the dataset includes 4 input scenes known from their
UTM (Universal Transverse Mercator projection) coordinates. • Agricultural area with the Konza Prairie in the state of
From these coordinates, windows corresponding to each Kansas, data acquired July to August 2001.

0-7803-8742-2/04/$20.00 (C) 2004 IEEE 1392

Authorized licensed use limited to: M Th Koelen. Downloaded on June 12, 2009 at 10:00 from IEEE Xplore. Restrictions apply.
• Mountainous area with the CASCADES site, data
acquired in September 2000.
• Urban area with the USDA, Greenbelt, Maryland, site, ETM+ IKONOS
data acquired in May 2001.

The four sensors and their respective bands and spatial


resolutions involved in the study are:
• IKONOS Bands 3 (Red) and 4 (Near-Infrared), spatial
resolution of 4 meters per pixel,
• Landsat-7/ETM+ Bands 3 (Red) and 4 (Near-Infrared),
spatial resolution of 30 meters per pixel.
• MODIS Red and Near-Infrared bands, spatial resolution
of 500 meters per pixel. Figure 6
• SeaWIFS Red and Near-Infrared bands, spatial ETM and IKONOS data of the CASCADES Mountainous Area
resolution of 1000 meters per pixel.
Figures 5 and 6 show examples of these data.
ACKNOWLEDGEMENTS
The authors wish to acknowledge the support of the NASA
Intelligent Systems/ Intelligent Data Understanding Program.
ETM+ IKONOS
REFERENCES
[1] L. Brown, “A Survey of Image Registration Techniques,” ACM
Computing Surveys, vol. 24, no.4, 1992.
[2] J. Le Moigne, W.J. Campbell, and R.F. Cromp, 2002, “An Automated
Parallel Image Registration Technique of Multiple Source Remote
Sensing Data,” IEEE Transactions on Geoscience and Remote Sensing,
Vol. 40, No. 8, pp. 1849-1864, August 2002.
[3] J. Le Moigne et al, 2000, “Geo-Registration of Landsat Data by Robust
Matching of Wavelet Features,” IGARSS'2000, Intern. Geoscience and
Remote Sensing Symposium, Honolulu Hawaii, July 24-28, 2000.
[4] J. Le Moigne, and I. Zavorin, “Use of wavelets for image registration,”
SPIE/Aerosense'2000, Orlando, FL, Apr.00.
Figure 5
[5] J. Le Moigne et al, “ Multi-Sensor Registration of Earth Remotely Sensed
ETM and IKONOS data of the Virginia Coastal Area
Imagery,” 8-th SPIE Int. Symp. Remote Sensing, Im&Sign. Proc. for
Remote Sensing VII, Vol.4541, France, Sept. 2001.
For this test, multi-sensor registrations are performed in [6] J. Le Moigne et al, “ Multi-Sensor Image Registration for On-the-Ground
"cascade": IKONOS to ETM+, ETM+ to MODIS, and or On-Board Science Data Procesisng,” Science Data Processing
Workshop, SDP'2002, Greenbelt, January 2002.
MODIS to SeaWIFS. Wavelet decomposition is utilized not [7] H.S. Stone, “Progressive Wavelet Correlation Using Fourier Methods,”
only to compute registration features, but also to bring IEEE Transactions on Signal Processing, Vol. 47, No. 1, 97-107, 1999.
various spatial resolution data to similar resolutions, by [8] R. Eastman, J. Le Moigne, "Gradient-Descent Techniques for Multi-
performing recursive decimation by 2. For example, after 3 Temporal and Multi-Sensor Image Registration of Remotely Sensed
Imagery," FUSION'2001, 4-th Int. Conf. Info. Fusion, CA, August 2001.
levels of wavelet decomposition, the IKONOS spatial [9] P. Thévenaz et al, “A pyramid approach to subpixel registration based on
resolution is brought to 32 meters that, compared to the intensity”, IEEE Trans. on Image Processing, Vol. 7, No. 1, .1998.
Landsat spatial resolution, corresponds to a scaling of 1.07. [10] I. Zavorin et al, “Evaluating Performance of Automatic Techniques for
This will be the scaling expected when registering IKONOS Sub-Pixel Registration of Remotely Sensed Imagery,” SPIE Electronic
Imaging 2003, Image Processing: Algorithms and Systems II.
to Landsat data in our study. For this dataset, previous [11] K. Johnson et al, 2001, “Multi-Resolution Image Registration of
experiments have shown algorithms to be self-consistent Remotely Sensed Imagery Using Mutual Information,” SPIE Aerosense
within 1/8 pixel. 2001, Wav. Appl. VIII, Florida.
[12] A. Cole-Rhodes et al, 2002, “Multi-Resolution Registration of Remote
Sensing Images Using Stochastic Gradient,” SPIE Aerosense 2002,
V. CONCLUSION Wavelet Applications IX, Orlando, FL, April 2002.
The experiments presented in this paper deal with multi- [13]N. Netanyahu et al, “Geo-Registration of Landsat Data by Robust
sensor, multi-resolution precision correction or image Matching of Wavelet Features,” to appear in IEEE Transactions on
registration. Using gray levels or wavelet features, several Geoscience and Remote Sensing, 2004.
[14] J.Le Moigne et al, "Earth Science Imagery Registration," IGARSS'2003,
similarity metrics and search strategies are being tested using International Geoscience and Remote Sensing Symposium, Toulouse,
synthetic data as well as Landsat multi-temporal data and France, July 20-24, 2003.
IKONOS, Landsat, MODIS and SeaWIFS data over four [15] J. Le Moigne et al, "Registration of Multiple Sensor Earth Science Data,"
well-chosen EOS Land Validation Core Sites. From these Earth Science Technology Conference, Palo Alto, CA, June 22-24, 2004.
tests, we will be able to categorize the different algorithms in
terms of their accuracy but also in terms of their sensitivity
to initial (i.e., navigation) conditions, as well as their
computational and memory requirements. Full results of
these experiments will be shown at the conference.

0-7803-8742-2/04/$20.00 (C) 2004 IEEE 1393

Authorized licensed use limited to: M Th Koelen. Downloaded on June 12, 2009 at 10:00 from IEEE Xplore. Restrictions apply.

You might also like