Full text: Papers accepted on the basis of peer-reviewed abstracts (Part B)

In: Wagner W., Székely, B. (eds.): ISPRS TC VII Symposium - 100 Years ISPRS, Vienna, Austria, July 5-7, 2010, IAPRS, Voi. XXXVIII, Part 7B 
2. METHODOLOGY 
A new approach for the use of the correlation coefficient in 
automatic image registration was recently explored (Gonsalves 
etal., 2008). In this paper, we generalized this approach, which 
will be described in the following. In order to simplify the 
provided analysis, we will focus on the problem of finding a 
translation in both horizontal and vertical directions, assuming 
that the considered region is approximately “flat”. Considering 
(Pref ,Lref ) and (P new ,Lnew) as the (Pixel,Line) coordinates of 
the reference and new (to be registered) images, respectively, 
their relation may be expressed as 
Pnew - Pref + 8 X 
(1) 
Lnew = Lref + §y 
(2) 
where § x and 8 y are the displacements (in pixel units) on the 
horizontal and vertical directions, respectively, between the 
reference and the new image. The registration of a full scene or 
images with more complex deformations may be performed 
according to the description in (Gonfalves etal, 2008), and 
further evaluated trough a proper set of measures (Gongalves 
etal., 2009). The several steps of the proposed methodology will 
be described in the following. 
2.1 Division of the image into tiles 
As previously mentioned, in this work the focus relies on 
approximately “flat” regions. Depending on the terrain slope 
variation and on the image acquisition geometry, it may become 
difficult to avoid slight differences on the shifts throughout the 
images. Therefore, the division of the image into tiles is also 
considered in this work, to evaluate whether it may allow for 
reducing some of these remaining effects. The tiles must be 
sufficiently higher than the shift known or estimated a priori, for 
which a minimum size of 64x64 pixels up to the full image size 
(a single tile) may be generally applicable. The following steps 
are applied to each tile. 
2.2 Similarity image 
Instead of the traditional similarity surface, two similarity 
images are proposed, each one corresponding to the horizontal 
and vertical directions. The similarity image is produced by 
computing a similarity measure along one dimension at a time. 
Considering a tile with m-by-n pixels, then the similarity image 
for the horizontal and vertical directions will have n-by-n and 
m-by-m pixels, respectively. This procedure is schematically 
represented in Figure 1. The correct shift between the tiles is 
expected to produce a brighter diagonal strip on the similarity 
image, corresponding to the higher values of the similarity 
measure. An example of a similarity image is provided in Figure 
2c. 
Image 1 
Image 2 
Similarity image 
the xx axis 
X 
.. 
\ x 
X 
x 
X 
X X 
x 
X 
X 
X 
X 
X 
X 
► 
X X 
X 
X 
X 
X 
X 
X 
X 
X X 
X 
X 
* 
X 
X 
X 
* 
Image 1 
Image 2 
Similanty image 
the yy axis 
X 
X 
X 
X 
X 
X 
X X 
x 
X 
HHHI 
X 
X 
X 
X 
X X 
X X 
x 
x 
4c x 
x 
: x 
X X 
x 
x 
X 
x 
X 
X 
X 
X 
X X 
X 
X 
Figure 1. Illustration of the similarity image computation in 
both xx and yy axis. 
(c) (d) (e) (f) 
Figure 2. Illustration of the main steps of the proposed 
methodology (further details in section 2): (a) A segment with 
256x256 pixels from a Landsat image; (b) a segment with 
256x256 pixels from an ASTER image; (c) similarity image 
(using the correlation coefficient); (d) filtered similarity image; 
(e) image in (d) converted to binary; (f) -45° line detected by 
the Hough transform, superimposed on the similarity image. 
2.3 Similarity image filtering 
In order to enhance the visibility of the similarity image brighter 
diagonal strip, the similarity image is filtered using a -45° 
oriented line. The length of this line is defined to be 15% of the 
image dimension norm, which may be broadly applied to any 
sensor (Gonijalves etal., 2008). The filter window is composed 
by positive values along the diagonal and zeros outside. Its 
effect is illustrated in Figure 2d. 
2.4 Conversion from gray level to binary 
Prior to the Hough transform computation, there is the need to 
convert the filtered similarity image to a binary format. A 
threshold equal to the percentile (l-3/n)xl00 (rounded to the 
smaller integer) is considered, where n is the number of lines 
(or columns) of the similarity image (which is squared). The 
binary image of the example is provided in Figure 2e. 
2.5 Hough transform 
At the Hough transform step (Hough, 1962), the 0 and p 
resolution is defined as 0.5. The Hough transform is computed 
for the similarity images in both xx and yy axis. For each of 
them, more than one line may be identified, associated to the 
most prominent peaks. 
2.6 Main diagonal identification and displacement 
computation 
The slope of the detected line(s) in the previous step is 
computed, being considered only those with slope between 
-0.95 and -1.05. In case of more than one line is detected with a 
slope of exactly -1.00, the line with highest height is selected. 
The displacement on each axis is finally obtained by computing 
the distance from the selected line to the main diagonal (-45° 
line starting at row 1 column 1). This step is illustrated in 
Figure 2f. 
2.7 Estimation of 5 X and 5 y 
In the case that the image is not divided into tiles (the image 
being itself a single tile), then the estimates for S x and 8 y are
	        
Waiting...

Note to user

Dear user,

In response to current developments in the web technology used by the Goobi viewer, the software no longer supports your browser.

Please use one of the following browsers to display this page correctly.

Thank you.