Full text: XVIIIth Congress (Part B3)

   
      
INMENTS 
Detection, Invariants 
| of manhole covers as 
les us to automatically 
ching procedure which 
a cadastral database. 
spondences per image, 
les) are available from 
stem; and, as we will 
d with high precision, 
ites from the cadastral 
0 major aspects in this 
arks of the considered 
0 match constellations 
ole positions from the 
> following the interior 
to be known. 
:CTION AND 
ANDMARKS 
based on a parametric 
cation, size, shape, and 
picted manhole covers. 
or between the model 
model parameters and, 
Imark's position in the 
This can be shown for 
i. À short description of 
| be given below. More 
Rohr, 1995). 
lar Landmarks 
rers varies from country 
ific type which consists 
rk concentric ring (see 
normally are recorded 
plane, images of these 
ge intensities of a cross- 
he considered type form 
| also take into account 
ise of the band-limiting 
ed shape as sketched in 
roximately be described 
, and Tmin, Where hmaz 
na 1996 
h max 
C mu h 0 
  
    
h min 
mmm mmm mmm mgm mm mmm 
----R-------4 
Figure 1: Ideal appearance of a manhole cover (left) and blurred cross-section intensities (right). 
and Amin are the relative values of the function's maximum 
and minimum with respect to the background-level ho; Tmin 
denotes the distance of the minimum from the center. As 
suggested by Figure 1, we approximate the ideal intensity 
profile using an analytic model whose general shape corre- 
sponds to the second derivative of the 2D Gaussian. However, 
the shape of this function is controlled by only two param- 
eters (amplitude and variance), while three parameters are 
needed for adequately describing the intensity profile of the 
landmark. Therefore, we represent the model by a modified 
function, which, on the one hand, well approximates the sec- 
ond derivative of a Gaussian, and, on the other hand, has 
three parameters describing its shape, namely ai, a2, and o: 
2 
M(z,y) = ao + (as + a2 - 1”) - exp (zx) © 
with r? = (x —x0)? + (y — vo)” and (xo, yo) being the image 
coordinates of the landmark center. 
Given our model function M and the image intensities of an 
instance of a manhole cover, we minimize the error E between 
the image and the model. In our approach, E is defined by 
the sum of the squared differences between the image intensi- 
ties I and the model M (which is a function of ao, a1, a5, c, 
zo, and yo) at some data points taken from a square window 
centered around the initial location estimate of the landmark. 
Since we are dealing with a non-linear model, we apply the 
iterative Levenberg-Marquardt method for minimizing the er- 
ror function. This method requires to analytically calculate 
the first partial derivatives of M with respect to each of the 
parameters to be optimized. An example for a model fitting 
result is given in Figure 2. 
2.2. The Landmark Extraction Scheme 
In our landmark extraction scheme, we obtain an initial set 
of potential landmark positions by exploiting the normalized 
cross-correlation for all local intensity maxima in the image, 
using a landmark prototype template. The potential land- 
marks detected this way are submitted to the parameter op- 
timization procedure described above which adapts the ana- 
lytic model function to the intensities of the given landmark. 
The results of the model fitting are twofold: the set of opti- 
mally adapted parameters and the final approximation error. 
Both are checked in a subsequent verification step in order to 
decide whether the adapted model describes a valid landmark 
instance. In this way, we are able to suppress a large fraction 
of false detections and obtain a high-precision localization of 
the actual landmarks. 
The initial parameter values required to set up the parame- 
ter optimization process are obtained from a small number 
of representative examples which have to be selected by the 
operator in a preceding training phase. The training results 
are also used to derive the thresholds applied in the verifi- 
cation step and to generate the prototype template used for 
landmark detection. 
2.3. Localization Precision Obtained by Model Fitting 
We investigated the localization precision obtained by model 
fitting for simulated landmark images which have been gen- 
erated with known parameters. The physical landmark size 
was assumed to be 80 cm; the acquisition parameters and 
the pixel size were set to typical values (see below), giving 
a pixel resolution of 15 cm on the ground, i.e. a landmark 
is typically represented by 6 by 6 pixels (unblurred). Using 
our simulation technique we are able to statistically evaluate 
the localization precision with respect to variations in image 
blur, sampling effects, noise, perspective projection, regular 
shape distortions, and interaction effects with background 
structures (e.g. road markings). In a large number of random 
experiments we found that the localization error is well be- 
low a hundredth of a pixel (less than 1 mm on the ground) 
for noise-free images and less than a tenth of a pixel (about 
1 cm on the ground) for images having a realistic amount of 
noise. Even in the presence of serious background distortions 
the localization precision is still in the lower sub-pixel range 
for most kinds of distortions. 
2.4. Extraction Performance on Real Image Data 
Experiments on real image data have been done on a number 
of data sets. We used color infrared photographs, which have 
been acquired with a Zeiss RMK A-30/23 camera from an 
altitude of 1500 m, giving an image scale of 1:5000. The 
photographs have been scanned at a resolution of 30 um, 
resulting in a pixel size of 15 cm on the ground and an image 
size of about 7700 by 7700 pixels (covering a ground area of 
about 1.3 km?). 
A number of 400 to 500 manhole covers is visible in each 
of the images, including a significant fraction of landmarks 
which are seriously distorted by background structures, have 
very low contrast, or do not agree with the ideal landmark 
model at all. According to these effects, the extraction 
process typically yields a number of 100 to 200 detections. 
The percentage of false detections is in the range of 1076 
to 20% of the total number of detections, which is very 
low considering the high complexity of the analyzed scenes. 
147 
International Archives of Photogrammetry and Remote Sensing. Vol. XXXI, Part B3. Vienna 1996 
   
    
  
  
  
   
    
    
   
    
  
  
   
  
   
  
   
  
   
     
  
  
   
    
   
  
    
   
    
   
   
  
   
   
    
  
    
  
   
  
   
   
  
   
    
  
   
   
  
   
   
    
    
EARLIER S MM aa A 
  
  
VUE 
EE 
a cu 
TE EEE 
FE 
s
	        
Waiting...

Note to user

Dear user,

In response to current developments in the web technology used by the Goobi viewer, the software no longer supports your browser.

Please use one of the following browsers to display this page correctly.

Thank you.