XXII ISPRS Congress 2012: Technical Commission III

RFACE MODELS 
AGERY 
rmany 
ny 
igh resolution digital surface 
ned automatically, there are 
: surface models. Especially 
which is a growing problem 
illy manual selection of tie- 
for stereo matching. It also 
| estimate of the depth range 
1 this paper an approach is 
point selection, enabling the 
rater masking and elevation 
tests with a large number of 
1 reliability of the proposed 
drastically reduced, the time 
teps became the critical part 
ore, during the last years, 
been developed to solve this 
| 2008; Wohlfeil, 2010 and 
) a very operational solution 
of high resolution digital 
put. As most of the current 
resolution are line scanners, 
g. WorldView 1/2, GeoEye, 
| on this type of sensors. 
eparation 
image data 
r 
processing 
     
oto generation 
e processing steps 
  
ly slow, the search range 
nge in the images can be 
f the terrain. An automatic 
ed in Section 3.6. 
structions from images, the 
f cameras must be known. 
~ (interior orientation) is 
typically known due to camera calibration. For line cameras 
the extrinsic parameters (exterior orientation) essentially 
consists of 6 degrees of freedom for every captured camera 
line. Dense stereo matching requires that the remaining 
geometric error is less than 1 pixel in image space. However, 
if possible it should be below 0.5 pixels. Since the absolute 
pointing accuracies of satellites are much worse, the exterior 
orientation must be optimized with respect to a precise 
relative orientation, using homologous points (also called tie 
points). Especially for line imagery there was a lack of 
software for performing this task robustly and reliably. With 
an integrated approach, presented in the Sections 3.1 to 3.3 of 
this paper, this problem 1s solved. 
If the requirements in terms of relative orientation cannot be 
met, the resolution of the DSM has to be reduced. This task is 
also being automatized with the help of a suitable accuracy 
measure, introduced in Section 3.4. 
Another problem is water, which cannot be matched, since 
images are taken at different times, causing different textures 
in every image based on the movement of waves. Thus water 
should be identified, ignored while matching and smoothly 
interpolated from the shore, later on. The automatic water 
masking is described in Section 3.5. 
An overview of the processing chain is given in Figure 1. The 
only manual interaction that remains is the selection of the 
suitable stereo imagery and its housekeeping data (initial 
exterior and interior orientation, etc.), described in the 
following Section. 
2. PREPARATION 
For the generation of DSMs one or more groups of images 
are specified. Each group contains two or more images that 
are to be matched via SGM. If possible, the images of each 
group should be captured 
- with a large overlap, in which matching can be performed 
- with different along-track viewing angles (pitch angles) 
- at similar seasons and daytimes in order to avoid large 
differences of shadows and vegetation, which can reduce 
the quality of the result 
If it is not possible or not economic to meet these conditions, 
processing is possible as well, but the quality and resolution 
of the resulting products will be suboptimal, but still very 
useful for many application. 
The images of different groups should partly overlap. This is 
important to enable the generation of one large and continu- 
ous DSM of the entire captured area. The homologous points, 
found in the overlapping areas, allow a global alignment of 
the images, reducing spatial discontinuities of the DSM 
between the areas in a high degree. It is also recommendable 
to use groups with different across-track viewing angles (roll 
angles) for the same area in order to resolve most of the 
occlusion in urban areas. At the current stage of development 
the selection of the images is performed manually because 
this only takes a few minutes. There are very good prospects 
to automatize even this step as well (see Section 5). 
3. PROCESSING 
3.1 Height Range Determination 
One important parameter for SGM process is the maximal 
occurring disparity i.e. the size of the search range for 
matching. This parameter negatively influences the result of 
the algorithm if set too low, which results in the search for 
matches being cancelled too early. If — on the other hand - it 
is set unnecessarily high, it slows down processing, since the 
75 
computation time of the process depends on the size of the 
search range. 
Finding the right value for this parameter is essentially a 
matter of finding the maximum and minimum height in the 
scene being processed. We achieve this by consulting the 
digital elevation data of the Shuttle Radar Topography 
Mission (SRTM) of the year 2000. 
The SRTM data is freely available on the web from the U.S. 
Geological survey, providing an elevation model of the 
world in 3 arc-second resolution. This information is 
downloaded and evaluated when needed and a safety buffer 
of 400 meter is added to allow for buildings and other 
deviations from the model. 
3.2 Automatic and robust tie point selection 
The relative orientation of images is optimized by bundle 
adjustment, which requires a sufficient number of homolo- 
gous points visible in two or more images. Several ap- 
proaches for automatic point selection were developed in the 
last decades. The general approach is to select small salient 
image regions (features) in one image and to find the 
corresponding image regions in the other images. This can be 
performed via cross correlation or the efficient implementa- 
tion (Bouguet, 2000) of the KLT feature tracker (Tomasi and 
Kanade, 1991). In case of unknown scale and rotation differ- 
ences between images, approaches like SIFT (Lowe, 2004) 
and SURF (Bay et al, 2008) are preferable. However, 
satellite images are typically provided with a good initial 
orientation and the KLT feature tracker is by far more 
efficient than the other options. 
Independent of the approach used for feature matching, there 
is always the problem of mismatches that can occur under 
suboptimal conditions. Especially if several difficulties like 
moving objects and shadows, repetitive patterns, changing 
vegetation and illumination, specular reflections, water sur- 
faces, perspective distortion etc. come together, the number 
of mismatches easily exceeds the number of correct matches, 
even by multiples. In such cases almost all approaches for 
automatic point selection fail miserably. 
Two steps are vital to successfully process such difficult 
imagery. First, possible radiometric differences between 
images due to different spectral band characteristics of 
sensors, changed vegetation and sun angle have to be 
compensated as far as possible. This is performed by adaptive 
radiometric balancing (as explained in A.2.4 of Wohlfeil, 
2011). Second, the majority of mismatches have to be 
determined and eliminated already during tie point selection 
by a consistency check of redundant matches. Therefore, 
image features are being matched redundantly from every 
image to any other image in all possible directions. The 
consistency of different matches can then be checked. 
  
  
  
  
  
F Position GC | 112434 1.5 
x 1 | X IX X | X 
o XIX X 
3 
4 X] X XX 
SX XIX 
  
  
  
  
  
  
  
  
Table 1: Exemplary information associated with one feature 
in N= 5 possible images. The feature is apparently not visible 
in image 3 and it couldn't be tracked between images 5 and 2. 
The score associated with this feature is 56% (14/25) 
! http://dds.cr.usgs.gov/srtm/
1
2
...
86
87
88
89
90
...
586
587
Full text: Technical Commission III (B3)

Access restriction

Copyright

Note to user