Full text: Proceedings, XXth congress (Part 7)

ih ES M 
M. 
do 
on 
of 
xn 
els 
30- 
are 
nts 
ıld 
ith 
to 
its) 
ion 
ata 
International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Vol XXX V, Part B7. Istanbul 2004 
  
additional information from a GPS (Global Positioning System) 
and IMU (Inertial Measurement Unit) device are recorded at a 
very high temporal resolution, providing information about the 
exact position and orientation of the sensor at the time of 
acquisition. This so-called direct georeferencing is crucial, as 
due to the dynamic acquisition principle and due to 
unsteadiness of the airplane movement each scanline has its 
individual exterior orientation and a conventional orientation 
procedure would not be able to deliver the required accuracy. 
The typical process of this direct georeferencing is described in 
Fricker (2001). Next, after automatic tie-point measurement 
triangulation can be applied. It is recommended to use some 
GCPs at least at the block corners, but as shown in the excellent 
results from Scholten and Gwinner (2003) it is not obligatory, 
since the collected orientation data from the GPS and IMU unit 
seem to be sufficient. 
Once the images are accurately oriented (multi-image) 
matching algorithms can be applied for deriving a DSM. 
Finally, the images and the DSM can be used to create an 
orthophoto of the scene, which will also be used as input for the 
building extraction algorithm. 
At the DLR (German Space Agency) the processing chain for 
handling airborne three-line scanner imagery has been 
automated to such a degree, that human interaction is only 
needed at the data acquisition step (Scholten and Gwinner, 
2003). Once the data is downloaded onto the computers the 
whole process from image correction and rectification, to 
orientation and triangulation, and to the DSM and orthophoto 
production can be done fully automatically. 
4.2 Building Recognition 
The most important input for the creation of a Digital City 
Model is the DSM. But as mentioned previously, the resolution 
and accuracy if derived from spaceborne or airborne line 
scanning imagery is rather low (compared to Lidar-DSMs). 
Hence the images or orthophotos if available, will be used to 
proceed to building extraction. 
The first step is to create a so-called normalized Digital Surface 
Model (nDSM), which is the difference-model between the 
DSM and the DTM. Hence all objects in the nDSM stand on 
elevation height zero. 
There are multiple approaches for getting the nDSM. An 
approved method is to filter the existing DSM-data by applying 
a skewed error distribution (Kraus, 2000). This procedure is 
fully automated and the user needs only to input the parameters 
of the skewed error-distribution in order to calculate the nDSM, 
in which vegetation and buildings are eliminated. Of course, 
when the area of interest is highly urban and nearly no terrain 
points are available in the DSM this technique might not deliver 
the desired result. In such a case it is necessary to manually 
digitise points on the ground and interpolate or approximate a 
DTM-surface through them. 
The next and maybe most critical step is to find the regions of 
interest, i.c. the regions of potential candidates for buildings. 
Until today, either the buildings were digitised in stereo or the 
building corners were localized in the orthoimage or the user 
had to tell the system where a building is. 
Rottensteiner (2004) proposed to create a 'building mask’ using 
a certain elevation threshold. This mask would point out areas 
where objects higher than the threshold are located, and 
everything below that height is considered as 'non-building!. 
'Non-building' can be low vegetation, cars or other objects 
positioned on the terrain surface, but not being high enough to 
be treated as building candidates. Unfortunately, it is not 
seldom that vegetation, especially trees or even forests, have 
the same height as buildings do. 
Figure 2 shows such a typical case. One can easily recognize 
that the dense trees to the left are at least as high as the 
buildings. In such a case simple thresholding would not make it 
possible to distinguish between objects of interest (buildings) 
and vegetation. 
  
Figure 2. Image and DSM subset from the HRSC-Bern data set 
The algorithm proposed for finding regions with potential 
buildings is the following: 
Set a certain elevation threshold of minimal building- 
height according to the current scene, then 
- mark the potential candidate-regions of the nDSM in 
the image and 
— apply a check for homogeneity in the image to refine 
the candidate selection. 
This makes sense, since most of the times building-roofs are 
very homogeneous e.g. one colour, or one regular pattern of 
tiles, in comparison to treetops or other vegetation. To use the 
first derivative of the nDSM as a homogeneity parameter (a 
very popular method) is not recommended, because the derived 
nDSMs from high-resolution line scanning systems are much 
too coarse. 
4.3 Building Extraction 
Once the regions of building candidates are found, one has to 
extract the shape of the building. The coarseness of the nDSMs 
does not allow the determination of the roof shape. Even more, 
it is not even possible to get the building corners from these 
nDSMs. Therefore additional information is taken from the 
orthophotos. Since they provide higher geometric quality, they 
will be used to obtain the location of corners and hence the 
shape of the buildings. 
For the segmentation in intensity images four approaches are 
common, namely threshold techniques, boundary-based 
methods, region-based methods, and hybrid techniques which 
combine the latter two (Adams and Bischof, 1994). 
The proposed algorithm for building extraction is an ‘adaptive’ 
region-based method. The growing process of a certain region 
depends on the assigned threshold. If the difference between the 
value of the starting point (seed) and the value of examined 
neighbouring, adjacent pixels is equal or smaller than the given 
threshold, the neighbouring pixels will be aggregated into the 
region. The whole procedure is described in the following 
steps: 
I. Inside the potential building area called seed region 
(Figure 3a) start with the region growing process and 
1035 
 
	        
Waiting...

Note to user

Dear user,

In response to current developments in the web technology used by the Goobi viewer, the software no longer supports your browser.

Please use one of the following browsers to display this page correctly.

Thank you.