Full text: Close-range imaging, long-range vision

AUTOMATED APPEARANCE-BASED BUILDING DETECTION IN TERRESTRIAL 
IMAGES 
Jan Böhm”, Norbert Haala, Peter Kapusy 
Institute for Photogrammetry (ifp), University of Stuttgart, Germany 
Geschwister-Scholl-Strasse 24D, D-70174 Stuttgart 
Jan.Boehm@ifp.uni-stuttgart.de 
KEY WORDS: Object Recognition, CAD, Orientation, Navigation, Augmented Reality 
ABSTRACT: 
We present a method for automated appearance-based detection of buildings in terrestrial images. The problem is stated as follows: 
From an image with a given approximated exterior orientation and a three-dimensional CAD Model of the building, detect the exact 
location of the building in the image. The method we have developed uses the combination of an imaging device and hardware to 
approximately measure the exterior orientation. This paper presents a combination of our work on close-range photogrammetry and 
virtual city models. 
1. INTRODUCTION 
With the increased availability of low-cost and low-power 
consumption imaging devices we see a strong increase in their 
integration into mobile devices such as laptops, personal digital 
assistants (PDAs), mobile phones and so on. The combination 
of mobile computational capabilities, imaging capabilities, 
positioning capabilities and network access opens the door for a 
variety of novel applications, such as pedestrian navigation 
aids, mobile information systems and others usually referred to 
as 'location-based services’. With these devices at hand our 
interest is to exploit the capabilities of the imaging device for 
photogrammetric processing. 
Personal navigation and the provision of location dependent 
information are key features of location aware applications. One 
option to reach this goal is the application of augmented reality 
(AR) techniques. These techniques are based on the overlay of 
computer-generated graphics to the user’s actual view. The 
computer graphics are generated based on a spatial model of the 
visible environment. Of course the virtual computer graphic 
objects have to be overlaid to their corresponding primitives in 
the real world as the user observes them. For this reason the 
accurate determination of the actual position and orientation of 
the user is required in order to enable a precise mapping of the 
data. This paper presents a method for the precise detection and 
localization of buildings in terrestrial images and thereby also 
provides the means for better navigation of users. 
Within an urban environment AR can for example be applied 
for the presentation of name labels or additional alphanumeric 
data appearing to be attached to a side of a building. In addition 
to the visualization of these virtual signposts, more specialized 
applications could aim at the display of information based on 
“X-ray vision” in order to present features normally not visible 
for the user. Typical objects of interest are features hidden 
behind the facades of a building like the location of rooms or 
information on infrastructure like the position of power-lines. 
The integration of augmented reality into a tourist information 
System is another application for this kind of technique. The so- 
called telepointing capability is an important feature in 
  
* Corresponding author 
implementing a intuitive user interface for location based 
services (Fritsch et al. 2000). 
In order to detect objects usually a model of the object has to be 
available. In our case we need a model of the buildings to be 
detected. Therefore one of the key components of our approach 
is a 3D city model. The development of tools for the efficient 
collection of 3D city models has been a topic of intense 
research for the past years. In addition to Digital Height Models 
and data representing streets and urban vegetation, building 
models are the most important part thereof. Meanwhile a 
number of algorithms based on 3D measurement from aerial 
stereo imagery or airborne laser scanner data are available for 
automatic and semi-automatic collection of 3D building 
models. A good overview on the current state-of-the-art of 
experimental systems and commercial software packages is 
given in (Baltsavias et al. 2001). 
Within this article we present a method to analyze the image of 
an urban scene and reliably and robustly detect buildings. The 
method uses the combination of an imaging device and 
hardware to approximately measure the exterior orientation. 
The result of the detection can be used both for AR applications 
and for navigation purposes. Section 2 discusses the approach to 
the problem of object recognition we have chosen. It contains 
the description of the model representation, the feature 
extraction and the recognition algorithm. The utilization of the 
object recognition for the determination of the orientation 
parameters using spatial resection is detailed in section 3. In 
Section 4 we present the results of some typical test cases and 
demonstrate the accuracy of our approach. 
2. OBJECT RECOGNITION 
When the task is to detect a three-dimensional shape in an 
image, two general strategies for object representation are 
available. One is the three dimensional representation of the 
object which leads to a 3D to 2D matching problem, the other is 
a two-dimensional representation which leads to a 2D to 2D 
matching problem. While the former is the more general and 
theoretically more appealing approach, there are several 
practical problems, which often prevent its use. One of the 
—491- 
 
	        
Waiting...

Note to user

Dear user,

In response to current developments in the web technology used by the Goobi viewer, the software no longer supports your browser.

Please use one of the following browsers to display this page correctly.

Thank you.