Full text: Proceedings (Part B3b-2)

The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences. Vol. XXXVII. Part B3b. Beijing 2008 
3d point cloud from the image sequence three different cases of 
façades have to be investigated. 
1. The first façade type is standing parallel to the street 
and the camera is moving along this façade. These façades 
fulfil the conditions for the 3d point extraction, because the 
points are moving through the image from the right to the 
left and so their relative 3d position can be determined 
from their movement. 
2. The second class on façades is the class of occluded 
invisible façades. They, of course remain invisible as holes 
in the generated point cloud. 
3. The third class of façades is standing perpendicular to 
the street. Those façades are not moving along the camera, 
but are changing their scale as they are moving towards the 
camera. For those façades, the movement of points is very 
low and thus it is very difficult to estimate the correct 
coordinates. 
4. 
In addition to the limitations in the viewing position of the 
camera the low resolution and low grade of details cause a 
small number of points of interest. 
3.2 Description of the quality measurements 
It is necessary to make a quality investigation including the 
estimated camera path, the position and completeness of 
façades and the extracted textures of the façades in comparison 
to the given 3d building model of the GIS database and the 
recorded GPS camera path. For all images and GPS positions a 
synchronized time-code is stored. The camera path of the GPS 
is corrected using a Kalmann filter and interpolated for every 
time-code corresponding to an image of the sequence. Now, the 
first and the last position of the estimated camera from the 5- 
point pose estimation are moved to the interpolated positions of 
the GPS path with the corresponding time codes. This leads to a 
scale, rotation and translation of the estimated model onto the 
given 3d model and the GPS path. 
The estimated planes are now transformed to the world 
coordinate system. The scale factor is calculating comparing the 
length of the given camera path and the length of the estimated 
camera path for the first and last camera position. Afterwards, a 
3d rotation and translation is calculated to rotated and move the 
estimated camera path onto the given measured one. Because of 
the assumption that the GPS path is not afflicted with an error, 
the corresponding surfaces of the given building model and the 
generated planes are given by the smallest error in their 
orientation and the smallest translation vector of their 
barycenters. To avoid a systematic error caused by the GPS 
position, the translation vectors of the barycenters of all 
generated planes to their corresponding model surface are used 
to calculate a mean translation vector. The remaining 
translation vectors of the planes to the surfaces are the 
remaining positioning errors of the planes, the generated mean 
translation vector is used to move the generated camera path of 
the image sequence. The distance between this path an the GPS 
path can be caused by the 5-point algorithm, an incorrect 
camera calibration or the inaccuracy of the GPS positions. 
In addition to the positioning error of the planes, the 
completeness is a criterion of the quality of the surface 
reconstruction. For every estimated plane its length and height 
are determined calculating its bounding rectangle and compared 
to the length and height of the given corresponding façade. 
Caused by the reconstruction from points of interest, the 
generated planes are estimated to be a little bit smaller than the 
original façade. 
4. EXPERIMENTS 
The camera that was used for the acquisition of the test 
sequences offers an optical resolution auf 320x240 (FLIR 
SC3000) pixel with a field of view (FOV) of only 20°. The 
SC3000 is recording in the thermal infrared (8-12 pm). On the 
top of a van, the camera was mounted on a platform which can 
be rotated and shifted. Different scenarios of image sequences 
were acquired. The first scenario (Figure 2a) deals with several 
small façades belonging to one building block. The second 
(Figure 2b) shows a long façade with regular structure and a 
specific entry with overlap. The third scene (Figure 2c) shows a 
long façade with different structures. The forth scenario (Figure 
2d) deals with bridges between buildings crossing a street. 
Fig. 2: a) several façades with occlusion, b) façade with regular 
structure, c) façade with irregular structure, d) 
façade with building bridge 
For scenario 2 and 3, the situation is quite different. Both 
scenarios consist of only one long façade. This façade can be 
estimated quite easy because there are no occlusions of the 
façade and the complete façade is almost in one plane. Scenario 
4 has to deal with two building bridges crossing the street. 
Those bridges are separating the façade into segments. The 
bridges cannot be detected. 
5. RESULTS 
5.1 Extracted surface planes and camera path 
In general, the surface estimation works well for façades going 
parallel to the street. Façades standing perpendicular to the 
street are much more difficult to extract correctly. For the scene 
in figure 2a, three parallel surfaces can be extracted. For these 
surfaces, the relative camera path containing relative camera 
positions for all images of the sequence is generated. The result 
of the automated extraction is shown in figure 3.
	        
Waiting...

Note to user

Dear user,

In response to current developments in the web technology used by the Goobi viewer, the software no longer supports your browser.

Please use one of the following browsers to display this page correctly.

Thank you.