Close-range imaging, long-range vision

high reso- 
rban areas. 
he images 
v a 3D re- 
calibration 
itting, cor- 
lereo pairs. 
olar geom- 
chromatic 
the match- 
:s (Scholze 
btained by 
e segments 
illy incom- 
; are purged 
chromatic 
views. If a 
, its 3D po- 
to enhance 
D line seg- 
igure 5(c)) 
s described 
  
(c) 
air used for 
3D line seg- 
rements 
nts is a key 
different se- 
lished. The 
five possible labels form the set ©: 
1 
0 { w! ) = ridge, qO — gutter, 
3 
wi ) — gable, w® — convex, 
w® — concave (1) 
The names of the labels are chosen to be coherent, although 
they should not be taken literally. For example, a gutter 
segment just corresponds to the lower boundary of a patch, 
no matter if there is a gutter in the scene or not. Figure 
6 gives an overview. For convenience each semantic label 
(e.g. gutter) is represented by a variable w(*), where 
i encodes the actual label. In order to assign the seman- 
tic labels to the segments in a patch, geometric measure- 
ments are used. Measurements characterizing individual 
segments u; (unary attributes) and measurements between 
adjacent segments b;; (binary attributes) are distinguished. 
The actual measurements of 
1) 
ut length of segment 2 
ue slant angle of segment i relative to a 
horizontal plane 
po angle enclosed by adjacent (coplanar) 
segments ¢ and 7 
i : (signed) mid point height difference of 
adjacent segments ¢ and j (2) 
form the components of the attribute vectors u; = (ut? us”) 
1 2 
and bi; = (0,67). 
Ridge 
Gable 
  
Gutter 
Figure 6: Functional parts of a roof and their semantic la- 
bels. The label ridge is generally used for the upper bound- 
ary of a patch, gutter for the lower one. The boundaries of 
a patch are either labelled gable, convex connection or con- 
cave connection depending on the neighbourhood. 
3.3 Test Dataset 
A semi-automatically reconstructed test dataset has been 
used to learn the statistics of the geometric measurements. 
The dataset shows urban and sub-urban areas with four- 
fold overlap at an image scale of approximately 1:5000 
(IGP, 2001). Nearly 150 roofs, consisting of about 80 tri- 
angular and 270 quadrangular patches have been labelled 
manually using the semantic labels from Equation (1). Ad- 
ditionally to the semantic labels, the values of the geomet- 
ric measurements (unary and binary attributes) have been 
recorded. 
3.4 Initial Semantic Interpretation 
As will be described in the next Section, the patch recon- 
struction algorithm makes use of seed line segments for its 
search of patch hypotheses. Given the set of reconstructed 
3D line segments, a set of seed segments has to be derived 
which is used for the first pass of the reconstruction algo- 
rithm (cf. Figure 1(b)). In order to invoke the appropriate 
instantiation procedure, the label of the seed segment in a 
roof should be known in advance. Hence, the segments 
have to be attributed semantic labels from Equation (1). 
Of course, at this stage it is impossible to unambiguously 
assign a label to an arbitrary 3D line segment. Neverthe- 
less, class membership probabilities can be derived. Us- 
ing the distribution of only the unary attributes (line seg- 
ment length and slant) of the test dataset, the probability 
distributions for the respective labels are determined. For 
each class, the algorithm creates a list of seed line segments 
sorted by probability. 
4 PATCH RECONSTRUCTION 
Geometric patch reconstruction poses a twofold task. First, 
planar 3D line segment configuration corresponding to pla- 
nar roof patches are detected. Second, the outlines of patches, 
corresponding to the actual roof faces lying in these planes, 
have to be determined. 
4.1 Probabilistic Plane Detection 
For plane detection, the general approach is to rotate a half 
plane around a seed line segment, e. g. a tentative ridge line 
(cf. Figure 7). For each inclination of the half plane, the 
line segments in its neighbourhood which approximately 
lie in this plane are collected. To determine the number 
of planes and their positions, a Bayesian model selection 
procedure is applied. The plane model selection procedure 
maximizes the posterior probability 
p(m|x) — max (3) 
with x being the set of neighbouring 3D line segments 
an m the actual plane model. The plane model describes 
the number, positions and orientations of planes currently 
considered. Model selection is done taking into account 
model complexity. That is, a more complex model (con- 
taining more planes) may well describe the data better; but 
might also tend to over-fit the data. Consequently, a sim- 
pler model should be preferred if the more complex one 
does not describe the observations significantly better. This 
qualitative statement is quantified in form of an Ockham 
factor. A detailed explanation of the procedure is available 
in (Scholze, 2002). The result is a small number (usually 
1-4) of sets of plane hypotheses in the neighbourhood of 
a seed line together with their probabilities. The different 
sets of plane hypotheses correspond to different interpreta- 
tions of the scene. 
4.2 Maximum Expected Utility Patch Reconstruction 
During the Bayesian plane selection procedure, 3D line 
segments are associated with different plane hypotheses. 
—207— 
ec ES SES
1
2
...
220
221
222
223
224
...
640
641
Full text: Close-range imaging, long-range vision

Access restriction

Copyright

Note to user