Photogrammetric computer vision and image analysis: Papers accepted on the basis of peer-reviewed full manuscripts

paparoditis, nicolas
Images 
YIQ color 
space 
Udine detection tech- 
>hown in gray-scale). 
/ building mask. 
h of its point in the 
y a completely black 
if its point in the sec- 
f a completely white 
•imary building mask 
•eturns below X), and 
ate filled areas from 
>ove the same height 
ated masks for a test 
imary building mask 
wed to obtain these 
;k shapes in M p are 
d extended. Finally. 
n edge detector and 
minimum building 
ected on each curve 
t al., 2009). On each 
i corner and an end- 
rs are not available, 
In order to properly 
lding edges, a least- 
lied. With each line 
i-point’ indicates on 
:d. In older to avoid 
detected tree-edges, the mean of sigma of the NDVI value T is 
calculated on both sides of each line segment. If it is above a 
threshold T n dvi — 04 for any side, the line segment is classed as 
a tree-edge and removed. 
In the second step, the line segments are adjusted and extended. 
The adjustment is based on the assumption that the longer lines 
are more likely to be building edges. In an iterative procedure 
starting from the longest line and taking it as a reference, the 
angle between the reference and each line in its neighbourhood is 
estimated. The lowest rotation angle 0 r is recorded for each line 
over all iterations (for all long lines taken as references). After the 
iterative procedure, each line is rotated with respect to its centre 
by 0 r - If a rotation angle is not recorded for a line it is removed 
as a tree-edge. Each adjusted line is then extended iteratively by 
considering the percentage of black pixels (more than 70%) and 
applying the NDVI threshold to the building side. 
Finally, initial buildings are formed among the extended line seg 
ments. In an iterative procedure, an initial building position is 
detected using the first longest line segment, another using the 
second longest line segment and so on. Before detecting a rect 
angle using a line segment in each iteration, the line segment is 
first tested to ascertain w'hether it is already in a detected building. 
In order to detect an initial building on a line segment, an initial 
rectangle (of width 1.5m) is formed on the building side and then 
three of its sides are extended outwards with respect to P, n us 
ing the same technique applied to extend the extracted lines. Fig. 
3(a) shows the initial detected buildings on the test scene. 
3.3 Final Buildings 
The final building positions are obtained from their initial posi 
tions by extending each of the four sides. Image colour informa 
tion and the two masks M p and M s are considered during the 
extension. The colour information is basically used to extend the 
initial positions; M p is used to avoid unexpected extension of an 
initial position over more than one actual buildings, and M s is 
used to avoid unexpected extension of an initial position beyond 
the actual building roof. 
An initial building position may go outside the actual building 
roof due to a misregistration between the orthoimage and the LI- 
DAR data. In order to avoid this, since the initial position will be 
extended outwards while obtaining the final position, its length 
and width are reduced by 15% before extension. For each re 
duced building position ABCD. the dominant colour threshold 
pairs are estimated using colour histograms for intensity Y. hue 
/ and saturation Q, respectively. Each dominant colour threshold 
pair indicates a range denoted by its low l and high h values. 
There may be overlaps between the detected initial positions. It 
is hard to decide which overlap is unexpected and which is natu 
ral. If an initial building is completely within an already extended 
building or building part, it is removed assuming that it is an un 
expected overlap. Otherwise, it is extended assuming that it is a 
natural overlap. 
The initial building positions are sorted in descending order of 
their length or area, since both of these sorted lists were found 
to offer the same performance. Starting from the initial build 
ing having the longest length or largest area, its four sides are 
extended outwards separately. While extending each side in an 
iterative procedure, the percentages of black pixels in both M p 
and M s and of dominant colour components within the estimated 
colour threshold pairs are estimated. The side is extended if per 
centages of black pixels are above 90% and those of dominant 
colour components are above 40%. Fig. 3(b) shows the final 
detected buildings on the test scene. 
(a) (b) 
Figure 3: (a) Initial and (b) Final buildings. 
4 PROPOSED EVALUATION SYSTEM 
The proposed threshold-free evaluation system makes one-to-one 
correspondences using nearest centre distances between delected 
and reference buildings. The reference buildings are obtained us 
ing manual measurement from the orthoimagery. Altogether 15 
indices are used in three categories (object-based, pixel-based and 
geometric) to evaluate the performance. Most of these have been 
adopted from the literature and the rest are proposed for a more 
complete evaluation. 
4.1 Detected and Reference Building Sets 
For evaluation, two sets of data were used, in which each building 
is represented either as a rectangular entity, for T shape building, 
or a set of rectangular entities, for ‘L’, ‘U’ and ‘C’ shapes. The 
first set Bj — where 0 < i < m and m is the number 
of detected rectangular entities, is known as the detected set. It 
is obtained from the proposed detection technique. Each entity 
hd.i is an array of four vertices and the centre of a rectangular 
detected entity. The second set B r = {b r .j}. where 0 < j < n 
and n is the number of reference entities, is termed the reference 
set. It is obtained from manual building measurement within the 
orthoimagery. Each entity b r .j is an array of four vertices and the 
centre of a rectangular reference entity. 
To find the reference set B r , manual image measurement is used. 
Any building-like objects above the height threshold Ty, are in 
cluded in B r . As a result some garages (car-ports) whose heights 
are above T), are also included, but some building parts (veran 
das) whose heights are below Th are excluded. Different building 
parts are referred to separate rectangular entities. Consequently, 
there is one entity for T shape, two entities for ‘L’ shape, three 
entities for ‘U’ shape, four entities for ‘C’ shape and so on. 
4.2 Overlapping Sets 
It is natural that different rectangular entities of the same building 
overlap each other. In B r , two overlapping entities must always 
belong to the same building and represent two connected building 
parts (Fig. 4(a)). Such an overlap is defined as a natural overlap 
and for identification purposes a building identification number 
bid is assigned to each reference entity, this being stored in b r .j, 
in addition to the four vertices. Entities of the same building are 
assigned the same bid, but those of the different buildings are 
assigned different bid values. 
In Bd, the situation is different. Here two overlapping entities 
may belong to the same building and represent two connected 
building parts. In such a case, this overlap is a natural overlap 
(Fig. 4(a)) and it is not counted as an error in the proposed eval 
uation. In all other cases, the overlap is counted as an error in 
the evaluation system. For example, the overlapping entities may 
represent the same building (multiple detection, Fig. 4(b)) or con 
stitute combinations of true and false detections (Figs. 4(c)-(e)). 
51
1
2
...
62
63
64
65
66
...
324
325
Full text: Papers accepted on the basis of peer-reviewed full manuscripts (Part A)

Access restriction

Copyright

Note to user