Full text: XIXth congress (Part B3,2)

| aspects 
ie space of 
rategies in 
instance be 
omposition 
arallel line 
sider many 
ice of the 
jects under 
1g complex 
/hen we list 
sually there 
within the 
id. he or she 
. of concern 
' the search 
s, Euclidian 
etter search 
rve. It may 
andles such 
to mention, 
: of a subset 
ntified after 
uses, that fit 
ight, length, 
| successive 
use spacing 
ss and much 
  
Eckart Michaelsen 
  
Symmetry: This relation has drawn remarkably less attention in the pattern recognition community then in psychology. 
Gestaltists usually found it to be very important in figure-background discrimination. Maybe this lack of interest is just 
another fear for difficulties, because searching a data set for axis of symmetry is a very non-local task involving 
correspondence hypothesis, and thus may again demand high computational complexity. 
3 PROBLEMS 
Transferring such ideas of perceptual grouping as they are presented by psychologists into code that will run in 
admissible time and with satisfying results on non-trivial data is difficult. So following list of problems is not meant to 
be complete and the reader is invided to add his or her own items from their own experience. Also the order does not 
necessarily reflect our assessment on the impact of the problems. 
1. 
Likelihood of Appearance of Modelled Features: Usually polyhedrons are used for the geometric aspect of the 
models and straight line segments are extracted from the image data. There are a couple of questionable 
assumptions in the match or identification or correspondence between these. Some of the modelled features will 
appear, but if they do, they do it for different reasons: 1) A figure-background contour will appear, if the 
reflectivity of the background is different from that of the object, or if there is shadow cast on the background and 
not on the object. The left hand side of Fig. 2 exemplifies that such contours are frequently missed. 2) Inner 
contours will appear, if there is a change either in reflectivity or in reflection (mostly due to different surface 
orientation with respect to the lighting). The former is relatively rare with buildings. Usually a difference in 
reflectivity is assumed in road recognition — although it is quantitatively unknown. The later will fail for both 
complete rows in Fig. 2 if the sun is perpendicular to them. Existing geometric models of objects of interest such as 
buildings or vehicles have often been assembled for different purposes - not for visual recognition. They tend to fail 
because many of the features modelled do not appear in image (e. g. if free-formed surfaces are modelled by 
triangles). 
Insertion of Unpredictable Objects: In nearly every aerial image You'll find surprising objects You'd never 
thought about. Every person modelling a scene will also be able to name several object classes neglected by the 
model. In Fig. 2 for example also trees, garages, cars e.c. appear, although the model only captures simple gabled 
houses and roads. Usually the majority of the features will not origin from the objects modelled. Some of the 
objects not modelled but present in the scene might be similar to parts of the model. 
Occlusiuon: Many Features demanded by a model based recognition algorithm may not appear in the data because 
they are occluded or partially occluded by other objects. This is extremely difficult with oblique views. But also in 
example like Fig. 2 partial occlusion is evident. Especially large portions of the road contour are hypothesised 
instead of measured. 
Determination of an Adequate Feature Subset: From problems 1 and 2 we learned, that usually only a subset of 
the features are present and we do not know which. Since we need e. g. some 20 features for discrimination from 
arbitrary background the Model should have some 50 features. The set of all subsets of between 20 and 30 elements 
of a set of 50 elements is astronomically large. It is not sufficient to specify a threshold for the percentage or 
absolute number of features, because for instance a small number of ‘the right’ features will be enough to stabely 
infer the presence of a modelled object, whereas a larger number of ‘unimportant’ features will not do. 
3D-2D Invariance: Many properties of geometric models such as angles, measures, topology, are not invariant 
under perspective projection. Usually one geometric model may either be used for matching with 3D-features 
gathered by stereo methods or it may be transformed via hidden line or any other rendering into many appearances 
or aspects, which are used for 2D matches. Then a single 3D model generates possibly hundreds of 2D appearance 
models. Matching become instable with such many templates. For this reason we prefer working with 3D-models 
and with 3D-features (like in Fig. 2). Doing this on aerial images demands correctly calibrated, overlapping image 
sets. Also correspondence errors may lead to wrong 3D-features. 
Erroneous Early Decisions: Often there are alternatives in the correspondence choice between a certain model 
part and a certain subset of the features. For instance in situations like in Fig. 2 there will often be more then one 
line that fit — together with some other already identified lines - into the model of a rectangular part of a roof. One 
might be tempted to only accept the ‘best’. But at this stage of analysis this will be a local criterion. Later, when the 
house-row is established, another line might fit better. Moreover if such early decisions are made on local criteria, 
the whole outcome of the search depends on the features and models it used in the begin. Often the correct solution 
will not be found. On the other hand, if every alternative is kept, the computational effort and demand for memory 
will grow very badly with the size and hierarchical depth of the model. The following problem point clarifies this. 
Combinatorial Growth: Let us again consider the house-row example: Let the probability for missing lines be 
0.25 and the probability for the presence of two competing line instantiations be as well 0.25. So each rectangle 
will in the average have one segment missing and one double. This gives 2 alternatives for each rectangle and little 
  
International Archives of Photogrammetry and Remote Sensing. Vol. XXXIII, Part B3. Amsterdam 2000. 579 
 
	        
Waiting...

Note to user

Dear user,

In response to current developments in the web technology used by the Goobi viewer, the software no longer supports your browser.

Please use one of the following browsers to display this page correctly.

Thank you.