Full text: XVIIIth Congress (Part B3)

   
n method is 
erent scales 
cting possi- 
in order not 
ites. Imple- 
ng factor. 
m based on 
re our main 
ficient and 
symbols of 
'jews previ- 
yutlines our 
letails. Sec- 
d Section 6 
)RK 
for symbol 
of previous 
ion in gen- 
studied, the 
suitable for 
cy, correct- 
he previous 
ext section 
:h. 
ably one of 
ny arbitrary 
ard, 1981). 
rform map 
1ally gener- 
to complex 
1986). The 
as its com- 
th the com- 
he memory 
required can quickly become unmanageable. 
A lot of existing symbol recognition systems use 
invarlant features for shape recognition. In (Wu, 1986), 
Wu and Stark present an algorithm to recognize an object 
in an image regardless of the object’s size and orientation. 
They mainly use two features: a circular harmonic func- 
tion that is rotation invariant and the Mellin transform 
which is scale invariant. Other popular features are the 
UNL Fourier features (Rauber, 1994), Wavelet transform 
(Antonini, 1992), moment invariants (Reeves, 1988 and 
Reiss, 1991) and Fourier descriptors (Jun, 1986 and Lin, 
1987). 
A set of features is usually used to classify symbols. 
Since the instances of each symbol may vary, several 
instances of each symbol are required in order to build a 
representative training set that will produce reasonable 
recognition rates. This training set is constructed dynami- 
cally with the help of the user. At first, the user will inter- 
actively correct the recognition when necessary. All the 
symbols identified and corrected by the user will help con- 
solidate the classification. Once the user is satisfied with 
the classification, the system can then run stand-alone 
without user interaction. 
A good example is the system by Samet and Soffer 
(Samet, 1994). It uses the (user-located) legend to initial- 
ize a table of models to be found in the rest of the image. 
Then the system divides an image into its constituent ele- 
ments using a connected component labeling algorithm. 
For each region a set of features is computed. The system 
uses global and local shape descriptors identified as fea- 
tures that best discriminate between geographic symbols 
(Levine, 1982). These features are invariant to scale, ori- 
entation and translation. 
In general, however, a feature based approach is hard 
to implement when symbols can overlap or touch each 
other. Segmentation must then be performed before fea- 
ture calculation and the resulting problems can rapidly 
become very complex. In (Gorman, 1988), Gorman et al. 
use a dynamic programming method to attack the segmen- 
tation problem. Different segmentation possibilities are 
tried depending on what is most likely to be recognized so 
far. This method is mostly used for recognizing complex 
shapes. For example, if one wishes to distinguish an 
image of an airplane from that of a train and only a wing 
with a reactor is showing, the system will easily recognize 
the object as an airplane. However, it is not clear if the 
method would yield good results if used to recognize very 
small and similar symbols. 
Several other authors have suggested approaches 
using almost only heuristics. Kasturi has worked on the 
problem of separating text from graphics (Kasturi, 1986, 
Fletcher, 1988 and Kasturi, 1990) and also on locating 
simple symbols consisting mainly of small loops like rect- 
angles, circles, etc. 
Yu et al. (Yu, 1994) and Shimotsujia (Shimotsuji, 
1990) rely on simple heuristics to perform segmentation. 
Based on features, shapes like lines, circles and arcs are 
recognized and grouped. A simple matcher is used to per- 
681 
International Archives of Photogrammetry and Remote Sensing. Vol. XXXI, Part B3. Vienna 1996 
    
   
   
   
   
   
   
   
   
   
   
   
   
   
    
  
   
  
   
    
   
   
   
      
  
   
  
   
    
       
  
  
   
    
  
    
   
    
    
     
    
  
     
    
    
     
     
    
    
   
   
     
   
   
   
form recognition. The user can input any type of symbols. 
Okazaki er al. (Okazaki, 1988) described a complete 
system for electric circuit symbol recognition. Most sym- 
bols in an electrical circuit diagram contain at least a 
closed loop. An interesting feature about this system is its 
hybrid approach. It first computes a set of features to iso- 
late loops of interest. Then it uses another set of features 
to recognize the exact symbol. This recognition is medi- 
ated with a pattern matching attempt on the same symbol. 
The two recognition schema collaborate in order to find 
the right symbol. 
Heuristic methods have several problems. First, they 
are not general. They can be used to find only a small set 
of symbols. Their major disadvantage is that they are very 
sensitive to noise. If a symbol is incomplete, blurred, 
touching or overlapping another one, it might be missed. 
3. OUR APPROACH 
The Hausdorff distance measures the degree of mismatch 
between two sets of points by measuring the distance of a 
point of one set that is farthest away from any point of the 
other, and vice versa. Formally, the directed Hausdorff 
distance h between a model M and an image / at a specific 
point in the image is: 
max ( min : 
hM, I = [' (distance (m, 59) (1) 
me M\iel 
The Hausdorff distance H is defined as: 
HM, I) 2 max(h((M,D,h(1, M))) (2) 
This distance can be used to determine the degree of mis- 
match between two objects that are superimposed on one 
another. 
One of the most interesting properties of the Haus- 
dorff distance is that it obeys metric properties. The func- 
tion is positive everywhere and has the properties of 
identity, symmetry and triangle inequality. These proper- 
ties correspond to our intuitive notions of shape resem- 
blance, namely, that a shape is identical only to itself (and 
not one having identical features), the order of compari- 
son of two shapes does not matter, and two shapes that are 
highly dissimilar cannot both be similar to some third 
shape. This final property (the triangle inequality) is par- 
ticularly important in pattern matching applications in 
which several stored model shapes are compared with an 
unknown shape. Thus, two highly dissimilar models can- 
not both be similar to an unknown symbol. 
However, the distance threshold alone is not good 
enough to discriminate between good and bad matches. 
When using a small (tight) distance threshold, a lot of 
poorly drawn symbols can be missed. This is undesirable 
in some applications. On the other hand, a lot of false
	        
Waiting...

Note to user

Dear user,

In response to current developments in the web technology used by the Goobi viewer, the software no longer supports your browser.

Please use one of the following browsers to display this page correctly.

Thank you.