XVIIth ISPRS Congress: XVIIth ISPRS Congress

fritz, lawrence w.; lucas, james r.
  
to vague for the purpose of high-level analysıs 
such as matching and object recognition. 
Structure is used to represent the 
interrelationship of the boundaries, which 
provides the basis for many kinds of tasks in 
computer vision, specially for matching, object 
recognition purpose. 
3-D structural description of models. The 
representation scheme for 3d world is the 
typical topic in symbolic artificial intelligence 
and computer graphics. A structural 
description of an object consists of the 
descriptions of its parts and their 
interrelationships. The parts of an object can 
be primitive (nondecomposeable) or they may 
be further broken down into subparts. When 
the parts of an object are not primitives, the 
structural description of the object consists of 
one level of descriptions for each level of 
subparts. Such a multilevel description is 
called a hierarchic description and is useful 
for complex objects with many repetitions of 
parts and subparts. 
Operations on one laver 
By "operations on one level", we mean that input 
and output of the operation are in the same format, 
e.g., from raster image to raster image, from vector 
data to vector data, etc. During the processing, the 
content of representation may change. 
operations on the original raster images. These 
operations may include 1) calculating the 
characteristics of image (e.g. histogram 
transformation, etc); 2) image quality 
improvement (e.g. enhancement, noise 
suppression by filtering, etc). 
operations on the segmented images. Split- 
and-merge is the main mechanism in the 
segmentation procedure, which merges small 
regions into more meaningful big regions, or 
split the big regions into small regions if 
necessary. 
operations on the vector data. This refers to 
the line fitting or curve fitting algorithm, 
which reduces the data needed while keeping 
the result as closed as possible to the original 
data. To be useful for high-level analysis, 
these vector data must be approximated so as 
to overcome local noise, and be represented in 
a more manageable form. The more 
comprehensive that representation is, the 
better the performance of the analysis would 
be. 
Operations between the layers 
The operations under this category changes or 
602 
transfers the data from one representation to another 
representation, which are the essential parts of 
image analysis. 
operations between the original image and 
segmented image. These operations are 
generally called segmentation which usually 
is in two kinds of forms: a), edge detection 
and line following. This category of 
techniques study various of operators applied 
to raw images, which yield primitive edge 
elements, followed by a  concatenating 
procedure to make a coherent one 
dimensional feature from many local edge 
elements; b), Region-based methods, which 
depend on pixel statistics over localized areas 
of the image. Regions of an image 
segmentation should be uniform and 
homogeneous with respect to some 
characteristic such as grey tone or texture. 
Region interiors should be simple and without 
many small holes. Adjacent regions of a 
segmentation should have significantly 
different values with respect to the 
characteristic on which they are uniform. 
Boundary of each segment should be simple, 
not ragged, and must be spatially accurate 
[Haralick]. 
operations between the segmented image and 
vector data. This so-called vectorization 
procedure traces along the each region 
boundary to get the boundary position and 
the position is represented in chain code, 
which is used later by shape analysis. On the 
other hand, in order to integrate shape 
constraint into segmentation, there is another 
information flow which transfers the result of 
curve fitting into the region growing. The 
principle of encoding shape is described in 
section 3. 
operations between the vector data and 2d 
structural description. These operations build 
the structural description by performing a 
geometric analysis on the vector data. Vector- 
based perceptual grouping can be also 
included in this category, which organizes the 
fragmented  low-level descriptions into 
meaningful higher level descriptions by 
mimicking the human visual system in 
detecting geometric relationship such as 
collinearity, parallelism, connectivity, and 
repetitive patterns in an otherwise randomly 
distributed set of image elements, some 
relevant work can be referred to [Mohan]. 
operations between the 2d and 3d structural 
descriptions. Matching two or more than two 
images of the same scene from different 
viewing positions in order to recover the 
al
1
2
...
611
612
613
614
615
...
1000
1001
Full text: XVIIth ISPRS Congress (Part B3)

Access restriction

Copyright

Note to user