Close-range imaging, long-range vision

g problem 
ns around 
)rganizing 
- character 
geometric 
movie-file 
ates some 
) captures 
cture with 
this gaol, 
d to: the 
geometric 
ın optimal 
jerties of 
2921); but 
re. 
er regions 
ature, and 
ion can be 
olor. This 
(position- 
n color in 
| particular 
o different 
is given by 
to obtain 
the image. 
model can 
y to have a 
pearing at 
ly, we can 
z a contour 
)ecause we 
ithematical 
the second 
boundaries 
crimination 
(to identify 
labelling) 
procedure, which is propagated for adjacent pixels following a 
like WTA algorithm. The recursive procedure is 
computationally expensive O(e") where exponential depends on 
the recursive order. We avoid it, by making only one sweep. To 
get labels for regions in a image context, we have adapted some 
basic ideas to the Kohonen’s Self-Organizing-Map (SOM). An 
iteration around image pixels (horizontal scanline p.e.), 
compares vector labels for each vector pixel (position and 
color), and evaluates a weigthed distance between them. 
Weigthing up the color, rather than the position, gives 
uniformity. Inversely, weigthing up the position, rather than 
color, gives consistence (connected regions). For each iteration 
in the pixel level, the region that minimizes the distance is the 
provisonal winner. In a fixed number of regions scheme, the 
vector of that region is brought near to the pixel’s vector. In a 
non-fixed number of regions scheme, if the value of the distance 
goes up to a thresholding value, a new region is created, else the 
winner’s region vector is modified. 
Transitions between adjacent regions vector are given by means 
of a linear averaged approach between two vectors, whose 
weights depend on the ‘size’ factor. Another said, if the region 
is small (‘has’ few pixels), the aproximation is more huge than 
if it ‘has’ much pixels. This algorithm allow us to define each 
region over an image surface with a linear cost. If we take a 
fixed number of regions, O(n) is always the same on two images 
equally sized, independent of the image complexity. By varying 
the threshold in the non-fixed number of regions version, the 
value of the color uniformity in a region is modified. In both 
cases, the time dedicated to the segmentation and merging 
process is smaller, than the contour segmentation. 
3. A clustering technique based in Computational 
Geometry 
Our proposal for clustering uses an adaptation of 3D Voronoi 
diagram linked to typical mean values for each region. Such 
typical mean value is given by a median which is locally 
computed following a winner-takes-all (WTA) criteria. In this 
way, we reduce the number of typical values for colors inside 
each view, and we generate convex regions with homogeneous 
color following a competitive propagation algorithm with centre 
each median. We have developed a computer implementation 
inspired by the behaviour of human retina. An example is given 
by the following iterative algorithm: 
INITIATE depending on the threshold (fixed by the user) 
FOR every pixel of the original image, DO. 
SELECT a representative element by a WTA typical 
competition scheme 
GENERATE a map of color classes with a 
representative pixel by region 
COMPUTE the smallest distance in a color map to 
the representative pixel 
IF this value is higher tham a maximal 
threhold, THEN 
CREATE a new region 
END IF 
UPDATE the map of classes 
END FOR 
FOR every pixel of the original image DO 
COMPUTE the smallest distance between the current 
pixel and the characteristic class of existing regions 
LABEL the region corresponding to the pixel as a new 
region in the map of representative or characteristic 
colors. 
END FOR 
Mean color intensities and their variances give us the regional 
distribution of color, and each region arising from this process 
is consequently labelled. By taking the dual construction of 3D 
Voronoi decomposition, we obtain a Delaunay simplicial 
decomposition, allowing us to generate a 3D skeleton. The 
resulting 3D Delaunay skeleton give us a symbolic 
representation for any vector representation of color. 
So, vertices of the Delaunay simplicial decomposition give us 
some kind of virtual attractors for the valley regions where one 
can find a predominant color, whereas edges of Voronoi 
diagrams represent conflict or peak values. In the same way, 
Voronoi vertices represent in fact repulsors for a dynamics 
based on proximity relations between median values. At peak 
values, we reinforce the discontinuity by using a weighted sum 
linked to a vector representation of color (preferably the IHS). 
The Delaunay decomposition does present optimal properties 
relative to the regularity (maximization of the minimal angle is 
proved in [Law77] ). Such optimal properties are inheritated by 
the regional decomposition ([Ede93]). Statistical stability of our 
model is an immediate consequence of optimality properties of 
Delaunay simplicial decompositions. Abrupt changes in 
illumination conditions appear as foldings or unfoldings of 
previously detected regions, but these changes can be managed 
in terms of similar operations linked to the chosen symbolic 
representation in terms of Delaunay simplicial decompositions. 
So, we obtain a robust model with strongly correlated variations 
even under abrupt changes of illumination. 
4. Mobile segmentation for sequences of images 
In the dynamical case, we apply the above algorithm for a 
spatio-temporal segmentation, merging and tracking of a movie- 
file. The choice of a Delaunay simplicial representation as a 3D 
skeleton allows us an easy update in terms of elementary 
transformations involving to folding and unfolding processes. 
We determine the deformation of merged images which is lower 
when the frames are taken near in time, 20 per second we 
compare the centroid movement for each merged region to 
determine the movement features, and we determine 
illuminance variations from the color differences between 
consecutive images. In this way, we extract qualitative 
properties of the relative motion, allowing us to discriminate 
between the motion camera and the motion of different agents 
in the scene, independently of the complexity of the scene and 
motion. 
If we adopt again a normalized RGB representation for the 
color, we may introduce a superimposed linear model, where 
the homogenous color regions are connected between them by 
edges of Delaunay simplices. The sites of the dual Voronoi 
configuration are the centers of influence regions or color 
organisers associated to minimum energy and maximal volume. 
The minimum energy represents the square of the typical 
deviation. By adapting some arguments of a dynamical 
interpretation ([Per90]), the volume can be interpreted as the 
temporal evolution of the anisotropic diffusion given by a 
partial differential equation. This partial differential equation 
gives the temporal evolution of the complex chromacity as the 
product of the divergence of normalized color by by the 
gradient of the complex chromacity. Hence, the superimposed 
Delaunay model is optimal, as it could be expected from general 
properties of Delaunay simplicial decompositions ([Ede93]). 
—153—
1
2
...
166
167
168
169
170
...
640
641
Full text: Close-range imaging, long-range vision

Access restriction

Copyright

Note to user