Full text: Proceedings, XXth congress (Part 3)

  
International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Vol XXXV, Part B3. Istanbul 2004 
  
of example shapes. In order to solve the pairwise 
correspondence problem they use a  polygon-based 
correspondence algorithm which locates a pair of matching 
sparse polygonal approximations, one for each of boundaries, 
by minimising a cost function using a greedy algorithm. While 
the algorithm works well with different classes of objects, it 
assumes that the objects are represented by closed boundaries. 
Furthermore, the algorithm was not tested on objects with 
multiple closed boundaries, e.g., faces. Recently Davies er. al. 
(2002) have described a method for automatically building 
statistical shape models by using the Minimum Description 
Length (MDL) principle. The MDL is obtained from 
information theoretic considerations and the model order is 
defined as the model that minimises the description length, e.g. 
the model that encodes the vector observations in the most 
efficient way (Walter, 2002). In their method each shape is 
mapped onto a corresponding sphere where a given number of 
landmarks is first selected. The positions of the landmarks are 
then altered by parameterisation functions before selecting the 
parameterisations that build the best model. The best model is 
defined as the one, which minimises the description length of 
the training set, and its quality is regulated by an objective 
function that evaluates the quality of the PDM. This is a very 
promising method for measuring the model quality of a 
statistical shape model and results show better PDMs via 
manual landmarking. However, due to the very large number 
of function evaluations this optimisation method is 
computationally expensive. 
3. OUTLINE OF THE METHOD 
In our system we used 20 close related grey images of human 
hands recorded using a digital camera at a resolution 
1024x768. Four people contributed with five images each of 
their right hand. In our experiments three main stages have 
been conducted: 
l. Image segmentation or outline extraction using 
thresholding and the Canny Edge Detector to obtain the 
foreground (shape of the hand). 
2. Freeman chain code  8-connectivity boundary 
descriptor to obtain automatically the coordinate of the 
boundary pixels and the direction of the boundary. 
Minimum Perimeter Polygon (MPP) is used to identify 
curvature descriptions. 
3. Point Distribution Model (PDM) to describe the hand 
shapes and their variations based on the position of the 
automatic landmark points. 
3.1 Extracting the hand shape 
We are interested in extracting the outline of the hand shape. 
The method to separate the shapes from the background was to 
select a threshold 
Te T[x y FG. )] (1) 
where f(x,y) is the grey level of point (x,y). The threshold 
image g(x,y) is defined as: 
Ah 
es CT 
where 1 and 0 corresponds to the distinction between the 
background and the foreground. For edge detection we used 
750 
the Canny Edge detector which find edges by looking for local 
maxima of the gradient of f(x,y). 
The gradient gx» z[G +G;] and edge direction 
a(x, y) - tan (Gy/Gx)are computed at each point. A 
vector T=[T1 T2] containing two threshold is used to find the 
strong and the weak edge pixel and an edge linking is 
performed by including the weak pixels to the strong. By using 
those two detectors we managed to extract the shape of the 
hand regardless of the grey level difference between the 
foreground and the background. 
3.2 Obtaining the Boundary coordinates of the shapes 
For contour-based shape representation and description we 
chose an 8-connectivity derivative Freeman chain code, which 
is based on the fact that an arbitrary curve is represented by a 
sequence of small unit length vectors and a predefined set of 
possible directions. During the encoding successive contour 
points are adjacent to each other. The chain code was used as a 
numbered sequence that represents relative directions of 
boundary elements measured in a counter-clockwise 45? 
direction changes. The representation is based on 8- 
connectivity and the direction of each component is coded by 
the numbering scheme seen in Figure 1. 
  
Figure l. Chain code in 8-connectivity. 
The chain code was used to derive the boundary length of the 
hand shapes and their direction. The entire vertical and 
horizontal steps have unit length while the length of the 
diagonal steps is X2 . We calculated the contour length of the 
chain code as the number of vertical and horizontal components 
plus J2 times the number of diagonal components. The 
diameter of the shape boundary b is defined as: 
D(b)=max[D(p;,p)] (3) 
HJ 
where D is the Euclidean distance measure between D; 
and p, and is defines as: 
  
D(ipjNGi-x *xj-vj? — (4) 
where (X Jp XJ) coordinates ofthe p; and p; points. 
One constrain of the chain codes is that in order to work 
properly the boundary should be a closed boundary, and a 
starting point should be defined. In order to choose an initial 
starting point on the closed boundary, which will have the 
associated parameter value, u=1 we searched where the vertical 
and the horizontal principal axes, the axes that pass though the 
centroid of the boundary points, intersect. The horizontal 
extension of the intersection meets the thumb, which was 
selected as the starting point of the chain code. It is assumed 
that this point is fixed for all shapes. This is reasonable since 
Intel 
  
We: 
Free 
dire 
anal 
thro 
con 
met! 
angl 
con 
poir 
this 
whe 
appt 
33 
The 
shar 
acti 
(19¢ 
unde 
deve 
num 
bon: 
3.3.] 
how 
denc 
land 
vect 
train 
3.3. 
plac 
the « 
and 
(sca 
Proc 
eta 
mea 
Figu
	        
Waiting...

Note to user

Dear user,

In response to current developments in the web technology used by the Goobi viewer, the software no longer supports your browser.

Please use one of the following browsers to display this page correctly.

Thank you.