The role of models in automated scene analysis

torlegård, kennert
Zisserman - 8 
3. Compute 3D points X, by intersecting backprojected rays using 
P = [I| 0] P' = [M'| e'] 
where I is the 3x3 identity. 
Projective structure can be utilised directly for certain visual tasks. For example, it 
is possible to distinguish points in 3-space on either side of any plane (Binary Space 
Partition) [22]. 
Much of the current research is aimed at reducing this projective ambiguity to 
affine or Euclidean (metric) so that H is an affine or Euclidean matrix. Of course, 
structure can be “upgraded” to affine by identifying parallel lines/planes or sup 
plying ratios of lengths of collinear segments or areas. Similarly, structure can be 
promoted to Euclidean by identifying perpendicular lines, for example, or supplying 
length ratios. However, in computer vision the aim is to achieve this upgrading by 
constraints available on the motion or camera without resorting to scene specific 
knowledge. 
2.3.1 Affine and Metric Structure 
In order to upgrade 3D projective geometry to metric certain geometric objects 
have to be identified, namely the plane at infinity 7(affine geometry) and the 
absolute conic Doo (Euclidean geometry) [24]. The image counterparts of these 
entities for a pair of views are the infinite homography Hqo (for affine reconstruction), 
and the dual of the image of the absolute conic in each view, K and K' (for metric 
reconstruction). The infinite homography is the 3x3 point transformation matrix 
for vanishing points, i.e. v' = HqoV, where v and v' are corresponding vanishing 
points. Having the dual (i.e. the inverse) of the image of the absolute conic K, is 
equivalent to knowing the internal parameters of the cameras, i.e. the 3x3 upper 
triangular camera calibration matrix C, since K = CC T . Once Hqo is known, an affine 
reconstruction can proceed from the projection matrices P = [l|0], P' = [Hoo|e']. 
Also, the infinite homography between two views constrains the camera calibration 
matrix [10, 16] by: 
K' = H„KH^ (3) 
Affine structure can be utilised directly for certain visual tasks. Affine constructions 
such as centroids and mid-points have been used for control of fixation by track 
ing object centroids [21], and obstacle avoidance by servoing on mid-points of free 
space [3]. 
A number of methods have been developed for obtaining the Hqo and K image 
entities: 
1. Affine structure can be recovered from two uncalibrated images with pure 
(but unknown) translation between the views and unchanging (but unknown) 
internal parameters (i.e. K' = K) [18]. In this case Hqo = I. Subsequent rotated 
views can then be used to generate linear constraints on the internal camera 
parameters via equation (3), K = HoqKH^ [1].
1
2
...
191
192
193
194
195
...
201
202
Full text: The role of models in automated scene analysis

Access restriction

Copyright

Note to user