Full text: From pixels to sequences

  
214 
OPTICAL FLOW COMPUTATION IN THE LOG-POLAR PLANE 
Kostas Daniilidis and Volker Krüger 
Computer Science Institute, Christian-Albrechts University Kiel 
Preusserstr. 1-9, 24105 Kiel, Germany 
email: {kd,vok}@informatik.uni-kiel.de 
KEYWORDS: space-variant sensing, conformal mapping, optical flow. 
ABSTRACT: 
Vision systems with spatially homogeneous resolution are not able to provide a real time response in a dynamically 
changing environment. A reactive behavior necessitates selective sensing in space. Such a selection can be accomplished 
by the combination of a space-variant resolution scheme and a sensor with controllable degrees of freedom. The field 
of view is split into a homogeneous high resolution area - the fovea - and the periphery with decreasing resolution. 
Both in neurobiology and in robot vision, models of the resolution decrease towards the image boundaries have been 
established. The most convincing model is the theory of logarithmic polar mapping. In this paper we propose two new 
methods for the estimation of the optical flow and its spatial derivatives in the log-polar plane. We study analytically 
and experimentally the effects of the polar deformation and the decimation due to subsampling on the computation of 
optical flow. 
1 Introduction 
This paper is concerned with the computation of optical flow in image sequences obtained with the logarithmic polar 
transformation. The log-polar transformation is a model for space-variant resolution in the periphery of the image. 
Space-variant sensing arises as a necessity in systems which must be able to process simultaneously a central region of 
interest (fovea) in detail for recognition tasks and a wide-angle peripheral view for detecting events and new candidates 
for gaze change. Uniform resolution in the peripheral part would result to a computational burden unacceptable for 
real-time reactive behavior. The prerequisite for space-variant sensing is active sensing that means the capability to 
control the gaze direction. The biologically motivated log-polar transformation has a second significant advantage: It 
is a very rich representation regarding recognition tasks (rotation and scaling invariance) as well as navigational tasks 
(ego-motion and time to collision estimation, motion detection). 
We first shortly describe the basic anatomy of the primate's space-variant sensing. The retina consists of three layers. 
In the first layer we find the photo-receptor-cells (PRC) that code the visual information in terms of nervous impulses. 
The PRC are connected either directly via bipolar-cells or indirectly via amacrin- bipolar- and horizontal-cells in the 
second layer to the retinal ganglion cells (RGC) in the third layer. An area of PRC on the retina that are combined 
by one single RGC is called receptive field (RF). PRC are nonuniformly spaced over the human retina. Their heightest 
density can be found within the fovea centralis, but their density decreases with increasing eccentricity. The spatial 
density of RFs on the human retina is related to the density of the PRC. The highest density of RFs is therefore found 
in the fovea where some RF even consist of only one PRC each. In the periphery the density of RF decreases whereas 
their center-size increases nearly linearly with eccentricity. This allows keeping the amount of visual information as 
received by z 10? cones and rods low enough to be processed by only ~ 10° RGC and optic nerve fibers. [Schwartz, 
1977] proposed the complex logarithmic mapping for the retinotopic mapping of the RGC onto the first area of the visual 
cortex (V1). [Weiman and Chaikin, 1979] studied first the properties of the complex logarithmic transformation as a 
conformal mapping and they proposed a logarithmic spiral grid as a digitization scheme for both image synthesis and 
analysis. 
The goal of this paper is to study what kind of information is still preserved after the log-polar transformation which 
can be used for motion tasks. We use the optical flow as an intermediate step and study analytically and experimentally 
the effects of the polar deformation in sec. 3 and the logarithmic subsampling in sec. 5. In particular, 
e we prove that the polar transformation introduces fictitious gray-values curvature that leads to an erroneous 
elimination of the aperture problem, 
e we propose two new methods for the optical flow estimation in the log-polar domain that are superior to methods 
directly transferred from the cartesian domain and we experimentally study their performance in a real sequence, 
e we propose a basis for an analysis of the logarithmic subsampling that allows spectral techniques for the design of 
the necessary low-pass and gradient filters. 
We use (z,y) for the cartesian coordinates and (p,m) for the polar coordinates in the plane. By denoting with 
z — rt jy — pe?" a point in the complex plane the complex logarithmic (or log-polar) mapping is defined as 
w = In(z) (1) 
for every z # 0 where Re(w) = Inp and Im(w) = n + 2kx. To exclude the periodicity of the imaginary part we constrain 
the range of Im(w) to [0, 27). The complex logarithmic mapping is a well-known conformal mapping preserving the angle 
of the intersection of two curves. It is trivial to show that every scaling and rotation about the origin in the z-plane is 
represented in the w-plane by a shift parallel to the real and imaginary axis, respectively. 
IAPRS, Vol. 30, Part 5W1, ISPRS Intercommission Workshop “From Pixels to Sequences”, Zurich, March 22-24 1995
	        
Waiting...

Note to user

Dear user,

In response to current developments in the web technology used by the Goobi viewer, the software no longer supports your browser.

Please use one of the following browsers to display this page correctly.

Thank you.