Full text: XVIIIth Congress (Part B5)

ystems are able 
to the required 
le, and further 
acial structures, 
8. 
, D., Hallbauer, 
measurement of 
to the spine 
and Remote 
for small format 
1 5(86), pp. 325- 
n for automated 
| Archives of 
p. 265-272. 
1995. An 
CD for digital 
5(86), pp. 301- 
onal Computers. 
sing, 55(11), pp. 
  
A VIDEO-RATE STEREO MACHINE AND ITS APPLICATION TO VIRTUAL REALITY 
Kazuo Oda, Masaya Tanaka, Atsushi Yoshida, Hiroshi Kano and Takeo Kanade 
The Robotics Institute, Carnegie Mellon University 
5000 Forbes Avenue Pittsburgh, PA 15213 USA 
Commission V, Working Group 1 
KEY WORDS: Close Range, Real-Time Vision, Stereo, Virtual Reality, Z keying, Hardware, Computer Vision. 
ABSTRACT 
We have developed a video-rate stereo machine which has the capability of generating a dense range map at video rate. 
The CMU video-rate stereo machine has the following performance: 1) multi image input of up to 6 cameras; 2) throughput 
of 30 million point x disparity measurements per second; 3) frame rate of 30 frames/sec; 4) a dense depth map of 256 x 
240 pixels; 5) disparity search range of up to 60 pixels; and 6) high precision of up to 8 bits (with interpolation). The capa- 
bility of producing such a high resolution depth map (3D representation) at video rate opens up a new class of applications 
for 3D vision. We report one such application: z keying, which merges the real and virtual worlds in real time. 
1. INTRODUCTION 
Stereo range imaging uses correspondence between sets 
of two or more images for depth measurement. Despite a 
great deal of research during the past two decades, no 
stereo systems developed so far have achieved adequate 
throughput and precision to enable video-rate dense depth 
mapping. The throughput of a stereo machine can be most 
effectively measured by the product of the number of 
depth measurements per second (pixels/sec) and the 
range of disparity search (pixels); the former determines 
the density and speed of depth measurement and the lat- 
ter the dynamic range of distance measurement. There 
are several advanced real-time stereo systems (Nishihara, 
1990, Webb, 1993, Matthies, 1992, Faugeras et al., 1993); 
yet none of them is able to provide complete video-rate 
output of range as dense as the input image with low 
latency. 
We have developed a video-rate stereo machine which 
has the throughput of 30 million pixel?/sec. This through- 
put translates to a 200 x 200 x 5bit depth image at the 
speed of 30 frames per second - the speed, density and 
depth resolution high enough to be called a video-rate 3D 
depth measurement camera. Our video-rate stereo 
machine is based on a new stereo algorithm, the multi- 
baseline stereo theory (Okutomi et al., 1992, Nakahara 
and Kanade, 1992, Okutomi and Kanade, 1993). It uses 
multiple images obtained by multiple cameras to produce 
different baselines in length and direction. 
Video-rate stereo range mapping has many advantages. It 
is passive and it does not emit any radio or light energy. 
With appropriate imaging geometry, optics, and high-reso- 
lution cameras, stereo can produce a dense, precise 
range image. Stereo performs sensor fusion inherently; 
range information is aligned with visual information in the 
common image coordinates. Stereo depth mapping is 
scanless; thus it does not have the problem of apparent 
shape distortion from which a scanning-based range sen- 
sor suffers due to motion during a scan. These features of 
video-rate dense depth mapping open up a new class of 
applications of 3D vision. We report one such application: 
2 keying which merges the real and virtual worlds in real 
ime. 
2. CMU VIDEO-RATE STEREO MACHINE AND ITS 
PERFORMANCE 
The CMU video-rate stereo machine comprises special- 
purpose high-performance hardware. Table 1 summarizes 
its current performance. 
Table 1: Performance of CMU stereo machine 
  
Number of cameras 2106 
  
Processing time/pixel 33ns x (disparity range + 2) 
  
Frame rate up to 30 frames/sec 
  
Depth image size up to 256 x 240 
  
Disparity search range up to 60 pixels 
  
  
  
  
    
(a) Processor (b) Five-eye camera head 
Figure 1: The CMU video-rate stereo machine 
     
(b) range image 
(a) intensity image 
Figure 2: An example scene and its range image 
International Archives of Photogrammetry and Remote Sensing. Vol. XXXI, Part B5. Vienna 1996 
 
	        
Waiting...

Note to user

Dear user,

In response to current developments in the web technology used by the Goobi viewer, the software no longer supports your browser.

Please use one of the following browsers to display this page correctly.

Thank you.