Full text: The 3rd ISPRS Workshop on Dynamic and Multi-Dimensional GIS & the 10th Annual Conference of CPGIS on Geoinformatics

ISPRS, Vol.34, Part 2W2, “Dynamic and Multi-Dimensional GIS", Bangkok, May 23-25, 2001 
A NEW STEREO MATCHING APPROACH USING EDGES AND 
NONLINEAR MATCHING PROCESS OBJECTED FOR URBAN AREA 
Mitsuteru SAKAMOTO, Wei LU, Pingtao WANG 
Asia Air Survey Co., Ltd. R&D Department 
8-10, Tamura-cho, Atsugi-shi, Kanagawa 243-0016, Japan 
Tel:+81-46-295-1886, Fax: +81-46-295-1934, E-mail: mi.sakamoto@ajiko.co.jp 
Yukio KOSUGI 
Frontier Collaborative Research Center, Tokyo Institute of Technology 
4259, Nagatsuta-cho, Midori-ku, Yokohama 226-8503, Japan 
Tel: +81-45-924-5466, E-mail: kosugi@pms.titech.ac.jp 
KEYWORDS: Stereo Matching, Nonlinear mapping, Principle of Coincidence Enhancement, Building Detection, Edge Matching 
ABSTRACT 
Automatic recognition of buildings has been in demand for efficient digital mapping process or updating of existing information in 
geographical information application. To deal with this subject, effective stereo matching technique that is applicable to urban area is 
necessary. However conventional stereo matching techniques that make use of area-based, point-based or edge-based matching 
cannot generate satisfactory results for urban area because of occlusion or inability of recovering from mismatching. 
In this paper, a new stereo matching technique with combinational use of edges and nonlinear mapping is introduced. This approach is 
based on process called Coincidence Enhancement Method (CEM) that consists of competition and consensus operation [4]. Edge 
segments, derived from edge detection and matching process, are used as initial information or constraints for CEM process and thus 
improvements of CEM is achieved. 
Experiments on proposed approach in this paper, edge detection, edge matching, CEM and CEM with edge support were executed with 
stereo aerial imageries objected for urban area. As a result, it was ascertained that proposed approaches of CEM with edge support 
was efficient for improving of the matching results surrounding building’s boundary. 
1. INTRODUCTION 
Up till now mapping processes of ground objects with aerial 
stereo image pairs still depend on professional operator and 
needs tremendous time and costs. While new systems known as 
Digital Photogrammetric Workstation (DSW) has been presented 
past several years, which has more effective functions for low 
cost mapping, it has not been realized to detect or map ground 
objects automatically. Computer vision is most widely studied 
technology and is becoming more and more promising thanks to 
easier availability of auxiliary information, such as height 
information from laser profiler finder or SAR. However, for urban 
area where occlusion happens frequently, the most fundamental 
technology such as stereo matching still face great difficulty, so 
is edge detection process, which output is one of the most 
crucial information for reconstruction of building model. 
Edges that can be extracted with various kinds of filters are the 
most fundamental information for automatic recognition of 
manmade structures. If the structural boundary of a ground 
object can be extracted in the form of edge information, it will be 
not a much difficult job to reconstruct the 3 dimensional model of 
the original structure. There have been numerous researches 
regarding efficient and reliable edge detection. However, 
because of the limitation of image media, none of the acclaimed 
algorithms can produce complete and error free results. 
Coincidence Enhancement (CE), being an extension of Hebb’s 
rule, is a self-organizing process of neural network modeling. 
This process can be modeled by the principle of competition and 
consensus [4]. CE model can realize smooth projection between 
input signals and output pattern, which means that when the 
majority of the initial values are correct, the minority of 
erroneous data can be absorbed. This effect is very useful in 
reducing the wrong stereo matching result caused by local 
minimum. The key factor for CE model to function correctly is to 
ensure the reliability of the initial data, which is not an easy job 
in the case of computer vision, especially when dealing with real 
world images. 
Recently, we have been developing a DSW system, and 
constantly making improvement for this system [1]. Even though 
this system has many superior functions, such as the ability of 
handling imageries of several hundred mega bytes at high 
speed even on personal computers of middle range cost, 
various digitizing function and so on, almost all of the processes 
still depend on manual operations. Our ultimate goal for the 
system is to introduce as many as possible automatic 
processing modules of high reliability to reduce the cost of 
production and increase the quality of digitized information. 
The algorithms proposed in this paper aim at improving the 
precision and reliability stereo matching process by introducing 
CE process that is constrained by reliable edge information. The 
first section gives a general overview of the proposed 
algorithms. The second section describes in detail the improved 
algorithms for extracting reliable edge information. The third 
section describes the enhanced CE process with the reliable 
edge information functioning both as initial value and constraints 
during enhance process. Experiments have been conducted and 
the results show that the proposed algorithms are capable of 
improving the precision and reliability of stereo matching 
process. 
2. OVERVIEW OF THE PROPOSED ALGORITHMS 
2.1 Objectives and Tasks 
Our objectives are to improve the reliability and precision of 
stereo matching for automatic building recognition. The reliability 
is improved by using highly reliable edge information that is 
extracted through strict constraints. The precision is improved by 
using neural network enhanced with edge constraints. The 
background and outline of the two strategies are as follows. 
Edges are fundamental components of building. In this study, 
edge mainly servers for three purposes. One is for initialization 
of global search for stereo matching, which is equal to general 
registration in photogrammetry. The other is for constraining the 
local matching in CEM, whose brief description will be given 
below and the details will be given in section 4. The third one is 
for maintaining collinear condition during mapping between 
stereo image pairs. 
Since edge information extracted by conventional approaches 
tends to be noisy and unreliable, our strategy is to only detect of
	        
Waiting...

Note to user

Dear user,

In response to current developments in the web technology used by the Goobi viewer, the software no longer supports your browser.

Please use one of the following browsers to display this page correctly.

Thank you.