Photogrammetric computer vision: Papers accepted on the basis of peer-review full manuscripts

kalliany, r.; leberl, franz w.
ISPRS Commission III, Vol.34, Part 3A .Photogrammetric Computer Vision*, Graz, 2002 
2. BACKGROUND 
2. The Hough Transform 
Hough (1962) introduced a method for parameters estimation 
by way of a voting scheme. The basic principle behind this 
approach was to switch the roles of parameters and spatial 
variables. Hough transform is usually implemented through an 
accumulator array, which is an n-dimensional, discrete space, 
where n is the number of parameters under consideration. In 
this array, the cell with the maximum number of hits yields the 
parameters we are looking for. The variables contributing to the 
peak in the accumulator array can be tracked and identified. For 
more details, the reader can refer to (Leavers, 1993). 
2.2 The Modified Iterated Hough Transform (MIHT) for 
Robust Parameter Estimation 
Hough transform can be modified and used to estimate the 
parameters of a mathematical model relating entities of two data 
sets. In this approach, we assume no knowledge of 
correspondence and do not require complete matching between 
entities. As a result of the parameter estimation, the 
correspondence is implicitly determined. The method is 
outlined as follows. 
First, a hypothesis is generated that an entity in the first data set 
corresponds to an entity in the second one. The correspondence 
between conjugate entities of the data sets is expressed by a 
mathematical function. Using the hypothesized match, this 
mathematical function yields an observation equation(s). The 
parameters of the mathematical relation can be estimated 
simultaneously or sequentially, depending on the number of 
hypothesized matches simultaneously considered. All possible 
entity matches are evaluated, and the results (parameter 
estimations) are represented in an accumulator array. The 
accumulator array will exhibit a peak at the location of the 
correct parameter solution. By tracking the matched entities that 
contributed to the peak, the correspondence is determined. 
The number of parameters being simultaneously solved for 
determines the dimension of the accumulator array. In order to 
solve n parameters simultaneously, one must utilize the number 
of hypothesized entity matches needed to generate the required 
n equations. However, this approach is not practical. 
Simultaneous evaluation of all permutations of entities leads to 
combinatorial explosion. For example, if there are x entities in 
data set one and y entities in data set two, solving n parameters 
xy! 
(xy-n) nm 
(assuming that each matching hypothesis yields one equation). 
In addition, the memory requirements of an n dimensional 
accumulator array create another problem. 
simultaneously would lead to combinations 
Random Sample Consensus (RANSAC) is another alternative 
for fitting a model to experimental data (Fischler and Bolles, 
1981). Rather than using large data set with high percentage of 
blunders and trying to eliminate invalid matched, RANSAC 
starts by a and consistent data set whenever possible. This 
method is quite similar to using the modified Hough transform 
discussed in the previous paragraph, to simultaneously solve for 
all the involved parameters. 
An alternative approach is to solve for the parameters 
sequentially in an iterative manner (starting from some 
initial/approximate values), updating the approximations at each 
step. Consequentially, the accumulator array becomes one- 
dimensional and the memory problem disappears. Also, if there 
are x elements in data set one and y elements in data set two, the 
total number of evaluated entity matches becomes xy, reducing 
the computational complexity of the problem. After each 
iteration, the approximations are updated and the cell size of the 
accumulator array can be reduced to reflect the improvement in 
the quality of the approximate values of the unknown 
parameters. In this manner, the parameters can be estimated 
with high accuracy. The convergence of this approach depends 
on the correlation among the parameters and the non-linearity 
of the transformation function. Highly non-linear 
transformations have a slower convergence rate and would 
require more iterations. 
The basic steps for implementing the MIHT for parameter 
estimation are as follows: 1) A mathematical model is 
established that relates corresponding entities of two data sets. 
The relation between the data sets can be described as a 
function of its parameters: f(p;, py,...p,). 2) An accumulator 
array is formed for the parameters. The accumulator array is a 
discrete tessellation of the range of expected parameter 
solutions. The dimension of the accumulator array depends on 
the number of parameters to be simultaneously solved, which is 
related to the number of entity pairings simultaneously 
considered as well as the number of equations provided by a 
single matching hypothesis. 3) Approximations are made for 
parameters which are not yet to be determined. The cell size of 
the accumulator array depends on the quality of the initial 
approximations; poor approximations will require larger cell 
sizes. 4) Every possible match between individual entities of the 
two data sets is evaluated, incrementing the accumulator array 
at the location of the resulting solution. 5) After all possible 
matches have been considered, the maximum peak in the 
accumulator array will indicate the correct solution of the 
parameter(s). Only one peak is expected for a given 
accumulator array. 6) After each parameter is determined 
(usually in a sequential manner), the approximations are 
updated. For the next iteration, the accumulator array cell size is 
decreased, and steps 2-6 are repeated. Detailed explanation 
about the MIHT can be found in (Habib et al, 2000a) and 
(Habib et al, 2001). 
2.3 Single Photo Resection (SPR) 
In SPR, the collinearity model (Equation 1) is used to relate 
points in the image with corresponding points in the object 
space, and this relation is expressed as a function of the EOP. 
Traditionally, the parameters are estimated by way of a least 
squares adjustment involving measured control points in the 
image. At least three control points are required to estimate the 
six EOP. The introduction of more than three points increases 
the redundancy and strengthens the solution of the parameters. 
Xx, X, =X, (1) 
MV — A R' (v, 0, Kk) Yale 
mé 2,742, 
where 
A: Scale 
Xj, Vr Image coordinates of i point 
X,Y. Z Object coordinates of i point 
Xp Yo. C The camera interior orientation parameters. 
X, Yo, Zo, €, 0, X: The EOP of the camera exposure station. 
A - 151
1
2
...
164
165
166
167
168
...
456
457
Full text: Papers accepted on the basis of peer-review full manuscripts (Part A)

Access restriction

Copyright

Note to user