Close-range imaging, long-range vision

    
   
     
    
   
  
  
  
   
   
    
  
     
  
  
  
  
  
  
  
  
  
  
   
   
    
  
   
BS o 
Neo €0 cC €) T CD UM 0 
T 
C$ 2A Ou Law OO OO O 
  
| 
  
have also been identified [Hartley, 2000, Kahl et al, 2001]. For 
metric construction, certain sequences will cause self- 
calibration to fail or not give a unique construction. These 
critical motions have been studied when camera intrinsic 
parameters do not vary [Sturm, 1997] and when they do vary 
from image to image [Kahl et al, 2000]. 
This paper focuses on the 3D construction, which is the least 
automated of all the steps, rather than correspondence, pose 
estimation or calibration. 
3. AUTOMATION AND WIDELY-SEPARATED VIEWS 
From the above overview of current techniques, the following 
points can be made: 
e In practical situations taking the sequences required for a 
fully automated techniques might not be feasible. 
e Large objects or complex scenes require a large number of 
images to carry matching and pose estimation automatically. 
e Since the modeling process still requires human interaction to 
define the topology, assign constraints, and post-process the 
results, large number of images makes this difficult. 
It is therefore important to develop an approach that requires 
only a small number of widely separated views and at the same 
time offers a high level of automation, and be able to deal with 
occluded and unmarked surfaces. 
4. ON OCCLUSION AND LACK OF TEXTURE 
Occlusions and lack of texture hinder image-based modeling 
since the methods require features that can be seen either by a 
computer or human. The scene in figure 1 has occlusions and 
most of the columns surfaces have no texture. Both fully 
automated and fully manual methods will have difficulty here. 
Yet, this is typical in much classical architecture. In our 
approach, with less than 30 manually measured points, the full 
scene [figure 2] with automatically added 300 points can be 
completed without further human intervention. 
   
    
Figure 1: Corner of a Figure 2: The constructed 
courtyard solid model 
   
5. DETAILS OF THE APPROACH 
The approach is designed mainly for man-made objects. A good 
example is classical architectures, which are designed within 
constraints of proportion and configurations. Classical buildings 
are divided into architectural elements. These elements are 
logically organized in space to produce a coherent work. There 
is a logical hierarchical relation among building parts and 
between parts and whole. The most common scheme divides 
the building into two sets of lines forming a rectangular grid 
[Tzonis and Lefaivre, 1986]. The distance between the grid 
lines are often equal or when they vary, they alter regularly. The 
grid lines are then turned into planes that partition the space and 
control the placement of the architectural elements. The 
automation of 3D reconstruction is better achieved when such 
understanding is taken into account. We will reconstruct the 
architecture elements from minimum number of points and put 
them together using the planes of a regular grid. Other schemes, 
such as a polar grid, also exist but the basic idea can be applied 
there too. Classical architecture can be reconstructed, knowing 
its components, even if only a fragment survives or seen in the 
images. For example, a columnar element consists of: 1) The 
capital, a horizontal member on top, 2) the column itself, a long 
vertical tapered cylinder, 3) a pedestal or a base on which the 
column rests. Each of those can be further divided into smaller 
elements. In addition to columns, other elements include pillars, 
pilasters, banisters, windows, doors, arches, and niches. Each 
can be reconstructed with a few seed points from which the rest 
of the element is built. 
imaging 
Selection 
Initial Point Extraction 
widely separated víews 
    
  
  
Interactive 
  
  
  
Segmentation «€-------—---— 
Constraints 
Seed Points Element Properties 
  
Figure 3. Simplified diagram of the general procedure that 
shows which is automatic. 
Our approach is photogrammetry-based. In order to increase the 
level of automation, the process takes advantage of properties 
like those mentioned above for man-made objects and 
structures. The approach provides an adequate amount of 
automation to assist an operator to provide high level of details 
with excellent geometric accuracy. Figure 3 summarizes the 
procedure and indicates which step is interactive and which is 
automatic (interactive operations are light gray). The figure also 
shows an option of taking a closely-spaces sequence of images, 
if conditions allow, and increase the level of automation. In the 
remainder of the paper, we will discuss only the option of 
widely separated views. Images are taken, all with the same 
camera set up, from positions where the object is suitably 
showing. There should be a reasonable distance, or baseline, 
between the images. Several features appearing in multiple 
images are interactively extracted, usually 12-15 per image. The 
user points to a corner and label it with a unique number and the 
system will accurately extract the corner point. Harris operator 
is used [Harris, 1998] for its simplicity and efficiency. Image 
registration and 3D coordinate computation are based on the 
photogrammetric bundle adjustment approach for its accuracy, 
flexibility, and effectiveness compared to other structure from 
motion techniques [Triggs et al, 2000]. Advances in bundle 
adjustment eliminated the need for control points or physically 
entering initial approximate coordinates. Many other aspects 
required for high accuracy such as camera calibration with full 
distortion corrections have long been solved problems in 
Photogrammetry and will not be discussed in this paper.
1
2
...
158
159
160
161
162
...
640
641
Full text: Close-range imaging, long-range vision

Access restriction

Copyright

Note to user