Full text: Systems for data processing, anaylsis and representation

  
5.2 Ordering 
The second area that we want to look into is to 
explore the internal nature of the large spatial 
objects. For example, it has been demonstrated that 
remotely sensed imagery has very high spatial 
correlation among pixels in a neighbor and between 
various spectral bands. Partition and re-ordering 
the spatial data to take these features into 
consideration can speed up many GIS queries. 
The traditional method of storing large object data 
is linear allocation whereby the data is stored by 
some predetermined orders. Typical date storage 
orders may be one of the pixel interleave, line 
interleave or plane interleave. This order may be 
efficient for some data processing that involved all 
pixels, but will make other access such as point or 
window queries in-efficient. Optimizing the 
allocation of large object data becomes increasingly 
important as more and more multi-spectral, multi- 
temporal large objects are available today in GISs. 
In GIS application such as display, editing and 
analysis, only a small part of data is required at a 
time. Point or window based queries to extract parts 
of the image are most often performed. This makes 
data ordering particular important for large object 
handling. To improve the system performance, 
several strategies have been proposed, which 
include: 
a) Tiling: which divides an image into tiles that 
are stored and accessed independently. By 
combination with parameters such as optimism 
buffer size, display window size, these techniques 
can improve the system performance dramatically, 
especially for network based GISs. 
b) Reordering: which permutes the order of image 
pixels to reduce average seek distance. For 
example, in data display, we could anticipate that 
zoom/pan is frequently performed on the large 
objects. Therefore, instead of linear storing each 
tile, we use the Mount-Caro order[Samet, 1989] to 
organize the tile files. 
c) Redundancy: which stores redundant copies of 
the image. The redundant images are abstracted 
and organized differently to optimize different 
patterns of access. For example, for overview 
purpose, a low resolution image can be used to give 
users the rough idea on the object. Only when the 
user specifically zooms in the data, the original 
data is provided to give more details. This 
technique not only makes the system easy to use, 
but also makes most interactive applications 
possible. 
5.3 Abstraction 
The third area that needs to be investigated is 
object abstraction. The conventional DBMS 
paradigm handles well-formatted data and does 
not need to perform any abstraction on the data. In 
contrast, large spatial objects are intrinsically 
information intensive and semantics richer. 
Intelligence needs to be build into the system before 
the system can be used to effectively and 
efficiently answer user's queries. The DAD in our 
data model is mainly used for this purpose, because 
we believe that data abstraction can be built into 
DAD either through automatic image processing 
methods or through manually editing or indexing 
methods and can be used to answer many queries in 
GISs. 
Abstraction can be defined as the process of 
constructing new concepts from a given collection of 
concepts. Abstraction is a method of extraction of 
the essential part of the data set. It is created 
using some combination of statistical analyses, 
data classification and artificial intelligence. 
Derived Attribute Data is the result of abstraction. 
Classification in the object oriented modeling is 
the example of abstraction. Image classification is 
another example of abstraction. 
The key observation is that while a query may be 
posed to a large object, the answer to the query may 
be tiny. For example, a query such as "what is the 
size of Lake Ontario" can be answered very easily 
if the abstracted DAD data is available. While 
many techniques can be used to perform data 
abstraction[Sequio 2000 Report 11], we only limit 
ourselves to the construction of meta-data and the 
vector counterpart of large spatial objects. For 
example, we use statistical analyses to derive the 
mean, variance, histogram, and other parameters 
of image objects. We also use image classification 
techniques and rules to extract classes such as 
water, cloud, land use, etc. with manual editing 
techniques followed to derive vector representation 
of the image. These derived data is modeled as 
DAD data and is stored in GIS data base for 
various applications. 
5.4 Indexing 
The last method to be investigated is ability to 
provide value indexing. The conventional DBMS 
paradigm provides object indexing. Hence, one can 
designate one or more fields in a record as indexed, 
and DBMS will build the appropriate kind of 
index on the data in the required fields. Value 
indexing may be reasonable in traditional 
216 
applica 
large © 
queries 
conten 
Ontaric 
of a cl 
and no 
for im 
values 
examp 
return 
and the 
To be 
datab: 
param 
throug 
large « 
further 
when 
large c 
proper 
we nc 
conter 
rarely 
seconc 
type « 
search 
end-u: 
retriey 
Howe 
used 
huma 
[Wiec 
obser 
techni 
Altho 
our lc 
and « 
gener 
paran 
Befor 
consi 
bein: 
a)as 
W 
b) a 
al 
cal 
N 
With 
datat
	        
Waiting...

Note to user

Dear user,

In response to current developments in the web technology used by the Goobi viewer, the software no longer supports your browser.

Please use one of the following browsers to display this page correctly.

Thank you.