Full text: Proceedings (Part B3b-2)

The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences. Voi. XXXVII. Part B3b. Beijing 2008 
Learning (training) a network is the process of adapting or 
modifying the connection weights between neurons so that the 
network can fulfil a specific task. Back Propagation is the most 
common learning algorithm which is an iterative gradient 
decent algorithm designed to minimize error function expressed 
in equation 1: 
manually produced reference map which is used for accuracy 
assessment. 
In the first try, the spectral information for each pixel, after 
normalizing the RGB values between 0 and 1, is simply entered 
to the network as its input parameters. 
(Eq.l) 
Where d j and cr represent desired output and current 
response of the neuron “j” in the output layer respectively and 
“L” is the number of neurons in the output layer. In an iterative 
method, corrections to weight parameters are computed and 
added to previous values as below: 
< 
(Eq2; 
Where W- ■ is weight parameter between neuron i and j, TJ is a 
positive constant that controls the amount of adjustment and is 
called learning rate, OC is a momentum factor that can take 
values between 0 and 1 and “t” denotes the iteration number. 
The parameter ^ smoothes rapid changes between the weights. 
Learning rate and momentum parameter have a major influence 
on the success of learning process. 
2.1 Network Structure 
In order to use neural networks in road detection, input layer is 
consisted of neurons the same number as input parameters and 
output layer is made up of just one neuron that shows whether 
the input parameters can represent a road pixel or not. Usually, 
one hidden layer is sufficient, although the number of neurons 
in the hidden layer is often not readily determined (Richard, 
1993). 
In this research, a back propagation neural network with one 
hidden layer is implemented that uses 500 road and 500 
background pixels as its training set. 
An adaptive strategy is used to avoid trail and error learning 
rate and momentum assignment. In this method, both 
parameters are adjusted downwards as half after some training 
intervals if the overall training error has increased and upward 
1.2 times if the overall error has decreased (Heerman and 
Khazeinie, 1992). 
One of the most important factors in employing ANNs is to 
decide what type of information should be extracted from input 
image to be fed through the network as its input parameters. 
The discrimination ability of the network is highly affected by 
chosen input parameters. In continue different input vectors are 
evaluated to find the optimum network design. 
2.2 Network Input Parameters Design 
As a case study, a part of a pan-sharpened true colour (RGB) 
Ikonos image with the size of 550*550 pixels from Kish Island 
in Iran was chosen. Figure 1 shows the original image and its 
Thus, three neurons are designed in input layer in charge of 
receiving spectral values for each pixel in entire image. Figure 
2 shows the network structure and obtained results are 
presented in Table 1. 
Figure 1; a) RGB Ikonos Image from Kish Island in Iran, b) Manually 
produced reference map. 
Hi.N 
Best 
Iteratio 
n 
RCC 
BCC 
RMSE 
Kappa 
Coeff. 
Over 
all 
Acc. 
5 
15000 
77.66 
88.96 
0.2459 
67.92 
93.8 
1 
10 
5000 
73.29 
88.85 
0.2240 
68.34 
94.6 
6 
15 
5000 
73.60 
89.97 
0.2204 
69.46 
94.7 
5 
20 
10000 
74.79 
90.85 
0.2235 
69.30 
94.4 
7 
Table 1: Three spectral values as input parameters 
In this table, RCC and BCC are the road and background 
correctness coefficients that show the percentage of true 
functionality of the network about road and background pixels 
respectively. The RMSE value is computed by comparing of 
network responses in the output neuron and desired value from 
manually produced reference map (1 for road and 0 for 
background pixels). Kappa coefficient and overall accuracy 
parameters are obtained the same way as conventional 
classification methods where the network response about each 
pixel is classified into two classes as road and background 
pixels using an appropriate threshold value. 
In order to improve network’s functionality and considering the 
fact that roads are presented as homogeneous areas in high
	        
Waiting...

Note to user

Dear user,

In response to current developments in the web technology used by the Goobi viewer, the software no longer supports your browser.

Please use one of the following browsers to display this page correctly.

Thank you.