Professional Documents
Culture Documents
(x), where f
) = min
xX
f(x), (3)
where the objective function f : IR
n
IR has continuous rst and
second derivatives and X II
n
is an ndimensional interval.
Our method distinguishes the minima among all the stationary points
by applying a new characterization criterion. The stationary points of
a function f(x) in the box X
0
are the zeros of the set of equations:
g(x) = f(x) = O = (0, 0, . . . , 0), (4)
where f =
f
x
1
, . . . ,
f
x
n
(X
k
)(X
k
X
k
) = g(
X
k
), (5)
where g
(X
k
) represents an appropriate interval extension of the Jaco-
bian matrix of g over the box X
k
and
X
k
is the midpoint of the interval
vector X
k
.
To simplify the above notations we denote X
k
by X = (x
1
, x
2
, . . . ,
x
n
), and set A = A
i,j
n
i,j=1
with A
i,j
to be the interval in the ith
row and jth column of A = g
(X), g(
X
k
) = g = (g
1
, g
2
, . . . , g
n
) and
X
k
= M = (m
1
, m
2
, . . . , m
n
). With these notations Equation (5) can
be written in the following form:
A(X
k
M) = g. (6)
Then, applying an interval version of the GaussSeidel method, [27, 34],
with starting guess X
k
is a competitive way to compute the components
of X
k
on a rowbyrow basis according to the relation:
x
i
= m
i
g
i
+
n
j=1
j=i
A
ij
(x
j
m
j
)
A
ii
. (7)
The interval GaussSeidel linear solver possesses properties which
allow it to function as an existence and uniqueness test and to couple
the intervalNewton method to a bisection procedure [28]. These desir-
able characteristics of the GaussSeidel iteration follow from a proof
presented in [34]. Applying the GaussSeidel linear solver we are able
to obtain a root inclusion test to each subbox by assigning the val-
ues true, false or unknown as follows: The value true is assigned
if, after applying Relation (7), it holds that X
k
X
k
, which means
that System (4) has a unique solution in X
k
. If, on the other hand,
x
i
x
i
= for any i, then there are no solutions to System (4) in X
k
,
and the GaussSeidel method, acting as a root inclusion test, returns
the value false. Finally, if the GaussSeidel method returns neither
true nor false, then the root inclusion test is inconclusive, and
the procedure returns the value unknown. When the GaussSeidel
method returns unknown either the value of X
k
is set equal to that
of X
k
and another GaussSeidel iteration is performed, or one of the
coordinate intervals of X
k
is bisected to form two subboxes. In this
way, a bisection process enables an intervalNewton method to isolate
all the stationary points of f in the initial interval X
0
, (for more details
see [26]).
3WGO_PRO.tex; 4/03/2002; 16:54; no v.; p.6
Global Optimization for Imprecise Problems 7
Using the above way we are able to maintain two kinds of lists: a
list L of boxes X to be processed and a list | of boxes for which this
stage of the algorithm cannot decide whether they contain a unique
stationary point. These boxes will be processed at the next stage of the
algorithm.
In the second phase of the algorithm, the isolated stationary points
are characterized as minima, maxima or saddle points [54]. This pro-
cedure is based on the concept and properties of the characteristic
npolyhedron (CP) [51, 44, 47].
First, we give some preliminary concepts and theoretical background
necessary for the description of the characterization procedure. Let us
dene a characteristic npolyhedron by constructing the 2
n
n matrix
/
n
whose rows are formed by all possible combinations of 1, 1, which
is called ncomplete matrix. Let
n
=
1
,
2
, . . . ,
2
n
) be an oriented
npolyhedron in IR
n
with 2
n
vertices. The matrix of signs associated
with f and
n
, denoted by o(f;
n
), is the 2
n
n matrix whose
entries in the kth row are the corresponding coordinates of the vector:
sgnf(
k
) =
sgnf
1
(
k
), sgnf
2
(
k
), . . . , sgnf
n
(
k
)
, (8)
where sgn denes the wellknown sign function. Then the npolyhedron
n
=
1
,
2
, . . . ,
2
n
) in IR
n
is a characteristic polyhedron (CP) rela-
tive to f if the matrix o(f;
n
) is identical with the ncomplete
matrix /
n
[51]. In other words, the signs of the components of f at
the 2
n
vertices of
n
yield every combination of 1.
If
n
is a CP then, under suitable assumptions on the boundary
of
n
, the value of the topological degree of f at O relative to
n
is nonzero which implies the existence of a stationary point inside
n
(CPcriterion) [44, 45, 51]. It is important to note that the CPcriterion
avoids all calculations concerning the topological degree since it requires
not its exact value but only its nonzero value.
Next, we proceed with the description of the characterization proce-
dure. From the rst part of the algorithm we have a list of boxes each
of which contains a unique stationary point x
st
of f and so forms a
characteristic npolyhedron (CP), according to the CPcriterion. Each
stationary point can be characterized according to the orientation of
the vertices of the CP [54] by following the procedure outlined below.
First, the oriented CP =
1
,
2
, . . . ,
2
n
) is transformed so that its n
proper 1simplexes with a common vertex are edges of the polyhedron
with vertices formed by the rows of the following dened 2
n
n matrix
: If
x
min
j
= min
1
j
,
2
j
, . . . ,
2
n
j
, x
max
j
= max
1
j
,
2
j
, . . . ,
2
n
j
(9)
3WGO_PRO.tex; 4/03/2002; 16:54; no v.; p.7
8 M.N. Vrahatis et al.
are the minimum and maximum of all the jth components
i
j
of the CP
vertices
i
, i = 1, 2, . . . , 2
n
, respectively, ( is the rank1 2
n
n matrix
with elements in the jth column the value x
min
j
and B is the n n
diagonal matrix having as ith element the dierence h
i
= x
max
i
x
min
i
,
then we dene the matrix = ( + /
n
B. For example for n = 2 we
have:
= (+/
2
B =
x
min
1
x
min
2
x
min
1
x
min
2
x
min
1
x
min
2
x
min
1
x
min
2
0 0
0 1
1 0
1 1
h
1
0
0 h
2
x
min
1
x
min
2
x
min
1
x
max
2
x
max
1
x
min
2
x
max
1
x
max
2
), satises (
n
) ,
for some (0, 1), is given by:
B =
log
2
(
n
)
1
, (10)
where the notation | refers to the smallest integer, which is not less
than the real number quoted (see [44] for a proof).
Using the above procedures we propose the following model algo-
rithm:
Algorithm 1. (Locating, characterizing and computing the stationary
points of a function)
1. Initialize list L by placing the initial search region X
0
into it.
2. Do while L ,= ;
2.1 Remove the rst box X from L;
2.2 Do one of the following:
(a) Reject X (when the range of f over X does not contain
zero);
(b) Use an intervalNewton method to do one of the following:
i) reduce the size of X ( X X X) ;
ii) discard X (X X = );
iii) verify that a unique stationary point exists in X (X
X);
(c) If a unique stationary point is isolated, then characterize
it. If it is characterized as a minimum then compute it
within the accuracy , otherwise discard the box X. Get
the next box from the list L;
(d) In the case where the root inclusion test is inconclusive
then subdivide X to make it more likely to succeed at
rejecting, reducing, or verifying uniqueness and put the
corresponding subboxes into list L;
end{while}
end{algorithm}
4. Numerical Applications
The above procedures were implemented using a new portable Fortran
program named IMINBIS, which has been applied to several test func-
tions. For the rough isolation of the stationary points we have applied
3WGO_PRO.tex; 4/03/2002; 16:54; no v.; p.9
10 M.N. Vrahatis et al.
the intervalNewton/bisection package INTBIS due to Kearfott and
Novoa [28].
IMINBIS was tested on a HP715 workstation as well as on a PC
IBM compatible. Our experience is that the algorithm behaves pre-
dictably and reliably. The results were satisfactory without any redun-
dant function evaluations. Some typical computational results are given
below. For the following problems, the reported parameters are:
n dimension,
X
0
starting box,
the value of the variance of the simulated noise,
x
st
an ndimensional stationary point,
IFC the total number of interval function calls to determine the
range of the function,
IJC the total number of interval Jacobian calls,
IFNC the total number of interval function calls to do the interval
Newton method,
RFE1 the total number of real function evaluations for the local-
ization portion of our algorithm,
RFE2 the total number of real function evaluations for the charac-
terization portion of our algorithm,
RFE3 the total number of real function evaluations for the com-
putation of the minima within the accuracy = 10
8
.
Here we exhibit results from the following test cases.
Example 1. Himmelblau function, [20]. In this case f is given by:
f(x) =
x
2
1
+ x
2
11
2
+
x
1
+ x
2
2
7
2
.
This function has the following stationary points:
a) four global minima
x
st
1
= (3, 2),
x
st
2
= (3.584428340330, 1.848126526964),
x
st
3
= (3.779310253378, 3.283185991286),
x
st
4
= (2.805118086953, 3.131312518250),
b) one maximum x
st
5
= (0.270844590667, 0.923038556480), and
3WGO_PRO.tex; 4/03/2002; 16:54; no v.; p.10
Global Optimization for Imprecise Problems 11
Table I. Himmelblau function, n = 2.
x
st
i
RFE1 RFE2 RFE3 Characterization
x
st
1
4 2 58 minimum
x
st
2
4 2 66 minimum
x
st
3
4 2 45 minimum
x
st
4
4 2 63 minimum
x
st
5
4 2 0 maximum
x
st
6
4 1 0 saddle
x
st
7
4 1 0 saddle
x
st
8
4 1 0 saddle
x
st
9
4 1 0 saddle
= 0.01, IFC = 107, IJC = 89, IFNC = 89
c) four saddle points
x
st
6
= (0.086677504555, 2.884254701175),
x
st
7
= (3.073025750764, 0.081353044288),
x
st
8
= (3.385154183607, 0.073851879838),
x
st
9
= (0.127961346731, 1.953714980244).
Executing the implemented program with the initial box X
0
= [5, 5]
[5, 5], one characterizes all the above stationary points and computes
the four minima x
st
1
, . . . , x
st
4
, all of which have the same function value.
For this example the elapsed CPU time on a HP715 was 0.33 seconds.
In Table I we exhibit the corresponding results obtained by IMINBIS.
Example 2. Kearfott function, [25]. The objective function f is given
by:
f(x) =
x
2
1
+ x
2
2
2
2
+
x
2
1
x
2
2
1
2
.
This function has the following nine stationary points:
a) four global minima x
st
1
= (
1.5,
0.5), x
st
2
= (
1.5,
0.5),
x
st
3
= (
1.5,
0.5), x
st
4
= (
1.5,
0.5),
b) one maximum x
st
5
= (0, 0), and
c) four saddle points x
st
6
= (
1.5, 0), x
st
7
= (
1.5, 0), x
st
8
= (0,
0.5), x
st
9
= (0,
0.5).
3WGO_PRO.tex; 4/03/2002; 16:54; no v.; p.11
12 M.N. Vrahatis et al.
Table II. Kearfott function, n = 2.
x
st
i
RFE1 RFE2 RFE3 Characterization
x
st
1
4 2 58 minimum
x
st
2
4 2 57 minimum
x
st
3
4 2 51 minimum
x
st
4
4 2 52 minimum
x
st
5
4 2 0 maximum
x
st
6
4 1 0 saddle
x
st
7
4 1 0 saddle
x
st
8
4 1 0 saddle
x
st
9
4 1 0 saddle
= 0.01, IFC = 38, IJC = 32, IFNC = 32
Running IMINBIS with the initial box X
0
= [2, 2] [2, 2], we char-
acterize all the above stationary points and compute the four global
minima x
st
1
, . . . , x
st
4
. In Table II we exhibit the obtained results.
Example 3. Extended Kearfott function, [25, 44]. In this case f is given
by
f(x) =
n
i=1
f
2
i
(x),
where:
f
i
(x
1
, x
2
, . . . , x
n
) = x
i
2
x
i+1
, i = 1, 2, . . . , n 1,
f
n
(x
1
, x
2
, . . . , x
n
) = x
n
2
x
1
.
This function, for n = 4, has the following stationary points:
a) two global minima x
st
1
= (0, 0, 0, 0), x
st
2
= (1, 1, 1, 1), and
b) one saddle point x
st
3
= (0.5, 0.5, 0.5, 0.5),
Executing the implemented program with the initial box X
0
= [2, 2]
4
,
one characterizes all the above stationary points and computes the two
global minima after 2.32 seconds of CPU time on an HP715. In Table
III we exhibit the corresponding results obtained by IMINBIS.
Example 4. Quadratic function, [48]. The objective function f is given
by:
f(x
1
, x
2
, . . . , x
n
) = x
2
1
+ x
2
2
+ + x
2
n
r,
3WGO_PRO.tex; 4/03/2002; 16:54; no v.; p.12
Global Optimization for Imprecise Problems 13
Table III. Extended Kearfott function, n = 4.
x
st
i
RFE1 RFE2 RFE3 Characterization
x
st
1
16 8 739 minimum
x
st
2
16 8 781 minimum
x
st
3
16 1 0 saddle
= 0.01, IFC = 2610, IJC = 1627, IFNC = 1627
with f(x