Next: Dynamic Load Balancing and
Up: Reliable High Performance Computing
Previous: Branch-and-Bound
Of particular interest here are BP and BB schemes based on
interval analysis.
A real interval is defined as the set of real numbers lying between
(and including) given upper and lower bounds;
i.e.,
.
A real interval vector
has real interval components and can be interpreted geometrically
as an -dimensional rectangle (box). Note that in this section lower case
quantities are real numbers and upper case quantities are intervals.
Several good introductions to interval analysis are available
(e.g., [57,58,59]). In this section, interval
analysis is described in the context of solving nonlinear parameter
estimation problems, since that is the primary example used in the tests
discussed later. However, it should be emphasized that the interval
methods discussed here are general-purpose and can be used in
connection with other objective functions in a global optimization
problem and other equation systems in an equation solving problem.
BP and BB techniques can be constructed using the interval-Newton technique.
Given a nonlinear equation system with a finite number of real roots
in some initial interval, this technique provides the capability to
find (or, more precisely, narrowly enclose) all the roots of the
system within the given initial interval. For the unconstrained
minimization of an objective function (or estimator)
in parameter estimation, a common approach
is to use the gradient of
and seek a solution of
in order to determine the optimal parameter values
.
The global minimum will be a root of this nonlinear equation system,
but there may be many other roots as well, representing local minima
and maxima and saddle points. Thus, for this approach to be reliable,
the capability to find all the roots of
is needed, and this is provided by the interval-Newton technique.
In practice, by using an objective range test, as discussed below,
the interval-Newton procedure can also be implemented as a
BB technique, so that roots of
that cannot be the global minimum need not be found.
The solution algorithm is applied to a sequence of intervals,
beginning with some initial interval
specified
by the user. This initial interval can be
chosen to be sufficiently large to enclose all physically feasible values.
It is assumed here that the global optimum
will occur at an interior stationary minimum of
and not at the boundaries of
. Since the
estimator
is derived based on
a product of Gaussian distribution functions corresponding
to each data point, only a stationary global minimum is reasonable
for statistical regression problems such as considered here.
For an interval
in the sequence, the first step in
the solution algorithm is the function range test.
Here an interval extension
of the function
is calculated.
An interval extension provides upper and lower bounds on the
range of values that a function may have in a given interval.
It is often computed by substituting the given interval into the
function and then evaluating the function using interval arithmetic.
The interval extension so determined is often wider than the actual range of
function values, but it always includes the actual range.
If there is any component of the interval extension
that does not contain zero, then we may discard (prune) the current interval (node)
,
since the range of the function does not include zero anywhere in this interval,
and thus no solution of
exists in this interval. We may then
proceed to consider the next interval in the sequence, since the
current interval cannot contain a stationary point of
. Otherwise, if
,
then testing of
continues.
The next step is the objective range test. The interval extension
, which contains the range of
over
, is computed. If the lower bound of
is greater than a known upper bound on the
global minimum of
, then
cannot
contain the global minimum and need not be further tested. Otherwise,
testing of
continues. The upper bound on the
objective function used for comparison in this step can be determined
and updated in a number of different ways. Here we use point evaluations of
done at the midpoint of previously tested
intervals
that may contain stationary points. Using the objective range test yields a
BB procedure for the global minimization
of
, while if this step is skipped,
we will have a BP technique for finding all solutions of
, i.e., all stationary
points of
.
The next step is the interval-Newton test.
Here the linear interval equation system
is set up and solved for a new interval
,
where
is
an interval extension of the Jacobian of
,
i.e., the Hessian of
,
over the current interval
,
and
is a point in the interior of
,
usually taken to be the midpoint.
It has been shown (e.g., [57,58,59])
that any root
of
is also contained in the image
,
implying that if there is no intersection between
and
then no root exists in
,
and suggesting the iteration scheme
.
In addition to this iteration step, which can be used to tightly
enclose a solution, it has been proven
(e.g., [57,58,59]) that
if
is contained completely
within
, then there
is one and only one root contained within the
current interval
.
This property is quite powerful, as it provides a
mathematical guarantee of the existence and uniqueness
of a root within an interval when it is satisfied.
There are thus three possible outcomes to the interval-Newton test,
as shown schematically for a two variable problem in Figs. 1-3.
The first possible outcome (Fig. 1) is that
.
This represents mathematical proof that there exists a
unique solution to
within the current interval
,
and that that solution also lies within the image
.
This solution can be rigorously enclosed, with
quadratic convergence, by applying the interval-Newton step to the image
and repeating a small number of times.
Alternatively, convergence to a point approximation of the solution can be
guaranteed using a routine point-Newton method starting from anywhere
inside of the current interval. Since a unique solution has been
identified for this subproblem, it can be pruned, and
the next interval in the sequence can
now be tested, beginning with the function range test.
Figure 1:
The computed image
is a subset
of the current interval
. This is
mathematical proof that there is a unique solution of the
equation system in the current interval,
and furthermore that this unique solution is
also in the image.
 |
The second possible outcome (Fig. 2)
is that
.
This provides mathematical proof that no solutions of
exist within the current interval. Thus, the current interval can be
pruned and testing of next interval can begin.
Figure 2:
The computed image
has a null intersection with the current interval
.
This is mathematical proof that there is no solution of the
equation system in the current interval.
 |
The final possible
outcome (Fig. 3) is that the image
lies partially within the current interval
.
In this case, no conclusions can be made about the number of solutions
in the current interval. However, it is known that any solutions that
do exist must lie in the intersection
.
If the intersection is sufficiently smaller than the current interval,
one can proceed by reapplying the interval Newton test to the intersection.
Otherwise, the intersection is bisected, and the resulting two intervals added
to the sequence of intervals to be tested. This approach is referred to as
an interval-Newton/generalized-bisection (IN/GB) method, and depending on
whether or not the objective range test is employed, can be interpreted as
either a BB or BP procedure.
Figure 3:
The computed image
has a nonnull intersection with the current interval
.
Any solutions of the equation system must lie in the intersection of
the image and the current interval.
 |
It should be emphasized that, when machine
computations with interval
arithmetic operations are done, the endpoints of an interval are
computed with a directed outward rounding.
That is, the lower endpoint is rounded down to the next machine-representable
number and the upper endpoint is rounded up to the next machine-representable number.
In this way, through the use of interval, as opposed to floating point arithmetic,
any potential rounding error problems are eliminated, yielding an
approach that can provide a computational, not just mathematical,
guarantee of reliability.
Overall, when properly implemented,
the IN/GB method described above provides a procedure that is
mathematically and computationally guaranteed to find the global minimum of
, or, if desired,
to enclose all of its stationary points
(within, of course, the specified initial parameter
interval
).
In implementing an IN/GB algorithm, there are opportunities for
the use of parallel computing at multiple levels. On a fine-grained
level, the basic interval arithmetic operations can be
parallelized (e.g., [60]). On a larger-grained level,
the solution of the linear interval equation system for the image
can be parallelized (e.g., [61,62,63,64]). Of
course, on a coarse-grained level, each independent subproblem
generated in the bisection process can be tested in parallel
(e.g., [1,6,8,9]). It is only this
coarsest level of parallelism that will be considered here.
Next: Dynamic Load Balancing and
Up: Reliable High Performance Computing
Previous: Branch-and-Bound
ChaoYang Gau
2001-03-12
|