Improving Ultimate Convergence of An Augmented Lagrangian Method
Abstract
Optimization methods that employ the classical Powell-Hestenes-Rockafellar Augmented Lagrangian are useful tools for solving Nonlinear Programming problems. Their reputation decreased in the last ten years due to the comparative success of Interior-Point Newtonian algorithms, which are asymptotically faster. In the present research a combination of both approaches is evaluated. The idea is to produce a competitive method that is more robust and efficient than its "pure" counterparts on critical problems. Moreover, an additional hybrid algorithm is defined, in which the Interior-Point method is replaced by the Newtonian resolution of a KKT system identified by the Augmented Lagrangian algorithm. The software used in this work is freely available through the Tango Project web page: http://www.ime.usp.br/~egbirgin/tango/.
1 Introduction
We are concerned with Nonlinear Programming problems defined in the following way:
Minimize f(x)
subject to h(x) = 0,
           g(x) ≤ 0,                      (1)
           x ∈ Ω,

where h : IR^n → IR^m, g : IR^n → IR^p, f : IR^n → IR are smooth and Ω ⊂ IR^n is an n-dimensional box. Namely, Ω = {x ∈ IR^n | ℓ ≤ x ≤ u}.
∗ Department of Computer Science IME-USP, University of São Paulo, Rua do Matão 1010, Cidade Universitária, 05508-090, São Paulo SP, Brazil. This author was supported by PRONEX-Optimization (PRONEX - CNPq / FAPERJ E-26 / 171.164/2003 - APQ1), FAPESP (Grant 06/53768-0) and CNPq (PROSUL 490333/2004-4). e-mail: [email protected]
† Department of Applied Mathematics, IMECC-UNICAMP, University of Campinas, CP 6065, 13081-970 Campinas SP, Brazil. This author was supported by PRONEX-Optimization (PRONEX - CNPq / FAPERJ E-26 / 171.164/2003 - APQ1), FAPESP (Grant 06/53768-0) and CNPq. e-mail: [email protected]
The Powell-Hestenes-Rockafellar (PHR) Augmented Lagrangian [41, 54, 56] is given by:
L_ρ(x, λ, µ) = f(x) + (ρ/2) [ Σ_{i=1}^{m} ( h_i(x) + λ_i/ρ )^2 + Σ_{i=1}^{p} max{0, g_i(x) + µ_i/ρ}^2 ]    (2)

for all x ∈ IR^n, λ ∈ IR^m, µ ∈ IR^p_+, ρ > 0.
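To fix ideas, here is a minimal sketch of how (2) can be evaluated, assuming f, h and g are supplied as callables returning a scalar, an m-vector and a p-vector; the function and its name are illustrative, not part of Algencan:

```python
import numpy as np

def aug_lagrangian(x, lam, mu, rho, f, h, g):
    """Evaluate the PHR Augmented Lagrangian (2) at (x, lam, mu) for penalty rho."""
    shifted_eq = h(x) + lam / rho                  # h_i(x) + lambda_i / rho
    shifted_in = np.maximum(0.0, g(x) + mu / rho)  # max{0, g_i(x) + mu_i / rho}
    return f(x) + 0.5 * rho * (shifted_eq @ shifted_eq + shifted_in @ shifted_in)
```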
PHR-based Augmented Lagrangian methods for solving (1) are based on the iterative (ap-
proximate) minimization of Lρ with respect to x ∈ Ω, followed by the updating of the penalty
parameter ρ and the Lagrange multipliers approximations λ and µ. The most popular prac-
tical Augmented Lagrangian method gave rise to the Lancelot package [22, 24]. Lancelot does not handle inequality constraints g(x) ≤ 0 directly: when an inequality constraint g_i(x) ≤ 0 appears in a particular problem, it is replaced by g_i(x) + s_i = 0, s_i ≥ 0. The convergence of the
Lancelot algorithm to KKT points was proved in [22] using regularity assumptions. Under
weaker assumptions that involve the Constant Positive Linear Dependence (CPLD) constraint
qualification [3, 55], KKT-convergence was proved in [1] for a variation of the Lancelot method.
In [2], a new PHR-like algorithm was introduced that does not use slack variables to handle inequality constraints and admits general constraints in the lower-level set Ω. In the box-
constraint case considered in this paper, subproblems are solved using a matrix-free technique
introduced in [11], which improves the Gencan algorithm [10]. CPLD-based convergence and
penalty-parameter boundedness were proved in [2] under suitable conditions on the problem.
In addition to its intrinsic adaptability to the case in which arbitrary constraints are included
in Ω, the following positive characteristics of the Augmented Lagrangian approach for solving (1)
must be mentioned:
1. Augmented Lagrangian methods proceed by sequential resolution of simple (generally un-
constrained or box-constrained) problems. Progress in the analysis and implementation of
simple-problem optimization procedures produces an almost immediate positive effect on
the effectiveness of associated Augmented Lagrangian algorithms. Box-constrained mini-
mization is a dynamic area of practical optimization [9, 12, 13, 16, 26, 40, 45, 51, 67, 70]
from which we can expect Augmented Lagrangian improvements. In large-scale problems,
the availability of efficient matrix-free box-constraint solvers is of maximal importance.
2. If the subproblems are minimized globally, the Augmented Lagrangian method converges to global minimizers [8]. There is a large field for research on global optimization methods for box-constraint optimization. When the global box-constraint optimization problem is satisfactorily solved in practice, the effect on the associated Augmented Lagrangian method for Nonlinear Programming problems is immediate.
3. Most box-constrained optimization methods are guaranteed to find stationary points. In
practice, good methods do more than that. The line-search procedures of [10], for example,
include extrapolation steps that are not necessary from the point of view of KKT conver-
gence. However, they enhance the probability of convergence to global minimizers. In the
context of box-constrained optimization, "magical steps" in the sense of [24] (pp. 387–391) tend to be effective in increasing the probability of convergence to global minimizers. As a consequence, the probability of convergence to global minimizers of the Nonlinear Programming problem by a practical Augmented Lagrangian method is enhanced too.
4. The theory of convergence to global minimizers of Augmented Lagrangian methods [8]
does not need differentiability of the functions that define the Nonlinear Programming
problem. In practice, this indicates that the Augmented Lagrangian approach may be
successful in situations where smoothness is dubious.
5. The Augmented Lagrangian approach can be adapted to the situation in which analytic
derivatives, even if they exist, are not computed. See [44] for a derivative-free version of
Lancelot.
6. In many practical problems the Hessian of the Lagrangian is structurally dense (in the
sense that any entry may be different from zero at different points) but generally sparse
(given a specific point in the domain, the particular Lagrangian Hessian is a sparse matrix).
As an example of this situation, consider the following formulation of the problem of fitting circles of radii r within a circle of radius R without overlapping [14]:

Min Σ_{i<j} max{0, 4r^2 − ‖p_i − p_j‖_2^2}^2   subject to   ‖p_i‖_2^2 ≤ (R − r)^2.
The Hessian of the objective function is structurally dense but sparse at any point such
that points pi are “well distributed” within the big circle. Newtonian methods usually
have difficulties with this situation, both in terms of memory and computer time, since
the sparsity pattern of the matrix changes from iteration to iteration. This difficulty
is almost irrelevant for the Augmented Lagrangian approach if one uses a low-memory
box-constraint solver.
7. Independently of the Lagrangian Hessian density, the structure of the KKT system may be
very poor for sparse factorizations. This is a serious difficulty for Newton-based methods,
but not for suitable implementations of the Augmented Lagrangian PHR algorithm.
8. If the Nonlinear Programming problem has many inequality constraints, the usual slack-
variable approach of Interior-Point methods (also used in [1, 22]) may be inconvenient.
There are several approaches to reduce the effect of the presence of many slacks, but they
may not be as effective as not using slacks at all. The price of not using slacks is the
absence of continuous second derivatives in Lρ . In many cases, this does not seem to be a
serious practical inconvenience [7].
9. Huge problems have obvious disadvantages in terms of storage requirements. The Augmented Lagrangian approach provides a radical remedy: problem data may be computed "on the fly", used when required in the subproblems, and not stored at all. This is not possible if one uses matrix-based approaches, independently of the sparsity strategy adopted.
10. If, at the solution of the problem, some strong constraint qualification fails to hold, the performance of Newton-like algorithms could be severely affected. The Augmented Lagrangian approach is not so sensitive to this type of difficulty.
11. Augmented Lagrangian methods are useful in different contexts, such as Generalized Semi-
Infinite Programming. If one knows how to solve Ordinary Semi-Infinite Programming
problems, the Augmented Lagrangian seems to be the reasonable tool to incorporate “x-
dependent” constraints in the lower-level problems [53].
Despite all these merits, the amount of research dedicated to Augmented Lagrangian methods
decreased in the present century. Modern methods, based on interior-point (IP) techniques,
sequential quadratic programming (SQP), trust regions, restoration, nonmonotone strategies
and advanced sparse linear algebra procedures attracted much more attention [4, 5, 17, 19, 21,
20, 31, 32, 33, 34, 46, 50, 59, 62, 63, 65, 66, 69].
A theoretical reason, and its practical consequence, may be behind this switch of interest.
Roughly speaking, under suitable assumptions, Interior-Point Newtonian techniques converge
quadratically (or, at least, superlinearly) whereas practical Augmented Lagrangian algorithms
generally converge only linearly. Therefore, if both methods converge to the same point, and the
required precision is strict enough, an Interior-Point Newtonian (or SQP) method will require
less computer time than an Augmented Lagrangian method, independently of the work per
iteration. (Of course, in practical problems there is not such a thing as an “arbitrarily high
precision”. The precision required in a practical problem is the one that is satisfactory for the
user purposes.)
The situation is analogous when one compares Newton’s method and an Inexact-Newton
method for solving nonlinear systems. Ultimately, if an extremely high precision is required,
Newton’s method will be the best. The Inexact-Newton method is a practical algorithm because
in some problems the cost of the Newton iteration cannot be afforded due to the problem
structure.
These facts inspired the following idea: Assume that we wish to solve a problem with a
structure that favors the use of the Augmented Lagrangian method, but the required precision
ε is rather strict. The Augmented Lagrangian performance could perhaps be improved if this method is run up to a more modest precision (say √ε) and the final point so obtained is used to initialize a fast local method. The present paper is dedicated to a numerical evaluation
of the practical perspectives of this idea. Basically, we will use two “fast local methods” to com-
plete Augmented Lagrangian executions. The first will be Ipopt, the interior-point algorithm
introduced in [65]. The second will be Newton’s method, applied to the KKT conditions, with
a reduced number of constraints and slack variables.
A comparison between the original methods and their hybrid and accelerated counterparts
will be presented.
Notation. The symbol ‖ · ‖ will denote the Euclidean norm. If v = (v_1, . . . , v_n)^T ∈ IR^n we denote v_+ = (max{0, v_1}, . . . , max{0, v_n})^T. The distance between the point z and the set S is denoted dist(z, S) and defined by dist(z, S) = inf{‖z − s‖ : s ∈ S}.
Convergence to KKT points under the CPLD constraint qualification and penalty parameter boundedness were proved in [2]. Algencan, which is publicly available in the Tango Project web page http://www.ime.usp.br/~egbirgin/tango/, is the application of the main algorithm in [2] to problem (1).
Algorithm 2.1 (Algencan)

Let λ_min < λ_max, µ_max > 0, γ > 1, 0 < τ < 1. Let {ε_k} be a sequence of nonnegative numbers such that lim_{k→∞} ε_k = 0. Let λ_i^1 ∈ [λ_min, λ_max], i = 1, . . . , m, µ_i^1 ∈ [0, µ_max], i = 1, . . . , p, and ρ_1 > 0. Let x^0 ∈ Ω be an arbitrary initial point. Initialize k ← 1.

Step 1. Find x^k ∈ Ω such that

‖P_Ω(x^k − ∇L_{ρ_k}(x^k, λ^k, µ^k)) − x^k‖_∞ ≤ ε_k.    (3)
Step 2. Define

V_i^k = max{ g_i(x^k), −µ_i^k / ρ_k }, i = 1, . . . , p.    (4)
If k = 1 or
max{‖h(x^k)‖_∞, ‖V^k‖_∞} ≤ τ max{‖h(x^{k−1})‖_∞, ‖V^{k−1}‖_∞},    (5)
define ρk+1 = ρk . Otherwise, define ρk+1 = γρk .
Remark. In practice, we use the first-order safeguarded estimates of the Lagrange multipliers: λ_i^{k+1} = min{max{λ_min, λ_i^k + ρ_k h_i(x^k)}, λ_max} for i = 1, . . . , m, and µ_i^{k+1} = min{max{0, µ_i^k + ρ_k g_i(x^k)}, µ_max} for i = 1, . . . , p.
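The following is a sketch of this outer-iteration bookkeeping, covering (4), the test (5) and the safeguarded multiplier updates; the function name and calling convention are ours, not Algencan's:

```python
import numpy as np

def outer_update(h_val, g_val, lam, mu, rho, prev_progress,
                 tau=0.5, gamma=10.0, lam_min=-1e20, lam_max=1e20, mu_max=1e20):
    """One outer-iteration update: V^k of (4), the penalty test (5) and the
    safeguarded first-order multiplier estimates."""
    V = np.maximum(g_val, -mu / rho)                        # (4)
    progress = max(np.linalg.norm(h_val, np.inf),
                   np.linalg.norm(V, np.inf))
    # (5): keep rho if feasibility-complementarity improved by a factor tau
    rho_new = rho if progress <= tau * prev_progress else gamma * rho
    lam_new = np.clip(lam + rho * h_val, lam_min, lam_max)  # safeguarded lambda
    mu_new = np.clip(mu + rho * g_val, 0.0, mu_max)         # safeguarded mu >= 0
    return rho_new, lam_new, mu_new, progress
```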
Assume that the feasible set of a nonlinear programming problem is given by h(x) = 0, g(x) ≤
0, where h : IRn → IRm and g : IRn → IRp . Let I(x) ⊂ {1, . . . , p} be the set of indices of the
active inequality constraints at the feasible point x. Let I1 ⊂ {1, . . . , m}, I2 ⊂ I(x). The subset
of gradients of active constraints that correspond to the indices I1 ∪ I2 is said to be positively
linearly dependent if there exist multipliers λ, µ such that
X X
λi ∇hi (x) + µi ∇g i (x) = 0, (6)
i∈I1 i∈I2
with µi ≥ 0 for all i ∈ I2 and i∈I1 |λi | + i∈I2 µi > 0. Otherwise, we say that these gradients
P P
5
active constraints is positively linearly dependent at the feasible point x (i.e. (6) holds), then
there exists δ > 0 such that the vectors
Theorem 2.1. Assume that {x^k} is a sequence generated by Algencan and x∗ is a limit point. Then,

1. If x∗ is infeasible, then x∗ is a stationary point of the problem of minimizing ‖h(x)‖^2 + ‖g(x)_+‖^2 subject to x ∈ Ω.

2. If x∗ is feasible and fulfills the Constant Positive Linear Dependence constraint qualification (with respect to all the constraints, including the bounds), then x∗ satisfies the KKT conditions of (1).
Under additional local conditions it was proved in [2] that the sequence of penalty parameters
{ρk } remains bounded.
The following theorem is an easy consequence of Theorems 2.1 and 2.2 of [8].
Theorem 2.2. Assume that (1) admits a feasible point and that, instead of (3), each subproblem is considered as approximately solved when x^k ∈ Ω is found such that

L_{ρ_k}(x^k, λ^k, µ^k) ≤ L_{ρ_k}(x, λ^k, µ^k) + ε_k for all x ∈ Ω,

with ε_k ≤ ε. Then, every limit point x∗ of {x^k} is feasible and satisfies

f(x∗) ≤ f(y) + ε

for every feasible point y.
Therefore, Theorem 2.2 states that the sequence generated by the algorithm converges to an
ε-global minimizer, provided that εk -global minimizers of the subproblems are computed at each
outer iteration. The practical consequences of Theorems 2.1 and 2.2 are different. Theorem 2.1
applies directly to the present implementation of Algencan, and, in spite of the effect of floating
point computations, reflects the behavior of the method in practical calculations. Theorem 2.2
describes what should be expected from the Augmented Lagrangian method if the subproblems
are solved with an active search of the global minimizer.
In the practical implementation of Algorithm 2.1 (Algencan), subproblems are solved using Gencan [10] with the modifications introduced in [11]. The default parameters recommended in [2] are τ = 0.5, γ = 10, λ_min = −10^{20}, µ_max = λ_max = 10^{20}, ε_k = ε for all k, λ^1 = 0, µ^1 = 0 and

ρ_1 = max{ 10^{−6}, min{ 10, 2|f(x^0)| / (‖h(x^0)‖^2 + ‖g(x^0)_+‖^2) } }.    (7)
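For instance, the default ρ_1 of (7) can be computed as below; this is a sketch, and the guard against a feasible x^0 (where the denominator of (7) vanishes) is our addition:

```python
import numpy as np

def initial_rho(f0, h0, g0):
    """Default initial penalty parameter rho_1, following (7)."""
    gplus = np.maximum(0.0, g0)
    infeas = h0 @ h0 + gplus @ gplus
    if infeas == 0.0:   # feasible initial point: (7) is undefined; this guard is ours
        return 10.0
    return max(1e-6, min(10.0, 2.0 * abs(f0) / infeas))
```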
At every iteration of Algencan, we define
Therefore, µ_i^{k+} = 0. Since, in this case, −g_i(x^k) > ε, we have that min{−g_i(x^k), µ_i^{k+}} = 0. Therefore, (8) is proved.

Taking ε = |V_i^k|, we obtain:

DFM(k) = ‖P_Ω(x^k − ∇L_{ρ_k}(x^k, λ^k, µ^k)) − x^k‖_∞ = ‖P_Ω(x^k − ∇L(x^k, λ^{k+}, µ^{k+})) − x^k‖_∞.
For simplicity, we denote, from now on, ICM = ICM(k), DFM = DFM(k).
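Since Ω is a box, the projection P_Ω is componentwise clipping, so DFM can be computed in a couple of lines; in this sketch, grad stands for ∇L_{ρ_k}(x^k, λ^k, µ^k) and the function name is ours:

```python
import numpy as np

def dfm(x, grad, lower, upper):
    """DFM(k) = ||P_Omega(x - grad) - x||_inf for the box Omega = [lower, upper]."""
    return np.linalg.norm(np.clip(x - grad, lower, upper) - x, np.inf)
```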
We stop the execution of the algorithm declaring Convergence to a KKT point if

max{ICM, DFM} ≤ ε.
Under reasonable assumptions, the quantity max{ICM, DFM} is of the same order as the dis-
tance between (xk , λk+ , µk+ ) and the set of primal-dual solutions of the nonlinear programming
problem. A precise statement of this fact is in Theorem 2.3.
2. The feasible point x∗ satisfies the KKT conditions of (1) and the Mangasarian-Fromovitz
constraint qualification [47]. Let S be the set of (primal-dual) KKT triplets (x∗ , λ∗ , µ∗ )
associated to x∗ ;
3. For each primal-dual KKT point (x∗ , λ∗ , µ∗ ), the second order sufficient optimality condi-
tion holds.
Then, there exist δ > 0 and C > 0 such that, if dist((x^k, λ^{k+}, µ^{k+}), S) ≤ δ, we have:

dist((x^k, λ^{k+}, µ^{k+}), S) ≤ C max{ICM(k), DFM(k)}.
Proof. This result is a straightforward corollary of Theorem 3.6 of [29]. See, also, [39, 52, 68].
Dual feasibility with tolerance ε (DFM ≤ ε) is guaranteed by (3) and the choice of εk .
In infinite precision, the criterion (3) is necessarily achieved for all the subproblems, since Gencan converges to stationary points. In practice, due to rounding errors and scaling, Gencan may fail to satisfy (3) at some iterations. In these cases, Gencan is stopped after a maximum number of iterations (1000 in this implementation) or by a Not-Enough-Progress criterion. When this happens, it may be possible that the Feasibility-Complementarity convergence criterion ICM ≤ ε is satisfied at some iteration but not the projected gradient condition (3). If this is the case, the execution of Algencan continues without increasing the penalty parameter.
Ipopt, in its default configuration, modifies the initial estimation of the solution (even in the warm-start case) as well as the bound constraints of the problem.
of the problem. The modification of the initial point is done to avoid an initial point near
the boundary of the feasible region, whereas the modification of the bounds is done to avoid
feasible sets with empty interior. These modifications may be avoided by a non-standard setting
of the Ipopt parameters DBNDFRAC, DBNDPUSH and DMOVEBOUNDS (see the Ipopt
documentation for further details). However, modifying those parameters might influence the
overall performance of Ipopt. The determination of the optimal Ipopt parameters in the
presence of warm-starts is out of the scope of the present study.
We will consider first that the probably active bounds identified at x^k are those such that x_i^k = ℓ_i or x_i^k = u_i, and that the probably active inequality constraints at x^k are the constraints defined by g_i(x^k) ≥ −10^{−4}. Let I_A be the set of indices of probably active inequality constraints. Let r be the number of elements of I_A. Assume, without loss of generality, that the last n − q variables are identified as having probably active bounds. Thus, x_i^k = x̄_i ∈ {ℓ_i, u_i} for all i = q + 1, . . . , n. We define f̄, h̄ and ḡ_i (i ∈ I_A) as f, h and g_i with the last n − q variables fixed at their bounds x̄_{q+1}, . . . , x̄_n, and z = (x_1, . . . , x_q).
Therefore, the KKT system we aim to solve is:
∇f̄(z) + Σ_{i=1}^{m} λ_i ∇h̄_i(z) + Σ_{i∈I_A} µ_i ∇ḡ_i(z) = 0,    (12)
h̄(z) = 0,    (13)
ḡ_i(z) = 0, ∀ i ∈ I_A.    (14)
This nonlinear system has q+m+r variables and equations. We tried to use Newton’s method
for its resolution. The particular implementation of Newton’s method was straightforward.
Namely, writing the system above as F(y) = 0, we solved, at each iteration, the linear system

F′(y) Δy = −F(y),

and we updated y ← y + Δy. If ‖F(y)‖_∞ ≤ 10^{−8} the process was stopped. In this case, we
checked whether the obtained point remained feasible, up to tolerance 10^{−8}, and whether the inequality Lagrange multipliers remained nonnegative, up to tolerance 10^{−8}. If these requirements were satisfied, we declared that the local Newton acceleration was successful and obtained a KKT point, up to the required tolerance. If Newton's method used more than 5 iterations or if the linear Newtonian system could not be solved, we declared local failure. The linear Newtonian systems were solved using the HSL (Harwell) subroutine MA27. When MA27 detects singularity, we perturb the diagonal of F′(y) and we try again.
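In outline, the iteration just described looks as follows; dense numpy algebra stands in for MA27, and the singularity handling is a simplified stand-in for the diagonal perturbation:

```python
import numpy as np

def newton_kkt(F, J, y, tol=1e-8, max_iter=5, pert=1e-8):
    """Newton's method for F(y) = 0 with a crude diagonal perturbation
    when the Jacobian is (numerically) singular."""
    for _ in range(max_iter):
        Fy = F(y)
        if np.linalg.norm(Fy, np.inf) <= tol:
            return y, True                  # converged
        Jy = J(y)
        try:
            dy = np.linalg.solve(Jy, -Fy)
        except np.linalg.LinAlgError:
            # perturb the diagonal and retry, mimicking the MA27 fallback
            dy = np.linalg.solve(Jy + pert * np.eye(len(y)), -Fy)
        y = y + dy
    return y, np.linalg.norm(F(y), np.inf) <= tol
```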
Tests with this procedure (Newton 1) were not satisfactory. We experienced many failures in situations in which one or more inequality constraints were wrongly identified as active after the Algencan phase of the algorithm. As a consequence, the reduced KKT system (12)–(14) turned out to be incompatible and the precision 10^{−8} could not be achieved.
Therefore, we tried a second heuristic Newton procedure, called here Newton 2. The idea is the following: after the Algencan phase, we define I_A, r, q, f̄, ḡ, h̄, z as above, but we replace each inequality constraint corresponding to i ∈ I_A by the equality constraint ḡ_i(z) + s_i^2/2 = 0, where s_i is an auxiliary slack variable, and we state the KKT system associated to the new problem. This KKT system includes (12)–(13) but, instead of (14), includes the equations

ḡ_i(z) + s_i^2/2 = 0 and µ_i s_i = 0, ∀ i ∈ I_A.    (15)
The system (12)–(13)–(15) has r more variables and equations than the system (12)–(14) but does not force the I_A-constraints to be active at the solution. Of course, if we solve the new system, the danger remains that, at the solution, some inequality constraint corresponding to i ∉ I_A may be violated. Moreover, some inequality Lagrange multiplier might become negative. Therefore, as in the case of Newton 1, we test both possibilities up to tolerance 10^{−8}. Fulfillment of all the tests reveals that a KKT point with precision 10^{−8} has been found.
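A sketch of assembling the residual of (12)–(13)–(15) follows, assuming callables for the reduced functions and their Jacobians; all names are ours:

```python
import numpy as np

def kkt_residual(z, s, lam, mu, grad_fbar, hbar, jac_hbar, gbar_act, jac_gbar_act):
    """Residual of the squared-slack KKT system (12)-(13)-(15); the *_act
    callables evaluate only the inequality constraints with indices in I_A."""
    dual = grad_fbar(z) + jac_hbar(z).T @ lam + jac_gbar_act(z).T @ mu  # (12)
    primal_eq = hbar(z)                                                 # (13)
    primal_in = gbar_act(z) + 0.5 * s**2                                # (15), constraints
    compl = mu * s                                                      # (15), complementarity
    return np.concatenate([dual, primal_eq, primal_in, compl])
```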
The whole Algencan-Newton procedure is described below:
Step 1. Call Algencan with precision ε̂. Let x̂ = xk , λ̂ = λk+ , µ̂ = µk+ be the final approxi-
mations obtained by Algencan at this step.
Step 2. Set I_A = {i | g_i(x̂) ≥ −ε̂}, add squared slack variables and call Newton, using x̂, λ̂, µ̂ to initialize this method and setting the initial estimates of the slack variables as s_i = √(2 max{0, −g_i(x̂)}), ∀ i ∈ I_A. Use a maximum of 5 Newtonian iterations. Declare Convergence of Newton when all the components of the system (12)–(13)–(15) are, in modulus, smaller than or equal to ε. Let x∗ be the solution given by Newton.
Step 3. If Newton converged and max{ICM, DFM} ≤ ε, stop declaring success and return x∗. In this case, both the convergence criteria of Algencan and Ipopt are satisfied.
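Schematically, the interplay of Steps 1–3 can be written as the loop below; the failure branch, in which control returns to Algencan (as reported in the Conclusions), and the tolerance-tightening rule are assumptions of this sketch, and both solver callables are placeholders:

```python
def algencan_newton(algencan, newton_accel, eps, eps_hat):
    """Schematic driver for Algorithm 3.1: low-precision Algencan runs
    followed by Newton acceleration attempts."""
    while True:
        x, lam, mu = algencan(eps_hat)              # Step 1: run to precision eps_hat
        ok, x_star = newton_accel(x, lam, mu, eps)  # Step 2: Newton on (12)-(13)-(15)
        if ok:
            return x_star                           # Step 3: KKT point to tolerance eps
        eps_hat = max(eps, 0.1 * eps_hat)           # assumed tightening before retrying
```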
4 Test problems
For testing the algorithms studied in this paper, we used three variable-dimension problems.
Hard-Spheres:

Minimize_{p^i, z}  z
subject to  ‖p^i‖^2 = 1, i = 1, . . . , np,
            ⟨p^i, p^j⟩ ≤ z, i = 1, . . . , np − 1, j = i + 1, . . . , np.
Enclosing-Ellipsoid [61]:

Minimize_{l_ij}  − Σ_{i=1}^{n_d} log(l_ii)
subject to  (p^i)^T L L^T p^i ≤ 1, i = 1, . . . , np,

where L ∈ IR^{nd×nd} is a lower-triangular matrix. The number of variables is nd × (nd + 1)/2 and the number of inequality constraints is np (plus the bound constraints). The np points p^i ∈ IR^{nd} are randomly generated using the Cauchy distribution as suggested in [61].
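A minimal sketch of evaluating this formulation, with the points stored as the rows of an array, follows; names are illustrative:

```python
import numpy as np

def ellipsoid_obj_and_constraints(L, points):
    """Enclosing-Ellipsoid: objective -sum_i log(l_ii) and the constraint
    values (p^i)^T L L^T p^i - 1, for lower-triangular L of shape (nd, nd)
    and points of shape (np, nd)."""
    obj = -np.sum(np.log(np.diag(L)))
    w = points @ L                      # row i holds (p^i)^T L
    cons = np.sum(w * w, axis=1) - 1.0  # feasible iff cons <= 0
    return obj, cons
```

The constraint values are returned in the "≤ 0" convention of problem (1).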
where u∗ is defined by

u∗(i, j, k) = 10 q(i) q(j) q(k) (1 − q(i)) (1 − q(j)) (1 − q(k)) e^{q(k)^{4.5}},

and

Δv(i, j, k) = [ v(i + 1, j, k) + v(i − 1, j, k) + v(i, j + 1, k) + v(i, j − 1, k) + v(i, j, k + 1) + v(i, j, k − 1) − 6 v(i, j, k) ] / h^2,

for i, j, k = 2, . . . , np − 1. The number of variables is np^3 and the number of equality constraints is (np − 2)^3. We set θ = −100, h = 1/(np − 1) and |S| = 7. The elements of S are randomly generated in [1, np]^3. This problem has no inequality constraints.
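The seven-point stencil Δv can be evaluated vectorially over all interior grid points; a sketch, assuming v is stored as an (np, np, np) array:

```python
import numpy as np

def discrete_laplacian(v, h):
    """Seven-point stencil Delta v(i,j,k) at the interior points of an
    (np, np, np) grid; returns an array of shape (np-2, np-2, np-2)."""
    return (v[2:, 1:-1, 1:-1] + v[:-2, 1:-1, 1:-1]
            + v[1:-1, 2:, 1:-1] + v[1:-1, :-2, 1:-1]
            + v[1:-1, 1:-1, 2:] + v[1:-1, 1:-1, :-2]
            - 6.0 * v[1:-1, 1:-1, 1:-1]) / h**2
```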
The Hard-Spheres and Enclosing-Ellipsoid problems possess many inequality constraints, whereas the Bratu-based problem has a poor KKT structure. According to the reasons stated in the Introduction, these are problems in which an Augmented Lagrangian method might present faster convergence to a loose-tolerance approximate solution than an Interior-Point Newtonian method.
The Fortran 77 implementations of these problems (including first and second derivatives)
are part of the “La Cumparsita” collection of test problems and are available through the Tango
Project web page, as well as the Fortran 77 implementation of Algencan. Moreover, Fortran 77
interface subroutines that add slack variables to the original formulation of the problems were
developed and are also available. This interface allows problems from “La Cumparsita” to be
tackled by methods that deal only with equality constraints and bounds.
We ran Algencan, Ipopt, Algencan-Ipopt and Algencan-Newton for many variations
of these problems. The convergence stopping criteria were equivalent for all the methods except
for Algencan. Algencan stops with the complementarity defined by ICM(k) ≤ ε (which
is related to the minimum between constraint and multiplier), whereas in the other methods
the measure of non-complementarity is the product between slack and multiplier. Of course,
sometimes one of these criteria is more strict, sometimes the other is. We used the tolerance
ε = 10−8 for declaring convergence in all the problems.
All the experiments were run on a 1.8GHz AMD Opteron 244 processor with 2GB of RAM, under the Linux operating system. The compiler option "-O4" was adopted.
5 Numerical Results
5.1 Nine Selected Problems
For each problem we will report: Number of Algencan iterations, number of Ipopt iterations,
number of Newton iterations, final infeasibility (sup norm), final objective function value,
computer time used by Algencan, computer time used by Ipopt, computer time used by
Newton, and total computer time. In the case of Algencan-Newton we also report the
number of times Algencan was called (Step 1 of Algorithm 3.1) and the number of times
Newton was called (Step 2).
2. For defining the initial approximation we compute np points in the unitary sphere. Each point p^k is generated taking all the combinations of angles so far defined. Therefore, np = 2 × n_grid^{nd − 1}. The initial approximation x^0 was formed by p^1, . . . , p^{np} followed by the variable z. The initial
z was taken as the maximum scalar product ⟨p^i, p^j⟩ for i ≠ j. The initial slack variables for Ipopt were taken in such a way that all the constraints are satisfied at the initial approximation.
The selected problems are defined by nd = 3 and ngrid = 7, 8, 9. Therefore, np = 98, 128, 162.
The results are reported in Table 1. The following conventions were used in this table, as well
as in Tables 2 and 3.
1. When reporting the iterations of Algencan-Ipopt the expression a + b means that the
method performed a iterations of Algencan and b iterations of Ipopt.
2. In the iterations report of Algencan-Newton the expression a(c) + b(d) means that a
iterations of Algencan and b iterations of Newton were performed. Moreover, Algen-
can was called c times, whereas Newton was called d times by Algorithm 3.1.
3. The expression (A: c%) indicates the percentage of the total time of the algorithm under
consideration that was used by Algencan. For example, in the Hard-Spheres problem
(3, 98) we read that Algencan-Newton converged using 6.54 seconds and that 97% of
the CPU time was employed by Algencan.
• Hard-Spheres (3, 98): nd = 3, np = 98, n without slacks: 295, n with slacks: 5048,
number of equality constraints: 98, number of inequality constraints: 4753, total number
of constraints: 4851.
• Hard-Spheres (3, 128): nd = 3, np = 128, n without slacks: 385, n with slacks: 8513, number of equality constraints: 128, number of inequality constraints: 8128, total number of constraints: 8256.

• Hard-Spheres (3, 162): nd = 3, np = 162, n without slacks: 487, n with slacks: 13528, number of equality constraints: 162, number of inequality constraints: 13041, total number of constraints: 13203.
Table 1: Results for the Hard-Spheres problems (3, 98), (3, 128) and (3, 162).
set to 0.
5.1.3 Bratu
We consider three particular Bratu-based problems, defined by np = 10, 16, 20. As initial ap-
proximation we took u ≡ 0.
• Bratu (10): np = 10, n: 1000, number of equality constraints: 512, total number of
constraints: 512.
• Bratu (16): np = 16, n: 4096, number of equality constraints: 2744, total number of
constraints: 2744.
• Bratu (20): np = 20, n: 8000, number of equality constraints: 5832, total number of
constraints: 5832.
Table 2: Results for the Enclosing-Ellipsoid problems (3, 1000), (3, 12000) and (3, 20000).
Table 3: Results for the Bratu-based problems (np = 10, 16, 20; θ = −100, #S = 7).
KKT Jacobian matrices. In these cases, it is better to persevere with the matrix-free techniques
of Algencan and Gencan.
However, we felt the need to confirm these conclusions using a broader comparison basis.
With this in mind, we generated the following problems:

• Twenty groups of Hard-Spheres problems fixing nd = 5 and choosing np ∈ {40, 41, . . . , 59}.

• Twenty groups of Enclosing-Ellipsoid problems fixing nd = 3 and choosing np ∈ {1000, 2000, . . . , 20000}.
These problems have 6 variables (without slacks) and no equality constraints. The number
of inequality constraints goes from 1000 to 20000.
• Sixteen groups of Bratu-based problems choosing np ∈ {5, 6, . . . , 20}. The size of these
problems goes from 125 variables with 27 equality constraints, to 8000 variables with 5832
equality constraints.
We generated ten instances of each problem within each group. The random generation of the i-th instance (i = 1, 2, . . . , 10) of a particular problem (including its initial point) was done using Schrage's random number generator [58] with seed s = 123456 × i. In the case of the Hard-Spheres problem, the difference between instances lies only in the initial point. This means that, in this case, we solve the same problem starting from ten different initial points. In the Enclosing-Ellipsoid and the Bratu-based problems, some data are also randomly generated. Therefore, in these two cases, the ten instances within the same group are in fact different problems with different initial points.
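For reference, here is a sketch of the portable Lehmer generator of [58] (multiplier 16807, modulus 2^31 − 1, computed with Schrage's factorization to avoid overflow); whether the implementation used in these experiments matches this exactly is an assumption:

```python
def schrage_generator(seed):
    """Lehmer generator x <- 16807 * x mod (2^31 - 1), implemented with
    Schrage's factorization so that intermediates fit in 32-bit integers."""
    a, m, q, r = 16807, 2**31 - 1, 127773, 2836   # m = a*q + r
    state = seed
    def next_uniform():
        nonlocal state
        hi, lo = divmod(state, q)
        t = a * lo - r * hi
        state = t if t > 0 else t + m
        return state / m                          # uniform sample in (0, 1)
    return next_uniform

# e.g., the i-th instance of a problem would use seed s = 123456 * i
rand = schrage_generator(123456 * 1)
u = rand()
```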
The initial approximations were generated in the following way:
• Hard-Spheres: The initial point is randomly generated with p^i ∈ [−1, 1]^{nd} for i = 1, . . . , np and z ∈ [0, 1].
• Enclosing-Ellipsoid: The initial point is randomly generated with lij ∈ [0, 1].
In total we have 560 problems, divided into 56 groups. Table 4 provides the most important characteristics of each group of problems. The numerical results are summarized in Table 5. Each row of this table shows the average final functional value and computer time of a method over the 10 problems of the group. ALprop denotes the average fraction of computer time used by Algencan within Algencan-Ipopt.
Below we report some performance observations that do not appear in the table.
• Ipopt failed to satisfy the optimality condition in one of the Enclosing-Ellipsoid problems
of Group 9. However, the feasibility and the complementarity conditions were satisfied for
this problem.
• Ipopt failed to satisfy the optimality condition in 24 Bratu problems. Algencan-Ipopt
failed to satisfy optimality in 29 Bratu problems. In all these problems, Ipopt stopped
very close to a solution, except in one case, which corresponds to group 14 of Bratu.
Observe that, in this case, the final average objective function value is very high.
• Algencan did not satisfy the optimality criterion in two individual problems, corresponding to groups 19 and 20 of Hard-Spheres. In any case, the final point was feasible in both cases (with the required precision) and the functional value was comparable to the one achieved in the other problems.
6 Conclusions
For a number of reasons displayed in the Introduction, we believe that Augmented Lagrangian
methods based on the PHR formula will continue to be used for solving practical optimization
problems for many years. In this paper we studied ways for alleviating their main inconve-
nience: the slow convergence near a solution. We showed two different ways of overcoming this
disadvantage. One is to combine the Augmented Lagrangian method Algencan [2] with a fast
Interior-Point Newtonian solver (Ipopt). The other relies on the combination of the Augmented Lagrangian algorithm with the straightforward Newton method that uses the Algencan-
identification of active constraints. For computing Newtonian steps, we used a standard sparse
matrix solver. Of course, this destroys the matrix-free advantages of the Augmented Lagrangian
approach. However, we are confident that the employment of iterative saddle-point solvers [6]
should overcome this drawback.
The numerical experiments showed that, in contrast with our initial expectations, the combi-
nation Algencan-Ipopt was not successful. This is probably due to the fact that, as mentioned
in the Introduction, the tested problems exhibit characteristics that do not favor the application
of SQP or Interior-Point ideas, even if we start from good initial approximations. Hard-Spheres
and Enclosing-Ellipsoid problems have many constraints, whereas the Jacobian KKT structure
of the Bratu-based problems is hard for sparse matrix solvers.
On the other hand, Algencan-Newton, which consists of the application of Newton’s
method with a reduced number of squared slacks, starting from the Algencan low-precision ap-
proximation, was relatively successful in the many-constraints problems. In Enclosing-Ellipsoid
problems, the number of slacks added to the ordinary variables of the problem is small and,
so, Newton deals well with the KKT system identified by Algencan. In several Selected
Problems, Newton failed at some iterations of Algorithm 3.1. In these cases, the control came back to Algencan and the computer time ended up being satisfactory, in spite of the initial frustrated Newtonian attempts. However, Algencan-Newton was not as efficient in the
Hard-Spheres massive comparison as it was in the Selected Problems. The reason is that, in
the selected problems, each row of the constraint Jacobian matrix contains 7 nonnull elements
(including the slack), whereas in the massive comparison the number of non-null row Jacobian
        Problem parameters   Original formulation   Adding slack variables
Group      nd      np            n         m             n         m
  1         5      40           201       820           981       820
  2         5      41           206       861         1,026       861
  3         5      42           211       903         1,072       903
  4         5      43           216       946         1,119       946
  5         5      44           221       990         1,167       990
  6         5      45           226     1,035         1,216     1,035
                          Hard-Spheres problem

Table 4: Description of the problems.
          Algencan            Ipopt               Algencan+Ipopt                Algencan+Newton
Group   Time     f         Time     f          Time  ALprop     f             Time     f
  1     1.02  5.1045E-01   6.65  5.1663E-01    7.86  (0.17)  5.1663E-01       1.87  5.1052E-01
  2     2.07  5.1792E-01   8.84  5.2259E-01   10.37  (0.14)  5.2259E-01       1.63  5.1830E-01
  3     1.94  5.2366E-01   9.46  5.2803E-01   10.72  (0.13)  5.2803E-01       1.63  5.2368E-01
  4     1.96  5.2861E-01  11.74  5.3451E-01   12.91  (0.11)  5.3451E-01       1.75  5.2786E-01
  5     1.69  5.3392E-01  13.04  5.4147E-01   14.30  (0.10)  5.4147E-01       1.75  5.3308E-01
  6     2.02  5.4174E-01  12.06  5.4596E-01   13.95  (0.11)  5.4596E-01       2.19  5.4240E-01
                          Hard-Spheres problem

Table 5: Massive comparison.
elements goes from 11 to 17. This difference is enough to reduce the comparative efficiency of the Newtonian sparse matrix solver. Recall that MA27 does not take advantage of the specific structure and saddle-point characteristics of KKT systems. So, it is reasonable to conjecture that its replacement by a specific saddle-point solver would be more efficient. This observation leads us to recommend, once more, the employment of (direct or iterative) specific linear saddle-point solvers as surveyed in [6].
No claims are made in this paper with respect to the behavior of Algencan, Ipopt or the combined methods on problems with characteristics different from the ones studied here. We
believe, for example, that SQP-Interior Point ideas are very effective for small to medium scale
problems, or even large-scale problems with a moderate number of inequalities and reasonable
KKT Jacobian structure. Probably, in most of these situations, SQP-IP methods are more
efficient than Augmented Lagrangian algorithms. However, more numerical experimentation is
necessary in order to obtain reliable practical conclusions.
Acknowledgement. We are indebted to an anonymous referee for careful reading and encour-
aging words about this paper.
References
[1] R. Andreani, E. G. Birgin, J. M. Martı́nez and M. L. Schuverdt, Augmented Lagrangian
methods under the Constant Positive Linear Dependence constraint qualification, Mathe-
matical Programming 111, pp. 5–32, 2008.
[3] R. Andreani, J. M. Martı́nez and M. L. Schuverdt, On the relation between the Constant
Positive Linear Dependence condition and quasinormality constraint qualification, Journal
of Optimization Theory and Applications 125, pp. 473–485, 2005.
[4] M. Argáez and R. A. Tapia, On the global convergence of a modified augmented Lagrangian
linesearch interior-point method for Nonlinear Programming, Journal of Optimization The-
ory and Applications 114, pp. 1–25, 2002.
[5] S. Bakhtiari and A. L. Tits, A simple primal-dual feasible interior-point method for non-
linear programming with monotone descent, Computational Optimization and Applications
25, pp. 17–38, 2003.
[6] M. Benzi, G. H. Golub and J. Nielsen, Numerical solution of saddle-point problems, Acta
Numerica 14, pp. 1–137, 2005.
[8] E. G. Birgin, C. A. Floudas and J. M. Martínez, Global minimization using an Augmented Lagrangian method with variable lower-level constraints, available in Optimization Online, E-Print ID: 2006-12-1544, http://www.optimization-online.org/DB_HTML/2006/12/1544.html.
[13] E. G. Birgin, J. M. Martı́nez and M. Raydan, Inexact Spectral Projected Gradient methods
on convex sets, IMA Journal on Numerical Analysis 23, pp. 539–559, 2003.
[14] E. G. Birgin, J. M. Martínez and D. P. Ronconi, Optimizing the Packing of Cylinders into a
Rectangular Container: A Nonlinear Approach, European Journal of Operational Research
160, pp. 19–33, 2005.
[15] I. Bongartz, A. R. Conn, N. I. M. Gould and Ph. L. Toint, CUTE: constrained and un-
constrained testing environment, ACM Transactions on Mathematical Software 21, pp.
123–160, 1995.
[16] O. Burdakov, J. M. Martı́nez and E. A. Pilotta, A limited memory multipoint secant method
for bound constrained optimization, Annals of Operations Research 117, pp. 51–70, 2002.
[17] R. H. Byrd, J. Ch. Gilbert and J. Nocedal, A trust region method based on interior point
techniques for nonlinear programming, Mathematical Programming 89, pp. 149–185, 2000.
[18] R. H. Byrd, N. I. M. Gould, J. Nocedal and R. A. Waltz, An algorithm for nonlinear op-
timization using linear programming and equality constrained subproblems, Mathematical
Programming 100, pp. 27–48, 2004.
[19] R. H. Byrd, J. Nocedal and A. Waltz, Feasible interior methods using slacks for nonlinear
optimization, Computational Optimization and Applications 26, pp. 35–61, 2003.
[20] L. Chen and D. Goldfarb, Interior-Point `2 penalty methods for nonlinear programming
with strong global convergence properties, CORC Technical Report TR 2004-08, IEOR
Department, Columbia University, 2005.
[21] A. R. Conn, N. I. M. Gould, D. Orban and Ph. L. Toint, A primal-dual trust-region algo-
rithm for nonconvex nonlinear programming, Mathematical Programming 87, pp. 215–249,
2000.
[22] A. R. Conn, N. I. M. Gould and Ph. L. Toint, A globally convergent Augmented Lagrangian
algorithm for optimization with general constraints and simple bounds, SIAM Journal on
Numerical Analysis 28, pp. 545–572, 1991.
[23] A. R. Conn, N. I. M. Gould and Ph. L. Toint, Lancelot: A Fortran package for large
scale nonlinear optimization, Springer-Verlag, Berlin, 1992.
[24] A. R. Conn, N. I. M. Gould and Ph. L. Toint, Trust Region Methods, MPS/SIAM Series
on Optimization, SIAM, Philadelphia, 2000.
[25] H. Conway and N. J. A. Sloane, Sphere Packings, Lattices and Groups, 3rd ed., New York,
Springer-Verlag, 1999.
[26] Y-H Dai and R. Fletcher, Projected Barzilai-Borwein methods for large-scale box-
constrained quadratic programming, Numerische Mathematik 100, pp. 21–47, 2005.
[28] E. D. Dolan and J. J. Moré, Benchmarking optimization software with performance profiles,
Mathematical Programming 91, pp. 201–213, 2002.
[29] F. Facchinei, A. Fischer and C. Kanzow, On the accurate identification of active constraints,
SIAM Journal on Optimization 9, pp. 14–32, 1998.
[31] R. Fletcher, N. I. M. Gould, S. Leyffer, Ph. L. Toint and A. Wächter, Global convergence
of a trust-region SQP-filter algorithm for general nonlinear programming, SIAM Journal
on Optimization 13, pp. 635–659, 2002.
[32] A. Forsgren, P. E. Gill and M. H. Wright, Interior point methods for nonlinear optimization,
SIAM Review 44, pp. 525–597, 2002.
[33] E. M. Gertz and P. E. Gill, A primal-dual trust region algorithm for nonlinear optimization,
Mathematical Programming 100, pp. 49–94, 2004.
[34] P. E. Gill, W. Murray and M. A. Saunders, SNOPT: An SQP algorithm for large-scale
constrained optimization, SIAM Review 47, pp. 99–131, 2005.
[35] C. C. Gonzaga, E. Karas and M. Vanti, A globally convergent filter method for Nonlinear
Programming, SIAM Journal on Optimization 14, pp. 646–669, 2003.
[36] N. I. M. Gould, D. Orban, A. Sartenaer and Ph. L. Toint, Superlinear Convergence of
Primal-Dual Interior Point Algorithms for Nonlinear Programming, SIAM Journal on Op-
timization 11, pp. 974–1002, 2000.
[37] N. I. M. Gould, D. Orban and Ph. L. Toint, GALAHAD: a library of thread-safe Fortran
90 packages for large-scale nonlinear optimization, ACM Transactions on Mathematical
Software 29, pp. 353–372, 2003.
[38] N. I. M. Gould, D. Orban and Ph. L. Toint, An interior point `1 penalty method for
nonlinear optimization, Computational Science and Engineering Department, Rutherford
Appleton Laboratory, Chilton, Oxfordshire, England, 2003.
[39] W. W. Hager and M. S. Gowda, Stability in the presence of degeneracy and error estimation,
Mathematical Programming 85, pp. 181–192, 1999.
[40] W. W. Hager and H. C. Zhang, A new active set algorithm for box constrained optimization,
SIAM Journal on Optimization 17, pp. 526–557, 2006.
[41] M. R. Hestenes, Multiplier and gradient methods, Journal of Optimization Theory and
Applications 4, pp. 303–320, 1969.
[42] C. T. Kelley, Iterative methods for linear and nonlinear equations, SIAM, 1995.
[44] R. M. Lewis and V. Torczon, A globally convergent augmented Lagrangian pattern search
algorithm for optimization with general constraints and simple bounds, SIAM Journal on
Optimization 12, pp. 1075–1089, 2002.
[45] C. Lin and J. J. Moré, Newton’s method for large bound-constrained optimization problems,
SIAM Journal on Optimization 9, pp. 1100–1127, 1999.
[46] X. Liu and J. Sun, A robust primal-dual interior point algorithm for nonlinear programs,
SIAM Journal on Optimization 14, pp. 1163–1186, 2004.
[48] J. M. Martı́nez, Inexact Restoration Method with Lagrangian tangent decrease and new
merit function for Nonlinear Programming, Journal of Optimization Theory and Applica-
tions 111, pp. 39–58, 2001.
[49] J. M. Martı́nez and E. A. Pilotta, Inexact restoration methods for nonlinear programming:
advances and perspectives, in Optimization and Control with applications, edited by L. Q.
Qi, K. L. Teo and X. Q. Yang. Springer, pp. 271–292, 2005.
[50] J. M. Moguerza and F. J. Prieto, An augmented Lagrangian interior-point method using
directions of negative curvature, Mathematical Programming 95, pp. 573–616, 2003.
[51] Q. Ni and Y-X Yuan, A subspace limited memory quasi-Newton algorithm for large-scale
nonlinear bound constrained optimization, Mathematics of Computation 66, pp. 1509–1520,
1997.
[52] C. Oberlin and S. J. Wright, Active set identification in Nonlinear Programming, SIAM
Journal on Optimization 17, pp. 577–605, 2006.
[53] E. Polak and J. Royset, On the use of augmented Lagrangians in the solution of generalized
semi-infinite min-max problems, Computational Optimization and Applications 2, pp. 173–
192, 2005.
[55] L. Qi and Z. Wei, On the constant positive linear dependence condition and its application
to SQP methods, SIAM Journal on Optimization 10, pp. 963–981, 2000.
[56] R. T. Rockafellar, Augmented Lagrange multiplier functions and duality in nonconvex pro-
gramming, SIAM Journal on Control 12, pp. 268–285, 1974.
[57] R. T. Rockafellar, Lagrange multipliers and optimality, SIAM Review 35, pp. 183–238,
1993.
[58] L. Schrage, A more portable Fortran random number generator, ACM Transactions on
Mathematical Software 5, pp. 132–138, 1979.
[59] D. F. Shanno and R. J. Vanderbei, Interior-point methods for nonconvex nonlinear pro-
gramming: orderings and high-order methods, Mathematical Programming 87, pp. 303–316,
2000.
[61] M. Todd and E. A. Yildirim, On Khachiyan’s algorithm for the computation of minimum
volume enclosing ellipsoids, TR 1435, School of Operations Research and Industrial Engi-
neering, Cornell University, 2005.
[62] P. Tseng, A convergent infeasible interior-point trust-region method for constrained mini-
mization, SIAM Journal on Optimization 13, pp. 432–469, 2002.
[64] A. Wächter and L. T. Biegler, Failure of global convergence for a class of interior point
methods for nonlinear programming, Mathematical Programming 88, pp. 565–574, 2000.
[66] R. A. Waltz, J. L. Morales, J. Nocedal and D. Orban, An interior algorithm for nonlinear
optimization that combines line search and trust region steps, Mathematical Programming
107, pp. 391–408, 2006.
[68] S. J. Wright, Modifying SQP for degenerate problems, SIAM Journal on Optimization 13,
pp. 470–497, 2002.
[69] H. Yamashita and H. Yabe, An interior point method with a primal-dual quadratic barrier
penalty function for nonlinear optimization, SIAM Journal on Optimization 14, pp. 479–
499, 2003.
[70] B. Zhou, L. Gao and Y-H Dai, Monotone projected gradient methods for large-scale box-
constrained quadratic programming, Science in China Series A - Mathematics 49, pp.
688–702, 2006.