J Glob Optim DOI 10.1007/s10898-009-9399-x
pro
of
Author Proof
Solutions and optimality criteria for nonconvex constrained global optimization problems with connections between canonical and Lagrangian duality David Yang Gao · Ning Ruan · Hanif D. Sherali
Received: 2 November 2007 / Accepted: 6 January 2009 © Springer Science+Business Media, LLC. 2009
2 3 4 5 6 7 8 9 10
Abstract This paper presents a canonical duality theory for solving a general nonconvex quadratic minimization problem with nonconvex constraints. By using the canonical dual transformation developed by the first author, the nonconvex primal problem can be converted into a canonical dual problem with zero duality gap. A general analytical solution form is obtained. Both global and local extrema of the nonconvex problem can be identified by the triality theory associated with the canonical duality theory. Illustrative applications to quadratic minimization with multiple quadratic constraints, box/integer constraints, and general nonconvex polynomial constraints are discussed, along with insightful connections to classical Lagrangian duality. Criteria for the existence and uniqueness of optimal solutions are presented. Several numerical examples are provided.
unc orre cted
1
12
Keywords Canonical duality theory · Triality · Lagrangian duality · Global optimization · Integer programming
13
1 Introduction
11
14
15
16 17
18
In this paper, we address the following general constrained nonlinear programming problem: 1 (P ) : min P(x) = x T Ax − x T f : x ∈ Xc , (1) 2 where A = {Ai j } ∈ Rn×n is a symmetric matrix, f ∈ Rn is a given vector, the feasible space Xk ⊂ Rn is defined as (2) Xc = x ∈ Xa | g(x) ≤ d ∈ Rm , D. Y. Gao (B) · N. Ruan Department of Mathematics, Virginia Tech, Blacksburg, VA 24061, USA e-mail:
[email protected] D. Y. Gao · N. Ruan · H. D. Sherali Grado Department of Industrial and Systems Engineering, Virginia Tech, Blacksburg, VA 24061, USA
123 Journal: JOGO MS: JOGO-07-692 CMS: 10898_2009_9399
TYPESET
DISK
LE
CP Disp.:2009/1/15 Pages: 25 Layout: Small
J Glob Optim
23 24 25 26 27
L(x, σ ) =
28
29 30 31 32 33
1 T x Ax − x T f + σ T (g(x) − d). 2
P ∗ (σ ) = inf L(x, σ ),
34
(4)
x∈Xa
35 36 37
where, under certain constraint qualifications that insure the existence of a Karush– Kuhn–Tucker (KKT) solution (see [1]), we have the following strong min–max duality relation: inf P(x) = sup P ∗ (σ ). σ ∈Rm+
38
x∈Xc
39 40 41 42 43
inf P(x) ≥ sup P ∗ (σ ). σ ∈Rm+
x∈Xc
45
47 48 49 50 51 52 53 54 55 56 57 58 59
(5)
In this case, the problem can be solved easily by any well-developed convex programming technique. However, due to the assumed nonconvexity of Problem (P ), the Lagrangian L(x, σ ) is no longer a saddle function and the Fenchel-Young inequality leads to only the following weak duality relation in general:
44
46
(3)
If all the components of g(x) are convex functions, and A 0, i.e., positive semidefinite (PSD), then Problem (P ) has a convex quadratic objective function and convex constraints, and the Lagrangian is a saddle function, i.e., L(x, σ ) is convex in the primal variables x, concave (linear) in the dual variables (Lagrange multipliers) σ , and the Lagrangian dual problem can be easily defined by the Fenchel–Moreau–Rockafellar transformation
unc orre cted
Author Proof
22
of
20 21
where g(x) = {gα (x)} : Xa → Rm is a given vector-valued differentiable (not necessary convex) function, Xa is a convex open set in Rn , and d ∈ Rm is a given vector. (We follow the traditional tensor notation used in finite deformation theory [10], where the indices i, j represent components of entities in Rn or Rn×n , while α, β represent components in other spaces). The problem (P ) involves minimizing a nonconvex quadratic function over a nonconvex m feasible space. By introducing a Lagrangian multiplier vector σ ∈ Rm + = {σ ∈ R | σ ≥ 0} m to relax the inequality constraints in Xc , the classical Lagrangian L : Xa × R+ → R is given by
pro
19
(6)
The slack θ in the inequality (6) is called the duality gap in global optimization, where possibly, θ = ∞. This duality gap shows that the well-developed Fenchel–Moreau–Rockafellar duality theory can be used only for solving convex minimization problems. Also, due to the nonconvexity of the objective function and/or constraints, the problem may have multiple local solutions. The identification of a global minimizer has been a fundamentally challenging task in global optimization. The classical Lagrangian duality theory was originally developed in the context of analytical mechanics [38]. In the realm of linear elasticity, the primal problem (P ) is usually a convex variational problem in functional space, while its dual (P ∗ ) is called the complementary variational problem. In mechanics and mathematical physics, the terminology “complementary” connotes the strong (or perfect) duality, i.e., the so-called canonical duality [10], and the strong duality relation (5) is also called the complementary variational principle. However, in finite deformation elasticity, the primal function P(x) is usually nonconvex and the existence of a purely stress-(dual variable)-based complementary variational principle has invoked a lively debate existing for more than 50 years, since E. Reissner (1953) [37,39,41].
123 Journal: JOGO MS: JOGO-07-692 CMS: 10898_2009_9399
TYPESET
DISK
LE
CP Disp.:2009/1/15 Pages: 25 Layout: Small
J Glob Optim
94
2 Canonical dual problem and strong duality theorem
64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92
95
96
97
98
99
100
pro
Author Proof
62 63
unc orre cted
61
of
93
In order to close the duality gap inherent in the classical Lagrange duality theory, a socalled canonical duality theory has been developed, first in nonconvex analysis [27] and mechanics [7,10], and then in global optimization [12,16,26]. This new theory is composed mainly of a potentially useful canonical dual transformation, an associated complementarydual principle, and a triality theory, whose components comprise a saddle min–max duality and two pairs of double-min, double-max dualities. The canonical dual transformation can be used to formulate perfect dual problems without a duality gap; the complementary-dual principle shows that the primal problem is equivalent to its canonical dual in the sense that they have the same KKT points; while the triality theory can be used to identify both global and local extrema. The key step in the canonical dual transformation is to introduce a geometrical operator ξ = (x) and a canonical function V (ξ ) (whose Legendre or Fenchel conjugate can be uniquely defined) such that the nonconvex constraint can be written in the canonical form g(x) = V ((x)). Thus, the complementary (i.e., the canonical dual) function can be uniquely formulated via the Legendre-Fenchel transformation. The original idea of this canonical dual transformation comes from the joint work of Gao and Strang [27] on nonconvex/nonsmooth variational problems. They discovered that a necessary condition for a minimizer depends on the Gâteaux derivative t of the geometrical operator, while a sufficient condition and the so-called complementary gap function depends on the complementary operator c (x) := (x) − t x. This complementary gap function closes the duality gap present in Lagrangian duality and plays an important role in nonconvex analysis and global optimization. Some open problems in one-dimensional nonlinear elasticity have thus been solved recently [22,23]. A comprehensive review on the canonical duality theory and connections between nonconvex mechanics and global optimization appear in Gao and Sherali [26]. The purpose of the present paper is to illustrate the application of the canonical duality theory for solving the foregoing quadratic minimization problem with nonconvex constraints. In the next section, we will show how to use the canonical dual transformation to convert the nonconvex constrained problem into a canonical dual problem, in order to derive related global optimality conditions. Certain particular cases along with numerical examples are used to illustrate the theory in Sect. 3 (quadratic constraints), Sect. 4 (box and integer constraints), and Sect. 5 (polynomial constraints), for using the canonical dual problem to obtain all critical points. We also expose certain insightful connections with classical Lagrangian duality for these special applications. Finally, Sect. 6 provides a summary, some open problems, and conclusions.
60
For analytical convenience, we introduce an indicator function of the feasible set Xc : 0 if ǫ ≤ d W (ǫ) = +∞ otherwise and let
(7)
1 U (x) ≡ −P(x) = x T f − x T Ax. 2
Then the primal problem (P ) can be written in the following unconstrained form: min {(x) = W (g(x)) − U (x) : x ∈ Xa } .
(8)
123 Journal: JOGO MS: JOGO-07-692 CMS: 10898_2009_9399
TYPESET
DISK
LE
CP Disp.:2009/1/15 Pages: 25 Layout: Small
J Glob Optim
103 104
which is convex and lower semi-continuous (l.s.c.) on Rm . From convex analysis [2,44], the following relations hold for (ǫ, σ ) ∈ Rm × Rm : σ ∈ ∂ W (ǫ) ⇔ ǫ ∈ ∂ W ♯ (σ ) ⇔ W (ǫ) + W ♯ (σ ) = ǫ T σ .
105
107 108
Replacing W (g(x)) in (x) by g T (x)σ − W ♯ (σ ) based on the Fenchel–Young equality, the extended Lagrangian o : Xa × Rm → R ∪ {∞} associated with the problem (8) can be given as
pro
106
o (x, σ ) = g T (x)σ − W ♯ (σ ) − U (x).
109
110 111 112 113
ξ = {ξβα } = (x) : Xa ⊂ Rn → Ea ⊂ Rm× pα ,
114
115 116
and a canonical function V : Ea → written in the canonical form:
119
120
121 122 123 124 125 126
127
Rm
(11)
such that the nonconvex constraint g(x) can be
g(x) = V ((x)).
117
118
(10)
Clearly, we have o (x, σ ) = L(x, σ ), ∀(x, σ ) ∈ Xa × Rm +. Since g(x) is a nonconvex function, following the standard procedure of the canonical dual transformation (see [10]), we assume that there exists a Gâteaux differentiable geometrical operator
unc orre cted
Author Proof
102
By the Fenchel transformation, the conjugate function W ♯ (σ ) of W (ǫ) can be defined by T d σ if σ ≥ 0 ♯ T W (σ ) = sup {ǫ σ − W (ǫ)} = (9) +∞ otherwise, m ǫ ∈R
of
101
(12)
By the definition introduced in [10], a function V : Ea → Rm is called canonical if it is Gâteaux differentiable on Ea such that the duality mapping ∂ Vα (ξ ) β ς = {ςα } = ∇V (ξ ) = : Ea → Ea∗ ⊂ R pα ×m , (13) ∂ξβα is invertible. We note that the geometric variable ξ = {ξβα } is an m × pα matrix, while its β
canonical dual variable ς = {ςα } is a pα ×m matrix. In finite deformation theory and theoretical physics, these are called second order two-point tensors [10]. For the constrained problem (P ) considered in this paper, the dimension pα of the geometrical variable ξ = {ξβα } depends on each given constraint gα (x) ≤ dα , α = 1, . . . , m. Let Iα = {β| β ∈ {1, . . . , pα }} be an index set and let ⎫ ⎧ ⎬ ⎨
α β : Ea × Ea∗ → Rm ξβ ςα ξ ; ς = ⎭ ⎩ β∈Iα
128 129
denote the partial bilinear form on the product space Ea × Ea∗ . Thus, the Legendre conjugate V ∗ : Ea∗ → Rm of V can be defined by V ∗ (ς) = sta{ξ ; ς − V (ξ ) : ξ ∈ Ea },
130
131 132 133
134
where the notation sta{∗} denotes computing the stationary points of {∗}. By the assumption that the duality relation (13) is invertible (i.e., canonical), the Legendre conjugate V ∗ (ς ) is uniquely defined on Ea∗ and the inverse duality relation can be written as ∂ Vα∗ (ς ) ∗ ξ = ∇V (ς ) = . (14) β ∂ςα
123 Journal: JOGO MS: JOGO-07-692 CMS: 10898_2009_9399
TYPESET
DISK
LE
CP Disp.:2009/1/15 Pages: 25 Layout: Small
J Glob Optim
137 138
ς = ∇V (ξ ) ⇔ ξ = ∇V ∗ (ς ) ⇔ ξ ; ς = V (ξ ) + V ∗ (ς ).
In the terminology used in [10], (ξ , ς ) forms a canonical duality pair on Ea × Ea∗ . Thus, noting (12) and (15) to replace g(x) in (10) by V ((x)) = (x); ς − V ∗ (ς),
139
141
142
143 144
145
146 147
148
149 150
151
152 153 154
155
156 157
158 159 160
161
162
163 164 165 166
167
168 169
170
we define the so-called generalized total complementary function (or the extended Lagrang∗ ian [10,27]) : Xa × Rm + × Ea → R as
pro
140
(15)
1 (x, σ , ς) = σ T [(x); ς − V ∗ (ς ) − d] + x T Ax − x T f, (16) 2 m where σ ∈ Rm + is the dual variable vector associated with g(x) ≤ d ∈ R . Through this total complementary function, the canonical dual function can be defined by P d (σ , ς) = sta {(x, σ , ς) : x ∈ Xa } = U (σ , ς ) − σ T (V ∗ (ς) + d),
(17)
where U (σ , ς ) is the parametric -conjugate function of the quadratic function U (x) = − 21 x T Ax + x T f defined by the following -conjugate transformation [10,12]:
unc orre cted
Author Proof
136
It is easy to verify that the following equivalent relations hold on Ea × Ea∗ :
of
135
U (σ , ς ) = sta{σ T (x); ς − U (x) : x ∈ Xa }.
Rm +
(18)
Ea∗
× be a canonical dual feasible space on which the canonical dual Letting Sc ⊂ function P d (σ , ς) is well defined, the canonical dual problem can be posed as follows: (P d ) : sta{P d (σ , ς ) : (σ , ς) ∈ Sc }.
(19)
Theorem 1 (Complementary-Dual Principle) The problem (P d ) is canonically dual to the primal problem (P ) in the sense that if (¯x, σ¯ , ς¯ ) is a critical point of (x, σ , ς) over ∗ ¯ is a KKT point of (P d ), and ¯ is a KKT point of (P ), (σ¯ , ς) (x, σ , ς ) ∈ Xa × Rm + × Ea , then x P(¯x) = (¯x, σ¯ , ς¯ ) = P d (σ¯ , ς¯ ).
(20)
Proof If (¯x, σ¯ , ς¯ ) is a critical point of as hypothesized, then we have the following criticality conditions: ∇x (¯x, ς¯ , σ¯ ) = σ¯ T t (¯x); ς¯ + Ax¯ − f = 0,
(21)
¯ σ¯ ) = (¯x) − ∇V ∗ (ς) ¯ = 0, ∇ς (¯x, ς,
(22)
where t (x) = ∇(x) denotes the Gâteaux derivative of , along with the conditions: 0 ≤ σ¯ ⊥ (¯x); ς¯ − V ∗ (ς¯ ) − d ≤ 0, (23)
where the notation ⊥ represents the complementarity or orthogonality condition. Since (ξ , ς ) is a canonical duality pair on Ea × Ea∗ , the criticality condition (22) is equivalent to ς¯ = ∇ξ V ((¯x)) = ∂ V (ξ (¯x))/∂ξ . Substituting this into (21) and using the chain rule to deduce ∇g(¯x) = t (¯x); ∇ξ V ((¯x)), we have, Ax¯ − f + σ¯ T ∇g(¯x) = ∇x L(¯x, σ¯ ) = 0.
This is the criticality condition of the primal problem (P ). By the Legendre equality (¯x); ¯ − V ∗ (ς¯ ) = V ((¯x)), the condition (23) can be written as ς 0 ≤ σ¯ ⊥ (g(¯x) − d) ≤ 0.
123 Journal: JOGO MS: JOGO-07-692 CMS: 10898_2009_9399
TYPESET
DISK
LE
CP Disp.:2009/1/15 Pages: 25 Layout: Small
J Glob Optim
172
This shows that x¯ is a KKT point of the primal problem (P ). From the complementarity condition (23), we have, (¯x, σ¯ , ς¯ ) = P(¯x).
174 175
On the other hand, by the definition of the canonical dual function, if (¯x, σ¯ , ς¯ ) is a KKT point as hypothesized, the criticality condition (21) leads to ¯ = σ¯ T (¯x); ς¯ − U (¯x). U (σ¯ , ς)
176
178 179 180 181
182
183 184 185 186
Therefore, (¯x, σ¯ , ς¯ ) = P d (σ¯ , ς¯ ) and (σ¯ , ς¯ ) is a KKT point of the dual problem (P d ). ⊓ ⊔
pro
177
This theorem shows that there is no duality gap between the primal problem and its canonical dual. In order to identify the global minimizer, we need to study the convexity of the generalized complementary function (x, σ , ς ). Without much loss of generality, we introduce the following assumptions: (i) the geometrical operator (x) : Rn → Ea is twice Gâteaux differentiable; (ii) the canonical function V : Ea → Rm is convex.
Sc+ = {(σ , ς ) ∈ Sc | G a (x, σ , ς ) 0, ∀x ∈ Xa }
187
188
189 190 191
Theorem 2 (Global Optimality Condition) Suppose that the conditions in (24) hold and that ¯ is a global maximizer ¯ ∈ Sc+ , then (σ¯ , ς) (¯x, σ¯ , ς¯ ) is a critical point of (x, σ , ς ). If (σ¯ , ς) d + of P on Sc and x¯ is a global minimizer of P on Xc , i.e., P(¯x) = min P(x) = x∈Xc
194 195 196 197 198 199
200
201 202
max
(σ ,ς )∈Sc+
P d (σ , ς ) = P d (σ¯ , ς¯ ).
(26)
Proof Actually, this theorem is a special application of the general theory proposed by Gao and Strang in [27]. By the convexity of V (ξ ), its Legendre conjugate V ∗ : Ea∗ → Rm is also T ∗ ∗ convex. Thus, for any given σ ∈ Rm + , the linear combination σ V (ς ) : Ea → R is convex and the generalized complementary function (x, σ , ς ) is concave in ς . By considering σ ∈ Rm + as a Lagrange multiplier for the inequality constraint in Xc , the complementary function (x, σ , ς) can be viewed as a concave (linear) function of σ ∈ Rm + for any given (x, ς) ∈ Xa × Ea∗ . Therefore, for any given x ∈ Rn , we have P(x) if x ∈ Xc , maxm max∗ (x, σ , ς ) = maxm L(x, σ ) = ∞ otherwise . σ ∈R + σ ∈R+ ς ∈Ea ∗ Moreover, if (σ , ς ) ∈ Sc+ ⊂ Rm + × Ea , then (x, σ , ς ) is convex in x ∈ Xa and concave in m ς for any given σ ∈ R+ . Therefore, if (¯x, σ¯ , ς¯ ) ∈ Xa × Sc+ is a critical point of , we have
(¯x, σ¯ , ς¯ ) = min
203
max
x∈Xa (σ ,ς )∈Sc+
=
204
205
(25)
be a subset of Sc . We have the following theorem.
192
193
(24)
By this assumption, we know that the conjugate function V ∗ (ς ) : Ea∗ → Rm is also convex, and for any given (σ , ς ) ∈ Sc , the generalized complementary function (x, σ , ς) is twice Gâteaux differentiable on x. Let G a (x, σ , ς ) = ∇x2 (x, σ , ς) denote the Hessian matrix of (x, σ , ς ) and let
unc orre cted
Author Proof
173
of
171
max
(x, σ , ς ) = min P(x) x∈Xc
min (x, σ , ς ) =
(σ ,ς )∈Sc+ x∈Xa
max
(σ ,ς )∈Sc+
P d (σ , ς ).
By Theorem 1, we then have (26).
⊔ ⊓
123 Journal: JOGO MS: JOGO-07-692 CMS: 10898_2009_9399
TYPESET
DISK
LE
CP Disp.:2009/1/15 Pages: 25 Layout: Small
J Glob Optim
209
210 211
(x, σ , ς) =
212
213
214
215
216
217
218
219 220 221
222
223 224
225
226
227
228
229
230
231
232
233
1 T x G a (σ , ς )x − σ T (V ∗ (ς) + d) − x T F(σ , ς), 2
where G a (σ , ς ) = A +
m
σα Bαβ ςαβ
α=1 β∈Iα
(28)
(29)
is the Hessian matrix of (x, σ , ς ), which does not depend on x, and m
unc orre cted
Author Proof
208
This theorem provides a sufficient condition for a global minimizer of the nonconvex primal problem. In many applications, the geometrical mapping (x) : Xa → Ea is usually a quadratic operator 1 T α T α (x) = x Bβ x + x Cβ : Rn → Ea ⊂ Rm× pα , (27) 2 α ∈ Rn , and the range Ea depends on where Bαβ = Biαjβ = B αjiβ ∈ Rn×n , Cαβ = Ciβ α α both Bβ and Cβ . In this case, the generalized complementary function has the form:
of
207
pro
206
F(σ , ς ) = f −
σα Cαβ ςαβ .
(30)
α=1 β∈Iα
The criticality condition (21) in this case is a linear equation of x, i.e., G a (σ , ς )x = F(σ , ς ),
(31)
which is also called the canonical equilibrium equation [10]. Clearly, for a given (σ , ς), if F(σ , ς ) is in the column space of G a (σ , ς), denoted by Col (G a ), the solution of the equation (31) can be written in the form: x = G a+ (σ , ς )F(σ , ς ),
(32)
where G a+ is the Moore–Penrose generalized inverse of G a . Thus, the canonical dual feasible space Sc can be defined as ∗ Sc = {(σ , ς) ∈ Rm + × Ea | F(σ , ς ) ∈ Col (G a )},
(33)
and the canonical dual function P d can be formulated as
1 P d (σ , ς ) = − F(σ , ς)T G a+ (σ , ς )F(σ , ς ) − σ T (V ∗ (ς) + d). 2
(34)
Since (x) is a quadratic operator, its Gâteaux derivative is an affine operator t (x) = ∇(x) = x T Bαβ + CαT β .
By Gao and Strang [27], the complementary operator c (x) of t is defined by 1 c (x) = (x) − t (x)x = − x T Bαβ x. 2
(35)
Thus, the complementary gap function [27] can be defined as
1 1 G(x, σ , ς ) = σ T −c (x); ς + x T Ax = x T G a (σ , ς )x. 2 2
(36)
123 Journal: JOGO MS: JOGO-07-692 CMS: 10898_2009_9399
TYPESET
DISK
LE
CP Disp.:2009/1/15 Pages: 25 Layout: Small
J Glob Optim
This gap function plays an important role in nonconvex analysis and global optimization. Clearly, G(x, σ , ς ) ≥ 0 ∀x ∈ Xa if G a (σ , ς) 0. Let Sc+ = {(σ , ς ) ∈ Sc | G a (σ , ς ) 0},
Sc−
237
238 239
240 241 242 243 244 245
= {(σ , ς ) ∈ Sc | G a (σ , ς ) ≺ 0}.
Theorem 3 (Triality Theory) Suppose that (x) is a quadratic operator defined by (27) and the condition (ii) in (24) holds. ¯ ∈ Sc is a critical point of (P d ), then x¯ = G a+ (σ¯ , ς)F( ¯ ¯ is a KKT point of If (σ¯ , ς) σ¯ , ς) ¯ (P ) and P(¯x) = P d (σ¯ , ς). ¯ ∈ Sc+ , then (σ¯ , ς¯ ) is a global maximizer of P d (σ , ς ) on Sc+ , If the critical point (σ¯ , ς) the vector x¯ is a global minimizer of P(x) on Xc , and P(¯x) = min P(x) = x∈Xc
248
or
251
253 254
255 256 257 258 259 260 261 262 263
264 265 266 267 268 269
271
(39)
P d (σ , ς ) = P d (σ¯ , ς¯ )
(40)
max
¯ P d (σ , ς) = P d (σ¯ , ς).
(41)
x∈Xo
(σ ,ς )∈So
P(¯x) = max P(x) =
(σ ,ς )∈So
¯ ∈ Sc is a critical point of (P d ), we have Proof If (σ¯ , ς) ¯ = 0, δς P d (σ¯ , ς¯ ) = σ¯ T (¯x) − ∇V ∗ (ς) d
∗
0 ≤ σ¯ ⊥ δσ P (σ¯ , ς¯ ) = (¯x); ς¯ − V (ς¯ ) − d ≤ 0,
(42) (43)
where x¯ = G a+ (σ¯ , ς¯ )F(σ¯ , ς¯ ). Equation (42) asserts that if σ¯ α = 0, then the corresponding ¯ is a canonical duality pair on Ea × Ea∗ , ξα (¯x) = α (¯x) = ∇ςα V ∗ (ς¯ ). By the fact that ((¯x), ς) ¯ = V ((¯x)) = g(¯x). Therefrom the equivalent relations in (15), we have (¯x); ς¯ − V ∗ (ς) fore, the complementarity condition in (43) leads to σ¯ α (gα (¯x) − dα ) = 0. If σ¯ α = 0, we must have the criticality condition gα (¯x) − dα = 0. This shows that if (σ¯ , ς¯ ) is a critical point of P d (σ , ς), the vector x¯ = G a+ (σ¯ , ς¯ )F(σ¯ , ς¯ ) is a KKT point of (P ). Since the total complementary function (x, σ , ς ) is a saddle function on Xa × Sc+ and a super-critical function on Xa × Sc− (i.e., is concave in both x ∈ Xa and (σ , ς ) ∈ Sc− ) , the remainder of the theorem follows from the triality theory developed in [10,12,16]. ⊔ ⊓ Remark 1 Note that the dual variable σ ∈ Rm + , i.e., it is restricted by the inequality constraint ¯ is σ ≥ 0 ∈ Rm . From the proof of Theorem 1 (as well as Theorem 3) we know that if (¯x, σ¯ , ς) a critical point of , then x¯ and ς¯ satisfy the criticality conditions (21) and (22), respectively, while σ¯ satisfies the KKT conditions (23). Given a nonconvex quadratic objective function P(x), the global minimizer x¯ is located on the boundary of the feasible set Xc , i.e., there exists at least one σ¯ α > 0 such that ¯ − dα = gα (¯x) − dα = 0. ∇σα (¯x, σ¯ , ς¯ ) = α (¯x); ς¯α − Vα∗ (ς)
270
272
P d (σ , ς ) = P d (σ¯ , ς¯ ).
min
P(¯x) = min P(x) =
x∈Xo
252
max
(σ ,ς )∈Sc+
¯ ∈ Sc− , then on the neighborhood Xo × So of (¯x, σ¯ ), we have If the critical point (σ¯ , ς) either
249
250
(38)
Then from Theorems 1, 2, and the triality theory developed in [10,12], we have the following result:
246
247
(37)
unc orre cted
Author Proof
236
of
235
pro
234
i.e., the dual variable σ¯ α is a critical point of (x, σ , ς ), and the corresponding vector x¯ is KKT point of P(x). By the fact that the canonical dual feasible space Sc+ is a closed convex
123 Journal: JOGO MS: JOGO-07-692 CMS: 10898_2009_9399
TYPESET
DISK
LE
CP Disp.:2009/1/15 Pages: 25 Layout: Small
J Glob Optim
276
277 278
279
280
281
282 283
284
285 286 287 288 289 290 291 292 293 294 295 296 297
Sc‡ = {(σ , ς ) ∈ Sc | G a (σ , ς) ≻ 0},
(44)
of
Author Proof
275
the Hessian matrix G a (σ , ς ) is invertible and the canonical dual function can be formulated as 1 P d (σ , ς) = − F(σ , ς )T G a−1 (σ , ς )F(σ , ς ) − σ T (V ∗ (ς ) + d). 2
(45)
pro
274
¯ could be located on the boundary of Sc+ . In this case, the matrix set, the critical point (σ¯ , ς) G a (σ , ς ) may be singular and the canonical dual problem may have more than one critical point on Sc+ . Particularly, on the open canonical dual feasible space
In this case, the canonical dual problem d (P+ ) : max P d (σ , ς ) : (σ , ς ) ∈ Sc‡
(46)
usually has at most one solution (σ¯ , ς¯ ), which is a critical point of P d (σ , ς ). By Theorem 3, the vector x¯ = G a−1 (σ¯ , ς¯ )F(σ¯ , ς¯ )
is a solution to (P ), which is located on the boundary of Xc under the nonconvexity of P(x). The related existence conditions will be discussed in the following section. The generalized complementary function (x, σ , ς ) defined in (16) is actually the secondorder Lagrangian in the canonical duality theory (see Sect. 4.1.2 in [10]). For the quadratic geometrical mapping (x), the canonical dual function P d (σ , ς ) can be formulated explicitly, which is also called the pure complementary energy function in finite deformation theory (see [7–9]). The analytical solution of type (32) was first obtained in nonconvex variational problems [7,11] and finite deformation mechanics [9]. Similar results also appear in global optimization [12,16,17]. It is interesting to note that many nonconvex primal problems in nonconvex analysis and global optimization share the same form of the canonical dual function as defined in (34) (see [8,12,20,26]). In the case that (x) is a higher order nonlinear function, the sequential canonical dual transformation can be used to formulate a higher order complementary function (see Chap. 4 in [10]).
unc orre cted
273
301
Theorem 3 shows that by the canonical dual transformation, the nonconvex primal problem (P ) can be transformed to a concave maximization dual problem over a convex feasible space Sc+ , which can be solved via well-developed nonlinear programming techniques (cf. [40]).
302
3 Applications to quadratic constrained problems
298 299 300
303 304
305
306 307 308
We begin by considering the following nonconvex quadratic minimization problem with quadratic inequality constraints, denoted by (Pq ): min P(x) = 21 x T Ax − x T f (47) (Pq ) : s.t. 21 x T Bα x + x T Cα ≤ dα , α = 1, . . . , m, where Bα = Biαj = B αji ∈ Rn×n , Cα = Ciα ∈ Rn , ∀ α = 1, . . . , m, and d = {dα } ∈ Rm is a vector. Due to the nonconvex cost function and nonconvex inequality constraints, this problem is known to be NP-hard [42].
123 Journal: JOGO MS: JOGO-07-692 CMS: 10898_2009_9399
TYPESET
DISK
LE
CP Disp.:2009/1/15 Pages: 25 Layout: Small
J Glob Optim
312 313 314 315
of
311
Compared with (27), we have pα = 1 and the canonical function V (ξ ) = ξ is a self-mapping. Therefore, the canonical dual variable ς = ∇V (ξ ) = I is an identity matrix in Rm×m and V ∗ (ς) = sta{ξ ; ς − ξ | ξ ∈ Rm } = 0. In this case, the generalized complementary function (28) has a very simple form: q (x, σ ) =
316
317
where G q (σ ) = A +
318
319
320
321
330 331 332 333 334
G(x, σ ) =
337
σα Cα .
(50)
α=1
(51)
(52)
1 T x G q (σ )x, 2
(53)
which is nonnegative on Rn if G q (σ ) 0. Let
Sq+ = {σ ∈ Sq | G q (σ ) 0}, and Sq− = {σ ∈ Sq | G q (σ ) ≺ 0}.
(54)
Then the canonical dual problem for this quadratic constrained problem is given by (Pqd ) : max{Pqd (σ ) : σ ∈ Sq+ }.
(55)
From Theorem 3 we have the following result:
Theorem 4 The problem (Pqd ) is canonically dual to the primal problem (Pq ) in the sense that for each critical point σ¯ ∈ Sq of (Pqd ), the vector x¯ = G q+ (σ¯ )Fq (σ¯ ) is a KKT point of (Pq ) and P(¯x) = P d (σ¯ ). Particularly, if the critical point σ¯ ∈ Sq+ , then σ¯ is a global maximizer of (Pqd ), the vector x¯ is a global minimizer of (Pq ), and P(¯x) = min P(x) = max Pqd (σ ) = Pqd (σ¯ ). x∈Xc σ ∈Sq+
335
336
m
In this case, the complementary gap function has a simple form:
328
329
α=1
and Fq (σ ) = f −
1 Pqd (σ ) = − Fq (σ )T G q+ (σ )Fq (σ ) − σ T d. 2
326
327
σα Bα ,
(49)
the canonical dual function Pqd can be formulated as
324
325
m
Therefore, on the dual feasible space Sq = σ ∈ Rm + | Fq (σ ) ∈ Col (G q ) ,
322
323
1 T x G q (σ )x − x T Fq (σ ) − σ T d, 2
unc orre cted
Author Proof
310
Since the constraint g(x) is a vector-valued quadratic function defined on Xa = Rn , we simply let 1 T α x B x + x T Cα : Rn → Rm . (48) g(x) = (x) = 2
pro
309
(56)
If G q (σ¯ ) ≻ 0, then σ¯ is a unique global maximizer of (Pqd ) and the vector x¯ = G q−1 (σ¯ )Fq (σ¯ ) is a unique global minimizer of (Pq ).
123 Journal: JOGO MS: JOGO-07-692 CMS: 10898_2009_9399
TYPESET
DISK
LE
CP Disp.:2009/1/15 Pages: 25 Layout: Small
J Glob Optim
341
342 343 344 345 346
347 348
349
350 351
352
P(¯x) = min P(x) = min Pqd (σ ) = Pqd (σ¯ ). x∈Xo σ ∈ So
This theorem shows that the Hessian matrix of the complementary gap function G(x, σ ) provides sufficient and uniqueness conditions for globally minimizing the quadratic constrained problem (Pq ). We note that this theorem is actually a special application of the general canonical duality theory developed in [10,12,16,27]. In order to study the existence theory, we need to introduce the following sets: ∂ Sq = σ ∈ Rm | det G q (σ ) = 0 , (58) + ∂ Sq = σ ∈ Sq | det G q (σ ) = 0 . (59) Similar to the general results proposed in [29], we then have the following result:
Theorem 5 Suppose that for the given matrices A, {Bα }, {Cα } and vectors f, d, there exists at least one σ 0 ∈ Sq+ such that G q (σ 0 ) 0 and lim
σ → ∞ σ ∈ Sq+
353 354 355 356
357
359
360 361 362 363 364 365 366 367 368
369 370
371
Pqd (σ ) = −∞.
(60)
Then the canonical dual problem (Pqd ) has at least one KKT point σ¯ ∈ Sq+ . If σ¯ ∈ Sq+ is also a critical point of Pqd (σ ), then x¯ = G q+ (σ¯ )Fq (σ¯ ) is a global minimizer for the primal problem (Pq ). + Moreover, if ∂ Sq ⊂ Rm + , there exists at least one σ 0 ∈ Sq such that G q (σ 0 ) ≻ 0, and lim
σ → ∂ Sq+ σ ∈ Sq+
358
(57)
of
Author Proof
340
pro
339
However, if the critical point σ¯ ∈ Sq− , then σ¯ is a local minimizer of Pqd (σ ) on the neighborhood So ⊂ Sq− if and only if x¯ is a local minimizer of P(x) on the neighborhood Xo ⊂ Xc , i.e.,
unc orre cted
338
Pqd (σ ) = −∞,
(61)
then the canonical dual problem (Pqd ) has a unique global maximizer σ¯ ∈ Sq+ and x¯ = G q−1 (σ¯ )Fq (σ¯ ) is a unique global minimizer for the primal problem (Pq ). Proof By the fact that the feasible space Sq+ is a semi-closed convex set whose boundary ∂ Sq+ is a hyper-surface in Rm , if there exists a σ 0 such that G q (σ 0 ) 0, then Sq+ is not empty. Since the canonical dual function Pqd (σ ) is continuous and concave on Sq+ , which is finite on ∂ Sq+ , if the condition (60) holds, then Pqd (σ ) has at least one maximizer on Sq+ . m + Moreover, if ∂ Sq ⊂ Rm + , then Sq ⊂ R+ . If there exists a σ 0 such that G q (σ 0 ) ≻ 0, + then Sq is non-empty and has at least one interior point. Under the conditions (60) and (61), the canonical dual function Pqd (σ ) is strictly concave and coercive on the open convex set Sq+ \∂ Sq+ . Therefore, the canonical dual problem (Pq ) has a unique maximizer σ¯ ∈ Sq+ that is a critical point of Pqd (σ ). ⊔ ⊓ Theorem 5 shows that under the conditions (60) and (61), the canonical dual function Pqd (σ ) has a unique maximizer σ¯ on the open feasible space Sq‡ = {σ ∈ Sq | G q (σ ) ≻ 0}.
(62)
123 Journal: JOGO MS: JOGO-07-692 CMS: 10898_2009_9399
TYPESET
DISK
LE
CP Disp.:2009/1/15 Pages: 25 Layout: Small
J Glob Optim
373
375 376
Particularly, if m = 1 and C = 0, the problem (Pq ) has only one quadratic constraint g(x) = 21 x T Bx ≤ d. Therefore, the canonical dual function has only one variable [18]: 1 Pqd (σ ) = − f T (A + σ B)−1 f − σ d, 2
378
379
380 381 382 383
384 385 386 387
388
389 390 391 392 393
(64)
pro
377
and the criticality condition ∇ Pqd (σ ) = 0 leads to a nonlinear algebraic equation
1 T (65) f (A + σ B)−1 B(A + σ B)−1 f = d, 2 which can be solved easily to obtain all dual solutions. Moreover, if B = I is an identity matrix in Rn , then the constraint 21 x T Bx = 21 x2 ≤ d is an n-dimensional sphere. It was shown in [17,18] that number of solutions depends on the number of negative eigenvalues of A. These canonical dual solutions play an important role in trust region methods.
unc orre cted
Author Proof
374
In this case, the matrix G q (σ ) is invertible on Sq‡ and the canonical dual function Pqd can be written as 1 Pqd (σ ) = − Fq (σ )T G q−1 (σ )Fq (σ ) − σ T d. (63) 2
of
372
Remark 2 It is instructive to see connections between (63) and the classical Lagrangian Theory proposed in (3)–(5). Under the assumption that det G q (σ ) = 0, the necessary optimality condition ∇x L(x, σ ) = 0 in (3) leads to the unique solution x = G q−1 (σ )Fq (σ ). Substituting this into (4) we get d Pq (σ ) if σ ∈ Sq‡ , P ∗ (σ ) = inf L(x, σ ) = x∈Xa −∞ otherwise. Therefore, under the sufficient (global) optimality condition σ ∈ Sq‡ , the Lagrangian dual function P ∗ (σ ) is identical to the canonical dual function Pqd (σ ) as defined in (63). By Theorem 4, we know that if σ¯ ∈ Sq‡ is a critical point of Pqd (σ ), then the corresponding
x¯ = G q−1 (σ¯ )Fq (σ¯ ) ∈ Xc satisfies σ¯ ⊥ (g(¯x) − d), and (¯x, σ¯ ) ∈ Rn × Sq‡ is a saddle point solution with the following strong duality condition holding: inf P(x) = sup P ∗ (σ ). σ ∈Sq‡
394
x∈Xc
395 396
However, in many applications if {Bα } (α = 1, . . . , m) are either indefinite or zero matrices, the dual feasible space Sq‡ could be an empty set and in this case, P ∗ (σ ) = inf L(x, σ ) = −∞.
397
x∈Xa
398 399 400 401
Therefore, the classical Lagrangian duality theory cannot be used to solve the primal problem (see Example 2 in the next section). But, by the complementary-dual principle (the first part of Theorem 4) we know that the primal problem is equivalent to the following canonical dual minimal stationary problem: min sta{Pqd (σ ) : σ ∈ Sq }.
402
403 404 405
(66)
For each critical point σ¯ ∈ Sq , the vector x¯ = G q−1 (σ¯ )Fq (σ¯ ) is a KKT point of the primal problem. If σ¯ ∈ Sq− is a local minimizer of Pqd (σ ), by the triality theory we know that x¯ is a local minimizer of (Pq ). ⊔ ⊓
123 Journal: JOGO MS: JOGO-07-692 CMS: 10898_2009_9399
TYPESET
DISK
LE
CP Disp.:2009/1/15 Pages: 25 Layout: Small
pro
of
Author Proof
J Glob Optim
Fig. 1 Graph of P(x) (left); contours of P(x) and boundary of Xc (right) for Example 1
407 408 409 410 411 412 413 414 415 416 417 418
419
420
421 422 423 424
425
426
427
428 429 430 431
432
This remark shows that for nonconvex minimization problems with quadratic constraints, the Lagrangian dual P ∗ (σ ) is identical to the canonical dual P d (σ ) only on the canonical dual feasible space Sq‡ . The sufficient global optimality conditions in Sq‡ close the duality gap that exists in the classical Lagrangian duality theory. The triality theory (Theorem 4) can be used to develop effective canonical dual approaches for solving a wide class of primal problems. The general sufficient global optimality conditions for nonconvex minimization problems was first proposed by Gao and Strang in [27], where G(x, σ ) is called the complementary gap function associated with the quadratic operator (x). They discovered that under the condition G(x, σ ) ≥ 0, ∀x ∈ Xa , the critical point of the total complementary function q (x, σ ) leads to global optimal solutions to the primal problem. We note that similar results and the so-called L-subdifferential condition proposed in the recent papers [34,35] are actually special applications of the general canonical duality theory developed in [10,27].
unc orre cted
406
Example 1 In 2-D space, if we let 3 0.5 1 0 1 A= , B= , and f = , 0.5 −2.0 0 0.5 1.5 then the matrix A is indefinite, while B is positive definite. Setting d = 2, the graph of the primal function P(x) = 21 x T Ax − x T f is a saddle surface (see Fig. 1), and the boundary of the feasible set Xc = {x ∈ R2 | 21 x T Bx ≤ d} is an ellipse (see Fig. 1). In this case, the canonical dual function (64) can be formulated as −1 1 3 + σ 0.5 1 Pqd (σ ) = − (1 1.5) − 2σ, 0.5 −2 + 0.5σ 1.5 2 which has four critical points (see Fig. 2):
σ¯ 1 = 5.08 > σ¯ 2 = 3.06 > σ¯ 3 = −2.46 > σ¯ 4 = −3.68.
Since G q (σ¯ 1 ) ≻ 0 and G q (σ¯ 4 ) ≺ 0, the triality theory yields that x1 = (−0.05, 2.83)T is a global minimizer located on the boundary of Xc , and x4 = (−1.95, −0.64)T is a local maximizer. From the graph of P d (σ ) we can see that x2 = (0.39, −2.77)T is a local minimizer, and x3 = (2.0, −0.16)T is a local maximizer. We have P(x1 ) = −12.25 < P(x2 ) = −4.24 < P(x3 ) = 4.04 < P(x4 ) = 8.81.
123 Journal: JOGO MS: JOGO-07-692 CMS: 10898_2009_9399
TYPESET
DISK
LE
CP Disp.:2009/1/15 Pages: 25 Layout: Small
pro
of
Author Proof
J Glob Optim
Fig. 2 Graph of P d (σ )
434
435
436
437
438 439 440 441
442
443
444
445 446 447
448
449 450 451 452
453
4 Applications to box and integer constrained problems
We now turn our attention to the box constrained quadratic minimization problem: min P(x) = 21 x T Ax − x T f (Pb ) : s.t. − 1 ≤ xi ≤ 1, i = 1, . . . , n.
unc orre cted
433
An associated problem is the quadratic integer (Boolean) programming problem min P(x) = 21 x T Ax − x T f (Pi ) : s.t. xi ∈ {−1, 1}, i = 1, . . . , n.
(67)
(68)
Both problems are known to be NP-hard and have been subjected to considerable study over the past several decades (see [47,48]). In order to use the canonical dual transformation, the key step is to rewrite the linear box constraints in the following canonical form: 1 T α g(x) = (x) = x B x ≤ d, (69) 2 where Bα = {Biαj } ∈ Rn×n , ∀ α = 1, . . . , n, i.e., 1 if i = j = α, α Bi j = 0 otherwise, n and d = 21 is a vector whose components are all 21 . Thus, the box constrained problem is actually the most simple case of the problem (Pq ). Particularly, if the inequality constraint in (69) is replaced by 1 T α g(x) = (x) = (70) x B x = d, 2
then x has to be in {−1, 1}n . Therefore, the Boolean programming problem (Pi ) is actually a special case of the quadratically constrained quadratic program (where each equality constraint is equivalently represented by two inequalities). The canonical dual problem in this case has an even simpler form (see [20]): 1 T (71) (Pbd ) : max Pbd (σ ) = − f T G + (σ )f − σ d : σ ∈ S b , b 2
123 Journal: JOGO MS: JOGO-07-692 CMS: 10898_2009_9399
TYPESET
DISK
LE
CP Disp.:2009/1/15 Pages: 25 Layout: Small
J Glob Optim
G b (σ ) = A +
456
458 459 460 461 462 463
464 465 466
467
468
469
470 471 472 473
474 475
476
477
478
479 480 481 482 483 484 485 486 487
σα Bα ,
(72)
α
Sb = {σ ∈ Rn+ | f ∈ Col (G b )}.
(73)
Theorem 6 If σ¯ ∈ Sb is a critical point of the canonical dual problem (Pbd ), then x¯ = G+ x) = Pbd (σ¯ ). If σ¯ > 0, b (σ¯ )f is a KKT point of the box constrained problem (Pb ) and P(¯ n then x¯ ∈ {−1, 1} is a KKT point of the Boolean programming problem (Pi ).
pro
457
n
If G b (σ¯ ) ≻ 0, then x¯ = G −1 b (σ¯ )f is a global minimizer of the problem (Pb ). If G b (σ¯ ) ≻ 0 and σ¯ > 0, then x¯ = G −1 b (σ¯ )f is a global minimizer of the problem (Pi ). ⊔ ⊓ The proof of this theorem can be found in [20]. This theorem shows that by the canonical dual transformation, the nonconvex integer programming problem (Pi ) can be converted to the following continuous concave maximization dual problem: 1 T −1 + d d T (Pi ) : max Pb (σ ) = − f G b (σ )f − σ d : σ ∈ Si , (74) 2
unc orre cted
Author Proof
455
where
of
454
where Si+ = {σ ∈ Rn+ | σ > 0, G b (σ ) ≻ 0}. By the fact that d = { 21 }n > 0 and lim
σ →∞
Pbd (σ ) = −∞,
we know from Theorem 5 that the canonical dual problem (Pbd ) has at least one KKT point σ¯ ∈ Sb+ . If σ¯ ∈ Si+ ⊂ Sb+ , then x¯ = G −1 b (σ¯ )f is a global minimizer for (Pi ). A detailed study on canonical dual approaches for 0–1 programming and multi-integer programming is given in [3,25,49]. Example 2 For a given vector f ∈ Rn , we consider the following constrained convex maximization problem: max{x + f2 : x∞ ≤ 1},
(75)
which is equivalent to the following concave quadratic minimization problem: 1 T T T min P(x) = − x x − x f : |xi | ≤ 1, ∀i = 1, . . . , n, x x ≤ r , 2
(76)
where r > n to ensure that the additional quadratic constraint x T x ≤ r in the feasible space ∀i = 1, . . . , n, x T x ≤ r } is never active. Therefore, (76) is indeed a box constrained concave minimization problem. It is known that for high dimensional nonconvex constrained optimization problems, to check which constraints are active is fundamentally difficult [42]. If we let n = 2, r = 100, and f = (1, 1)T , the optimal solution is x¯ = (1, 1)T with objective value P(¯x) = −3. To illustrate the difficulty of applying the classical Lagrangian duality theory directly to (76), we first introduce Lagrange multipliers (σ1 , σ2 , σ3 , σ4 )T ∈ R4+ to relax the linear box constraints −1 ≤ xi ≤ 1, i = 1, 2, and σ5 ≥ 0 to relax the quadratic Xc = {x ∈ Rn | − 1 ≤ xi ≤ 1,
123 Journal: JOGO MS: JOGO-07-692 CMS: 10898_2009_9399
TYPESET
DISK
LE
CP Disp.:2009/1/15 Pages: 25 Layout: Small
J Glob Optim
488
2
1 σi (xi − 1) − σi+2 (xi + 1) L(x, σ ) = − (x12 + x22 ) − (x1 + x2 ) + 2
490
with the Lagrangian dual function given by
P ∗ (σ ) = min L(x, σ ).
492
pro
491
x∈R2
493
When σ5 < 1, we get P ∗ (σ ) = −∞. When σ5 = 1, we obtain max{P ∗ (σ ) : σ5 = 1} = −52 σ ≥0
494
495 496
of
i=1
1 + σ5 x12 + x22 − 100 , 2
at the solution σ0 = (1, 1, 0, 0, 1). Finally, for any given σ ∈ Sr = {σ ∈ R5+ | σ5 > 1}, the Lagrangian dual function can be obtained as 4
497
498
P ∗ (σ ) = −
501 502 503
sup{P ∗ (σ ) : σ ∈ Sr } = −52,
realized as σ → σ o = (1, 1, 0, 0, 1)T . Since σ o is not a critical point of P ∗ (σ ), both Theorems 4 and 6 do not apply for this case and the associated xo is not a KKT point of the primal problem (76). Hence, there exists a duality gap between the primal and the Lagrangian dual problem, i.e., P(¯x) = min P(x) = −3 > −52 = max P ∗ (σ ) = P ∗ (σ o ). x∈Xc σ ∈R5+
504
505
506
507
508
i=1
It is easy to check that the solution to the Lagrangian dual problem
499
500
1 σi − 50σ5 . (1 − σ1 + σ3 )2 + (1 − σ2 + σ4 )2 − 2(σ5 − 1)
unc orre cted
Author Proof
489
constraint 21 x T x ≤ 50. The Lagrangian associated with (76) is
To close this duality gap, we rewrite the constraints in the canonical form ⎫ ⎧1 ⎫ ⎧1 2 ⎪ ⎪ ⎨2 ⎪ ⎬ ⎪ ⎨ 2 x1 ⎬ ≤ 21 g(x) = (x) = 21 x22 =d ⎪ ⎪ ⎩ ⎪ ⎭ ⎪ ⎩1 2 ⎭ 2 50 2 (x 1 + x 2 )
as defined in (69) with 1 if i = j = α, Biαj = 0 otherwise,
i, j, α = 1, 2,
Bi3j =
1 if i = j, 0 if i = j,
i, j = 1, 2
511
and d = 0.5(1, 1, 100)T . In effect, this reformulates (76) for this instance as follows: 1 2 1 1 2 1 2 1 2 1 2 2 x ≤ , (x + x2 ) ≤ 50 . min P(x) = − (x1 + x2 ) − (x1 + x2 ) : x1 ≤ , 2 2 2 2 2 2 2 1 (77)
512
The associated total complementary function (49) has a simple form:
509
510
513
q (x, σ ) =
1 1 1 2 x1 (σ1 + σ3 − 1) + x22 (σ2 + σ3 − 1) − (x1 + x2 ) − (σ1 + σ2 ) − 50σ3 . 2 2 2
123 Journal: JOGO MS: JOGO-07-692 CMS: 10898_2009_9399
TYPESET
DISK
LE
CP Disp.:2009/1/15 Pages: 25 Layout: Small
J Glob Optim
517
518 519 520 521 522 523 524 525 526 527
528 529 530 531 532 533 534
535
536 537 538
539
540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556
of
516
The optimal solution for this concave maximization problem is σ¯ = (2, 2, 0)T with the optimal value Pqd (σ¯ ) = −3. Observe that σ¯ 3 = 0 reflects the fact that the quadratic constraint x T x ≤ r is inactive. Since σ¯ ∈ Sq+ is a critical point of Pqd (σ ), therefore, the vector x¯ = G q−1 (σ¯ )f = (1, 1)T is a global minimizer of the primal problem with zero duality gap. Finally, it is insightful to note that once the primal problem has been reformulated as (77), the classical Lagrangian dual is then given by max{P ∗ (σ ) : σ ≥ 0}, where P ∗ (σ ) = minx {q (x, σ )}. It is readily verified that for σ ∈ Sq+ , P ∗ (σ ) is given by Pqd (σ ) as in (78), while P ∗ (σ ) = −∞ otherwise. Hence, under the sufficient global optimality condition σ ∈ Sq+ , the Lagrangian dual for the canonically reformulated problem (77) is precisely given by the canonical dual (78).
pro
Author Proof
515
Then, on the canonical dual feasible space Sq+ = {σ ∈ R3+ | σi + σ3 − 1 > 0, i = 1, 2} the canonical dual problem (Pqd ) is 1 1 1 1 − (σ1 + σ2 ) − 50σ3 : σ ∈ Sq+ . max Pqd (σ ) = − + 2 σ1 + σ3 − 1 σ2 + σ3 − 1 2 (78)
Remark 3 This example shows the difficulty of directly applying the classical Lagrangian duality for solving nonconvex minimization problems with linear (including both box and integer) constraints. The classical Lagrangian duality theory was originally developed for linearly constrained convex variational problems in analytical mechanics, where the Lagrange multipliers and the linear constraints possess certain perfect duality (work-conjugate) relations [10,38]. In convex analysis, the classical (saddle) Lagrangian duality theory was generalized for solving the following convex minimization problem [2,44]:
unc orre cted
514
min{P(x) = W (x) − F(x)},
where is a linear operator, W (y) is a convex (nonsmooth) function, and F(x) is a concave (or linear) function. By using the Fenchel transformation, the so-called Fenchel– Moreau–Rockafellar dual problem is formulated as max{P d (σ ) = F ♯ (∗ σ ) − W ♯ (σ )},
where ∗ is an adjoint operator of . In constrained problems, if the function W (x) is an indicator of the linear (geometrical) constraint x ≤ d as defined by Equation (7), then by the Lagrange multiplier law (see [10], p. 36–37), the Lagrange multiplier σ for the primal problem should be a solution for the dual problem. Dually, if the Fenchel conjugate F ♯ is an indicator of the balance constraint ∗ σ = f [10], then the Lagrange multiplier x for the dual problem should be a solution to the primal problem. The convexity of the primal function P(x) leads to the strong duality min P(x) = max P d (σ ). In physics and continuum mechanics, the perfect dual function is called the complementary energy and the strong duality relation is controlled by certain conservation laws. However, the Lagrange multiplier method has been misused for solving general nonlinear constrained nonconvex problems. The primal problem (76) in Example 2 has both linear and nonlinear (quadratic) constraints, the Lagrange multipliers σi , i = 1, 2, 3, 4 are dual to the linear constraints, while σ5 is dual to the quadratic constraint. Since the linear and nonlinear constraints are different geometrical measures, their corresponding dual variables, i.e., the Lagrange multipliers σi , i = 1, 2, 3, 4, and σ5 are in different metric spaces with different (physical) units. Therefore, the classical Lagrangian dual problem in this case does not make physical sense. The weak Lagrangian duality theory leads to various duality gaps and violates the conservation law.
123 Journal: JOGO MS: JOGO-07-692 CMS: 10898_2009_9399
TYPESET
DISK
LE
CP Disp.:2009/1/15 Pages: 25 Layout: Small
J Glob Optim
560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593
594
595
596
of
Author Proof
559
Generally speaking, if the objective function P(x) is nonconvex, the optimal solutions are usually located on the boundary of the feasible set. To determine active constraints is a fundamentally difficult task. For a nonconvex polynomial P(x), the effective domain of the Lagrangian dual P ∗ (σ ) = supσ L(x, σ ) is usually empty. Therefore, many extended Lagrangian duality methods have been developed. As indicated in [12,16,27], the key step in the canonical dual transformation is to choose a (nonlinear) geometrical operator ξ = (x) such that the duality relation ς = ς (ξ ) defined by (13) is invertible (one-to-one) and the nonconvex primal problem can be written in the canonical form (12). Detailed discussion on the geometrical operator (x) and the canonical dual pairs (ξ , ς ) are given in [10] (Chap. 6). For most of problems in nonconvex analysis and global optimization, the quadratic geometrical operator (x) can be used to formulate canonical dual problems [12,27]. By the fact that the total complementary function is identical to the Lagrangian for the reformulated primal problem (77), the function q (x, σ ) was also called the nonlinear (or extended) Lagrangian in [10]. In physics and continuum mechanics, since the terminology complementarity represents perfect duality, it is more appropriate to name q (x, σ ) as the total complementary function. According to the canonical duality theory, when the primal problem is reformulated in the canonical form, the canonical dual function and its feasible space can be formulated precisely via the canonical dual transformation. Both global and local extremality conditions are governed by the triality theory. It is interesting to note that by using the quadratic geometrical measure, different primal problems in differential fields can be transformed to a unified canonical dual form. Actually, the canonical dual function Pqd (σ ) was first proposed in nonconvex analysis and mechanics [7–9], which leads to analytical solutions for a class of nonconvex variational/boundary value problems [11,22,23]. In continuum mechanics, the quadratic measure ξ = (x) corresponds to the CauchyRiemann type strain tensor, while its canonical dual ς = ∇V (ξ ) is the well-known second Piola–Kirchhoff stress tensor. For most of hyper-elastic materials, the strain energy V (ξ ) is a convex function of the canonical strain measure ξ (see [10], Chap. 6). In plasticity of solids, the box constraint x√ ∞ ≤ 1 corresponds to the well-known Tresca yield condition, while the condition x2 ≤ n corresponds to the von Mises yield condition (see [10], p. 404). In the famous experiment by Taylor and Quinney (1931), the experimental data for most metals lies between the two criteria; however, the data are generally closer to the von Mises yield condition. By using the quadratic operator (x) = x22 , the canonical dual transformation was first proposed to solve nonlinear finite element programming problems in large scale plastic limit analyses of structures [5,6]. The vector-valued quadratic canonical dual transformation (x) = {xi2 } for solving box and integer constrained problems was presented in [3,20,25,49]. ⊔ ⊓
pro
558
unc orre cted
557
5 Nonconvex polynomial constrained problems
We now assume that g(x) is a general fourth order polynomial constraint given by ⎫ ⎧ 2 ⎬ ⎨ 1 1 T α Dβ x Bβ x + x T Cαβ − E βα g(x) = ≤ d, ⎭ ⎩ 2 α 2
(79)
β∈Iα
597 598
α } ∈ Rn are given as before, I is a (finite) index where Bαβ = {Biαjβ } ∈ Rn×n and Cαβ = {Ciβ α β
set that depends on each index α = 1, . . . , m, and {Dα } and {E βα } are two given second order
123 Journal: JOGO MS: JOGO-07-692 CMS: 10898_2009_9399
TYPESET
DISK
LE
CP Disp.:2009/1/15 Pages: 25 Layout: Small
J Glob Optim
603 604
605
606
607
where the range of Ea depends on the tensors {Bαβ } and {Cαβ }, the canonical function ⎫ ⎧ ⎨ 1 2 ⎬ D β ξ α − E βα : Ea → Rm V (ξ ) = ⎭ ⎩ 2 α β β∈Iα
608
609
610 611
612
is a quadratic function. Thus, the canonical duality relation
ς = ∇V (ξ ) = {Dαβ (ξβα − E βα )} : Ea → Ea∗
β
is a linear mapping, where the range of Ea∗ depends on the tensors {Dα } and {E βα }. The Legendre conjugate V ∗ can be defined uniquely as: ⎧ ⎫ ⎬ ⎨
1 α β β 2 ) + E ς (ς . (80) V ∗ (ς ) = α α β β ⎭ ⎩ 2Dα
unc orre cted
Author Proof
602
of
600 601
β
tensors. We assume that Dα > 0, ∀α ∈ {1, . . . , m}, β ∈ Iα . This function is also called the fourth order canonical polynomial as discussed in [10,19], which arises in many applications (e.g., in Euclidean distance problems [28,36,45], post-buckling analysis in large deformation beam theory [13], finite element analysis for phase transitions of super-conductivity [29], and sensor network localization [30]). By introducing a geometrical measure 1 T α ξ = (x) = {ξβα } = x Bβ x + x T Cαβ : Rn → Ea , 2
pro
599
β∈Iα
613
614
Substituting this into (34), the canonical dual function has the following form: m
1 1 β 2 α β d T + (ς ) +E β ςα +dα P (σ , ς )=− F(σ , ς ) G a (σ , ς )F(σ , ς )− , σα β α 2 2Dα α=1 β∈Iα
615
616
617
618 619 620
621
622 623 624 625 626
627
(81)
which is concave on the dual feasible space
∗ Sc+ = {(σ , ς ) ∈ Rm + × Ea | F(σ , ς) ∈ Col (G a ), G a (σ , ς ) 0}.
(82)
Remark 4 Similar to Remark 2, it is again insightful to view the connection between the canonical dual (81) and the classical Lagrangian dual defined by (3)–(5). In this case, we have as in (3) that L(x, σ ) =
1 T x Ax − x T f + σ T (g(x) − d) . 2
(83)
Clearly, without introducing the canonical dual pair (ξ , ς), the Fenchel-Moreau-Rockafellar dual P ∗ (σ ) = inf x∈Xa L(x, σ ) defined by (4) cannot be defined explicitly due to the high order nonlinearity of the constraint g(x) ≤ d. However, by using the canonical dual transformation ξ = (x) and the chain rule, the necessary condition for the optimization in (4), i.e., ∇x L(x, σ ) = 0 leads to Ax − f +
m
σα ςαβ (Bαβ x + Cαβ ) = 0.
(84)
α=1 β∈Iα
123 Journal: JOGO MS: JOGO-07-692 CMS: 10898_2009_9399
TYPESET
DISK
LE
CP Disp.:2009/1/15 Pages: 25 Layout: Small
J Glob Optim
This is the canonical equilibrium equation G a (σ , ς )x = F(σ , ς ), where ς≡
630 631
∂ V ((x)) ∂ξ
(85)
is as defined above. If G a (σ , ς) is invertible for (σ , ς ) ∈ Sc as defined in (33), we get from (84) that x uniquely satisfies x = G a−1 (σ , ς )F(σ , ς ). Furthermore, by (84), we get m 1 1
1 T σ α ς βα [x T Bβα x + x T Cβα ]. x Ax = x T f − 2 2 2
632
633 634 635 636 637 638 639 640 641 642 643 644
645
646
647
648
649 650 651
Substituting for 21 x T Ax in (83) using (86), and then applying the optimality condition x = G a−1 (σ , ς)F(σ , ς), the Lagrangian dual function reduces precisely to P d (σ , ς ) defined in (81) under the global optimality condition (σ , ς ) ∈ Sc+ . Note that ∇ς (x, σ , ς ) = 0 produces the inverse of the identity (85) under the relevant case when σ α = 0, ∀α = 1, . . . , m, thus validating the foregoing derivation. In finite deformation theory, the geometrical measure ξ = (x) is called the canonical strain tensor and the canonical duality relation (85) is called the constitutive law. This one-to-one constitutive relation leads to the canonical dual function P d . The associated Theorem 1 is called the pure complementary variational principle, first developed in nonconvex variational analysis [7,8]. This theorem solves an open problem in nonconvex mechanics posed by Hellinger (1914) and Reissner (1953) (see [43]) and it is known as the Gao principle [39]. ⊔ ⊓ We now present some special case applications for (79).
5.1 Quadratic minimization with one nonconvex polynomial constraint We first assume that the primal problem has only one nonconvex constraint: 2 1 1 T T g(x) = x Bx + x c − η ≤ d, 2 2
G a (σ, ς) = A + σ ς B, F(σ, ς) = f − σ ςc.
The canonical dual function is
1 P d (σ, ς) = − FT (σ, ς)G a+ (σ, ς)F(σ, ς) − σ 2
654
655 656 657 658
1 2 ς + ης + d . 2
(88)
Example 3 In 2-D space, let B be an identity matrix, c = 0, and let A be a diagonal matrix with a11 = 0.6, a12 = a21 = 0, and a22 = −0.5. Setting f = (0.2, −0.1)T , d = 1, and η = 1.5, the constraint g(x) ≤ d is an annulus (see Fig. 3 (right)). Solving the dual problem, we get σ¯ = 0.3829201, and ς¯ = 1.4142136.
659
660
(87)
where B is an n × n matrix, c ∈ Rn is a vector, and η > 0 is a constant. In physics, this nonconvex fourth order polynomial is known as the double-well function and it appears in many applications [10]. In this case, m = |Iα | = 1, and
652
653
(86)
pro
α=1 β∈Iα
unc orre cted
Author Proof
629
of
628
The primal solution
x¯ =
661
0.1752033 −2.4078477
123 Journal: JOGO MS: JOGO-07-692 CMS: 10898_2009_9399
TYPESET
DISK
LE
CP Disp.:2009/1/15 Pages: 25 Layout: Small
pro
of
Author Proof
J Glob Optim
Fig. 3 Graph of P(x) (left); contours of P(x) and boundary of Xc (right) for Example 3
662
is located on the boundary of the feasible set Xc (see Fig. 3) and we have P(¯x) = −1.7160493 = P d (σ¯ , ς¯ ).
663
5.2 Combined quadratic and nonconvex polynomial constraints
665
We now consider the problem with the following two constraints:
1 T 1 x B x + x T C1 ≤ d1 , 2 2 1 1 T 2 T 2 g2 (x) = x B x + x C − η ≤ d2 . 2 2 g1 (x) =
666
667
668
669
670
671
672 673
In this case, m = 2, Iα = {1}, for α = 1, 2, and the geometrical operator 1 α T α ξ = (x) = xB x + x C : Rn → R2 2 is a 2-vector. The canonical function V (ξ ) is a vector-valued function 1 V (ξ ) = ξ1 , (ξ2 − η)2 . 2 The canonical dual variable is ς = ∇V (ξ ) = {1, ξ2 − η}. Since ς1 = 1, we let ς2 = ς. Thus, the canonical dual function has only three variables (σ1 , σ2 , ς) ∈ R3 , i.e., 1 P d (σ1 , σ2 , ς) = − F(σ1 , σ2 , ς)T G a+ (σ1 , σ2 , ς)F(σ1 , σ2 , ς) 2 1 2 − σ 1 d1 − σ 2 ς + ης + d2 , 2
674
675
676
677
678 679 680 681
unc orre cted
664
where
(89)
G a (σ1 , σ2 , ς) = A + σ1 B1 + σ2 ςB2 , F(σ1 , σ2 , ς) = f − σ1 C1 − σ2 ςC2 . Example 4 We consider A to be a 2 × 2 diagonal matrix, i.e., a11 = −0.4, a12 = a21 = 0, and a22 = 0.6. Setting f = (0.3, −0.15)T , B1 = B2 = I, C1 = C2 = 0, d1 = 2, d2 = 1.2, and η = 1.7, the graph of the objective function P(x1 , x2 ) is a saddle surface (Fig. 4(left)), the constraint g1 (x1 , x2 ) = 21 (x12 + x22 ) ≤ 2 is a disk of radius 2, while
123 Journal: JOGO MS: JOGO-07-692 CMS: 10898_2009_9399
TYPESET
DISK
LE
CP Disp.:2009/1/15 Pages: 25 Layout: Small
pro
of
Author Proof
J Glob Optim
(a)
(b)
Fig. 4 (a) Graph of P(x); (b) Contours of P(x), constraints g1 (x1 , x2 ) ≤ d1 (disk with radius R ≤ 2, dashed circle), and g2 (x1 , x2 ) ≤ d2 (annulus with radius 0.55 ≤ R ≤ 2.55) for Example 4
683
2 g2 (x1 , x2 ) = 21 21 (x12 + x22 ) − 1.7 ≤ 1.2 represents an annulus (see Fig. 4 (right)). Solving the dual problem, we get
unc orre cted
682
σ¯ 1 = 0.5503198, σ¯ 2 = 0, and ς¯ = 0.3159349.
684
685
The primal solution is therefore
x¯ =
686
687
1.9957445 , −0.1303985
which is located on the boundary g1 (x¯1 , x¯2 ) = 0, and
P(¯x) = −1.4097812 = P d (σ¯ 1 , σ¯ 2 , ς). ¯
688
690
Concrete applications of quadratic minimization with two nonconvex polynomial constraints are given in [28].
691
6 Concluding remarks and open problems
689
692 693 694 695 696 697 698 699
700 701 702 703 704 705
We have presented a detailed application of the canonical duality theory to the general differentiable nonconvex optimization problem. This problem arises in many real-world applications. Using the canonical dual transformation, a unified canonical dual problem was formulated with zero duality gap, which can be solved by well-developed nonlinear optimization methods. Both global and local optimizers can be identified by the triality theory. Insightful connections of this canonical duality with the classical Lagrangian duality have also been presented for two special applications in order to highlight the main constructs that enable the perfect duality results. Furthermore, these applications show how: (1) the n-dimensional nonconvex constrained problem (Pq ) in Sect. 3 can be reformulated as an m-dimensional concave maximization dual problem (Pqd ) over a convex space Sq+ with m < n (m = 1 in Example 1); (2) the nonconvex discrete integer programming problem (Pi ) in Sect. 4 can be converted to a concave maximization dual problem (Pid ) over a convex continuous space Si+ , which can be solved easily if Si+ is non-empty.
123 Journal: JOGO MS: JOGO-07-692 CMS: 10898_2009_9399
TYPESET
DISK
LE
CP Disp.:2009/1/15 Pages: 25 Layout: Small
J Glob Optim
709 710 711 712 713 714 715 716 717 718 719 720 721 722 723
of
Author Proof
708
Generally speaking, optimal solutions for constrained nonconvex minimization problems are usually KKT points located on the boundary of the feasible sets. Due to the lack of global optimality criteria, it is very difficult for direct methods and the classical Lagrangian relaxations to find global minimizers. Therefore, most of these problems are considered to be NP-hard (see [42,46]). However, by the canonical duality theory, these KKT points can be easily determined by the critical points of the canonical dual problems. The triality theory can be used to develop potentially powerful algorithms for solving these problems. The canonical duality theory concepts and methodologies presented in this article can be used and generalized to solve many other difficult problems in global optimization, nonconvex analysis and mechanics, network communication, and scientific computations. In general, so long as the geometrical operator is chosen properly, the canonical dual transformation method can be used to formulate perfect dual problems and the triality theory can be used to establish useful theoretical results. If the convex dual feasible space Sc+ is not empty, the canonical dual max{P d (σ , ς) : (σ , ς ) ∈ Sc+ } can be solved easily by well-developed convex minimization techniques. As indicated in [21], the primal problem (P d ) could be NP-hard for the class of problems where the canonical dual function has no critical point in Sc+ . In this case, the primal problem (19) is equivalent to the following canonical dual minimal stationary problem:
pro
707
(Pgd ) : min sta{P d (σ , ς) : (σ , ς) ∈ Sc }.
724
725 726 727
unc orre cted
706
(90)
Since P d (σ , ς ) is usually nonconvex on Sc , to solve this minimal stationary problem could be a challenging task and many theoretical issues remain open. For further details and comprehensive applications of the canonical duality theory, we refer the reader to [4,10,21,24,26,45].
728
732
Acknowledgements The authors would like to thank the two anonymous referees and an associate editor for their detailed comments and insightful suggestions. In particular, Example 2 was suggested by one of reviewers and Remark 3 is based on the questions raised by this reviewer and the associate editor. This research was supported by the National Science Foundation under grants CCF-0514768 and CMMI-0552676.
733
References
729 730 731
734 735 736 737 738 739 740 741 742 743 744 745 746 747 748 749 750 751 752 753
1. Bazaraa, M.S., Sherali, H.D., Shetty, C.M.: Nonlinear Programming: Theory and Algorithms, 3rd edn. Wiley, New York (2006) 2. Ekeland, I., Temam, R.: Convex Analysis and Variational Problems. North-Holland, Amsterdam (1976) 3. Fang, S.C., Gao, D.Y., Sheu, R.L., Wu, S.Y.: Canonical dual approach for solving 0-1 quadratic programming problems. J. Ind. Manag. Optim. 4(1), 125–142 (2008) 4. Fang, S.C., Gao, D.Y., Sheu, R.L., Xing, W.X.: Global optimization for a class of fractional programming problems. J. Glob. Optim., (2008) (in press) 5. Gao, D.Y.: Extended bounding theorems of limit analysis. Appl. Math. Mech. 4, 571–584 (1983) 6. Gao, D.Y.: Panpenalty finite element programming for limit analysis. Comput. Struct. 28, 749– 755 (1988) 7. Gao, D.Y.: Duality, triality and complementary extremun principles in nonconvex parametric variational problems with applications. IMA J. Appl. Math. 61, 199–235 (1998) 8. Gao, D.Y.: Pure complementary energy principle and triality theory in finite elasticity. Mech. Res. Comm. 26(1), 31–37 (1999) 9. Gao, D.Y.: General analytic solutions and complementary variational principles for large deformation nonsmooth mechanics. Meccanica 34, 169–198 (1999) 10. Gao, D.Y.: Duality Principles in Nonconvex Systems: Theory, Methods and Applications. pp. 454 Kluwer Academic Publishers, Dordrecht (2000) 11. Gao, D.Y.: Analytic solution and triality theory for nonconvex and nonsmooth vatiational problems with applications. Nonlinear Anal. 42(7), 1161–1193 (2000)
123 Journal: JOGO MS: JOGO-07-692 CMS: 10898_2009_9399
TYPESET
DISK
LE
CP Disp.:2009/1/15 Pages: 25 Layout: Small
J Glob Optim
759 760 761 762 763 764 765 766 767 768 769 770 771 772 773 774 775 776 777 778 779 780 781 782 783 784 785 786 787 788 789 790 791 792 793 794 795 796 797 798 799 800 801 802 803 804 805 806 807 808 809 810 811 812 813
of
Author Proof
758
pro
756 757
12. Gao, D.Y.: Canonical dual transformation method and generalized triality theory in nonsmooth global optimization. J. Glob. Optim. 17(1/4), 127–160 (2000) 13. Gao, D.Y.: Finite deformation beam models and triality theory in dynamical post-buckling analysis. Int. J. Non-Linear Mech. 5, 103–131 (2000) 14. Gao, D.Y.: Tri-duality in global optimization. In: Floudas, C.A., Pardalos, P.D. (eds.) Encyclopedia of Optimization, Vol. 1, pp. 485–491. Kluwer Academic Publishers, Dordrecht (2001) 15. Gao, D.Y.: Nonconvex Semi-linear Problems and Canonical Dual Solutions. Advances in Mechanics and Mathematics, Vol. II. pp. 261–312. Kluwer Academic Publishers, Dordrecht (2003) 16. Gao, D.Y.: Perfect duality theory and complete solutions to a class of global optimization problems. Optimization 52(4–5), 467–493 (2003) 17. Gao, D.Y.: Complete solutions to constrained quadratic optimization problems, Special issue on Duality. J. Glob. Optim. 29, 377–399 (2004) 18. Gao, D.Y.: Sufficient conditions and perfect duality in nonconvex minimization with inequality constraints. J. Ind. Manag. Optim. 1, 59–69 (2005) 19. Gao, D.Y.: Complete solutions and extremality criteria to polynomial optimization problems. J. Glob. Optim. 35, 131–143 (2006) 20. Gao, D.Y.: Solutions and optimality to box constrained nonconvex minimization problems. J. Ind. Maneg. Optim. 3(2), 293–304 (2007) 21. Gao, D.Y.: Advances in canonical duality theory with applications to global optimization. In: Ierapetriou, M., Bassett, M., Pistikopoulos, S. (eds.) Proceedings of the Fifth International Conference on Foundations of Computer-Aided Process Operations, pp.73–82. Omni Press, Cambridge, MA (2008) 22. Gao, D.Y., Ogden, R.W.: Closed-form solutions, extremality and nonsmoothness criteria in a large deformation elasticity problem. Zeitschrift für Angewandte Mathematik Und Physik 59(3), 498– 517 (2008) 23. Gao, D.Y., Ogden, R.W.: Multi-solutions to nonconvex variational problems with implications for phase transitions and numerical computation. Q. J. Mech. Appl. Math. 61(4), 497–522 (2008) 24. Gao, D.Y., Ruan, N.: Solutions and optimality criteria for nonconvex quadratic-exponential minimization problem. Math. Methods Oper. Res. 67(3), 479–496 (2008) 25. Gao, D.Y., Ruan, N.: On the solutions to quadratic minimization problems with box and integer constraints. J. Glob. Optim. (2008) (in press) 26. Gao, D.Y., Sherali, H.D.: Canonical Duality Theory: Connections Between Nonconvex Mechanics and Global Optimization, Advances in Applied Mathematics and Global Optimization. pp. 249– 316. Springer, New York (2008) 27. Gao, D.Y., Strang, G.: Geometric nonlinearity: potential energy, complementary energy, and the gap function. Q. Appl. Math. 47(3), 487–504 (1989) 28. Gao, D.Y., Yang, W.C.: Complete solutions to minimal distance problem between two nonconvex surfaces. Optimization 57(5) (2008) 29. Gao, D.Y., Yu, H.F.: Multi-scale modelling and canonical dual finite element method in phase transitions of solids. Int. J. Solids Struct. 45, 3660–3673 (2008) 30. Gao, D.Y., Ruan, N., Pardalos, P.M.: Canonical dual solutions to sum of fourth-order polynomials minimization problems with applications to sensor network localization, to appear. In: Pardalos, P.M., Ye, Y., Boginski, V.L., Commander, C.W. (eds.) Sensors: Theory, Algorithms, and Applications. Springer, Berlin (2008) 31. Hansen, P., Jaumard, B., Ruiz, M., Xiong, J.: Global minimization of indefinite quadratic functions subject to box constraints. Technical report, Technical Report G-91-54, Gread. École Polytechnique, Université McGill, Montreal (1991) 32. Hellinger, E.: Die allgemeine Ansätze der Mechanik der Kontinua. Encyklopädie der Mathematischen Wissenschaften IV(Part 4), 602–694 (1914) 33. Horst, R., Pardalos, P.M., Thoai, N.V.: Introduction to Global Optimization. Kluwer Academic Publishers, Dordrecht (2000) 34. Jeyakumar, V., Rubinov, A.M., Wu, Z.Y.: Sufficient global optimality conditions for non-convex quadratic minimization problems with box constraints. J. Glob. Optim. 36(3), 471–481 (2006) 35. Jeyakumar, V., Rubinov, A.M., Wu, Z.Y.: Non-convex quadratic minimization problems with quadratic constraints: global optimality conditions. Math. Program. 110(3), 521–541 (2007) 36. Johnson, D.E., Cohen, E.: A framework for efficient minimum distance computations. In: Proc. IEEE International Conference on Robotics and Automation, pp. 3678–3684. Leuven, Belgium (1998) 37. Koiter, W.T.: On the complementary energy theorem in nonlinear elasticity theory. In: G. Fichera (ed.) Trends Appl. Pure Math. Mech., Pitman, London (1976) 38. Lagrange, J.L.: Mécanique Analytique. Gauthier-Villars, Paris (1788) 39. Li, S.F., Gupta, A.: On dual configuration forces. J. Elast. 84, 13–31 (2006)
unc orre cted
754 755
123 Journal: JOGO MS: JOGO-07-692 CMS: 10898_2009_9399
TYPESET
DISK
LE
CP Disp.:2009/1/15 Pages: 25 Layout: Small
J Glob Optim
Author Proof
817 818 819 820 821 822 823 824 825 826 827 828 829 830 831 832
of
816
40. Nash, S.G., Sofer, A.: Linear and Nonlinear Programming. McGraw-Hill, New York (1996) 41. Ogden, R.W.: A note on variational theorems in non-linear elastostatics. Math. Proc. Camb. Phil. Soc 77, 609–615 (1975) 42. Pardalos, P.M., Vavasis, S.A.: Quadratic programming with one negative eigenvalue is NP-hard. J. Glob. Optim. 21, 843–855 (1991) 43. Reissner, E.: On a variational theorem for finite elastic deformations. J. Math. Phys., 32(2–3), 129135 (1953). (See also Selected Works in Applied Mechanics and Mathematics, Jones and Bartlett Publishers, Boston, MA, 1996) 44. Rockafellar, R.T.: Convex Analysis. Princeton University Press, Princeton (1970) 45. Ruan, N., Gao, D.Y., Jiao, Y.: Canonical dual least square method for solving general nonlinear systems of quadratic equations. Comput. Optim. Appl. (published online: http://www.springerlink.com/ content/c6090221p4g41858/) (2008). DOI:10.1007/s10589-008-9222-5 46. Sahni, S.: Computationally related problems. SIAM J. Comp. 3, 262–279 (1974) 47. Sherali, H.D., Tuncbilek, C.: A global optimization for polynomial programming problem using a reformulation-linearization technique. J. Glob. Optim. 2, 101–112 (1992) 48. Sherali, H.D., Tuncbilek, C.: New reformulation-linearization technique based relaxation for univariate and multivariate polynomial programming problems. Oper. Res. Lett. 21(1), 1–10 (1997) 49. Wang, Z., Fang, S.C., Gao, D.Y., Xing, W.: Global extremal conditions for multi-integer quadratic programming. J. Ind. Manag. Optim. 4(2), 213–226 (2008)
pro
815
unc orre cted
814
123 Journal: JOGO MS: JOGO-07-692 CMS: 10898_2009_9399
TYPESET
DISK
LE
CP Disp.:2009/1/15 Pages: 25 Layout: Small