Issue 
Int. J. Simul. Multidisci. Des. Optim.
Volume 10, 2019



Article Number  A10  
Number of page(s)  7  
DOI  https://doi.org/10.1051/smdo/2019010  
Published online  07 June 2019 
Research Article
Modified CourantBeltrami penalty function and a duality gap for invex optimization problem
^{1}
School of Mathematical Sciences,
Main Campus, USM,
11800
Geligor,
Pulau Penang,
Malaysia
^{2}
Department of Mathematics, Yusuf Maitama Sule University, Kano,
700241
Kabuga,
Kano,
Nigeria
^{*} email: mansurkust@yahoo.com
Received:
1
November
2018
Accepted:
13
May
2019
In this paper, we modified a CourantBeltrami penalty function method for constrained optimization problem to study a duality for convex nonlinear mathematical programming problems. KarushKuhnTucker (KKT) optimality conditions for the penalized problem has been used to derived KKT multiplier based on the imposed additional hypotheses on the constraint function g. A zeroduality gap between an optimization problem constituted by invex functions with respect to the same function η and their Lagrangian dual problems has also been established. The examples have been provided to illustrate and proved the result for the broader class of convex functions, termed invex functions.
Key words: CourantBeltrami penalty function / penalized problem / Lagrangian dual
© M. Hassan and A. Baharum, published by EDP Sciences, 2019
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
1 Introduction
Duality principle provided an avenue to view an optimization problem from one of the two perspectives, the primal problem (original formulation) or the dual problem (an alternative formulation). The purpose of duality in mathematical programming is to devise an alternative to the optimization problem which might be more convenient computationally or analytically. One interesting property of the dual problem is that its solution need not be equal to that of the primal problem. However, it provides a lower bound to the solution of the primal minimization problem. In general, the difference between the optimal values of the two problems is term as duality gap, if the optimal coincide then no duality gap occurs.
Based on the recent development, one of the suitable approaches in solving nonlinear optimization problem is an unconstrained optimization technique, this technique by extension can also be used to solve constrained optimization problem via a penalty function method. The idea is to carry out by transforming the original nonlinear constrained optimization problem to a single or a sequence of unconstrained optimization problems [1].
Several types of researches have been conducted to demonstrate the idea of a penalty function method and Lagrangian duality in contrast to the conventional approach for solving the constrained optimization problem [2–9].
The concept of duality and penalty function method are essential topics in optimization theory, methods and its applications [10]. There are many advancements regarding those topics in mathematical programming. The work of Changyu et al. [11] proposes a general dual program via generalized nonlinear Lagrangian functions for the constrained optimization problem. However, a dual program consists of a class of general dual programs that possess some explicit structures as special cases. Gao et al. [12] justified that a dual canonical transformation can be used to solve the general nonconvex quadratic minimization problem with nonconvex constraints, this can only be achieved by converting a nonconvex problem into a canonical dual problem with zero duality gap. Antczak [13] combined the idea of exact penalty function and duality theory to investigate and prove the zero duality gap between optimization problems constituted by invex functions introduced by Hanson [14] named by Craven [15] and their Lagrangian dual under some mild assumptions. Ernst and Volle [16] pay more attention to conic convex and established a general zero duality gap for conic convex via generalized CourantBeltrami penalty function. Kanzi and Soleimanidamaneh [17] proposed and reviewed the Slater constraint qualification (CQ) specifically for a semiinfinite optimization problem, which involved upper semicontinuous quasiconvex objective and constraint functions, KarushKuhnTucker (KKT) necessary and sufficient optimality conditions were provided.
The distance function was used by SoleimaniDamaneh [18] to illustrates a penalization mechanism, it was specifically designed for providing necessary and sufficient conditions to the solutions of the variational inequality problem. Moreover, SoleimaniDamaneh [19] used mordukhovich's subdifferential to solve nonsmooth optimization and variational problems.
In this paper, we modified a CourantBeltrami penalty function method, prove and obtain a few optimality and duality results. Precisely, for smooth optimization problems involving invex functions with respect to the same function η via a modified CourantBeltrami (MCB) penalty function method. Lagrange multiplier vectors based on MCB penalty has been derived. Moreover, a zeroduality gap under some mild assumptions was also established and validated with some examples.
The entire presentation is organized as follows: In Section 2, some preliminary definitions and results that will be useful in the subsequent sections were presented. In Section 3, we present the concept of a penalty function approach and the modification made to CourantBeltrami penalty function. In Section 4, the derivation of KKT multiplier in view of MCB penalty was presented. In Section 5, the Lagrangian duality and some examples were discussed and finally, we conclude the paper.
2 Preliminary result
We consider the following optimization problem:(1)where f : X → R and g _{ i } : X → R, i ∈ I, are continuously differentiable functions on a nonempty set X ⊂ R ^{ n }.
Some notations that are used throughout this presentation will be introduced. Let F = {x ∈ R ^{ n } : g _{ i }(x) ≤ 0,⩝ i = 1, 2, … , m}, be the set of all feasible solutions of the problem (1).
If the set of indices I partition into I _{1} and I _{2} with I _{1} ∩ I _{2} = ∅ , I _{1} ∩ I _{2} = I, the set of active constraints at a point x ∈ R ^{ n } can be denoted as .
Definition 1. [13] If the set of all feasible solutions in the problem (1) is not empty, we say that problem (1) is consistent.
Definition 2. [13] If there exists a strictly feasible solution , that is, , i ∈ I, then problem (1) is superconsistent, and the feasible point is called a Slater point for the problem (1).
Under slightly reasonable convexity assumptions of f (x), and g _{ i } (x) , the KKT conditions are also necessary and sufficient for optimality of such optimization problem (1).
Theorem 1. Let be an optimal solution to the problem (1) and the suitable constraint qualification [10] be satisfied at . Then there exist the Lagrange multipliers , i = 1, … , m such that(2) (3) (4)
Let assume that is an optimal solution to mathematical programming problem (1) and a suitable constraint qualification be satisfied. Furthermore, the necessary optimality of KKT conditions holds at with the optimal vector of KKT multiplier ξ ∈ R ^{ m }. Then the Lagrangian can be defined as(5)
Invex function is the wider class of convex introduced by Hanson [14] and named by Craven [15]. This new class of function is termed generalized convex functions.
Definition 3. [14] Let f : X → R be a differentiable function on X ⊂ R ^{ n } and u ∈ R ^{ n }. If, there exists a vectorvalued function η : R ^{ n } × R ^{ n } → R ^{ n } such that, ∀x ∈ X, the following inequality(6)
Holds, then the function f is said to be an invex (strictly invex) function with respect to η at u on X.
If (6) holds at each point u ∈ R ^{ n }, then f is said to be an invex (strictly invex) function with respect to η on R ^{ n }.
For a problem (1) to be an invex optimization problem, all functions constituting the problem (1) must be an invex function on R ^{ n } with respect to the same function η. If the optimization problem (1) is superconsistent with an invex constraint g _{ i } on R ^{ n } (w.r.t. the same function η) then the problem (1) satisfies the Slater constraint qualification (CQ).
3 Penalty function
Penalty method is one of the suitable approaches for solving nonlinear constrained optimization problems. The idea is achievable if constrained optimization is replaced by a single or a sequence of unconstrained problems. Ideally, the solutions of the transformed problem will always converge to the solution of the original problem. The primary role of the penalty term is to penalize the problem whenever a constraint is violated and to search for the optimum among the feasible points. There are several penalty functions in the literature, and the most popular among them was the absolute value penalty function introduced by Zangwill [1]. For a given constraint g _{ i }(x) ≤ 0, the absolute value penalty function is (7)
Note that the function is defined by , penalty function (7) was later modified to quadratic form, popularly known as CourantBeltrami penalty function [8,9](8)
Definition 4. [1] A continuous function p : R ^{ n } → R, is said to be a generalized penalty function (penalty function) for the constrained optimization problem (1) if it satisfied the following conditions:

p(x) = 0 for all feasible points x to the problem (1)

p(x) > 0 for infeasible points to the problem (1)
Note, that the conditions in Definition 4 are direct interpretations of whether the constraints function g _{ i } is violated or not.
We modified a quadratic form of penalty function (8), popularly known as CourantBeltrami penalty function, to a logarithmic form based on the new logarithmic penalty function that was restricted to equality constraints introduced by Hassan and Baharum [20]. The modified CourantBeltrami (MCB) penalty function performs the same role as the existing penalty functions. The MCB penalty function can be express in the following form: (9)
Let a nonnegative constant c be a penalty parameter, c _{ k } ≥ 0, k = 1, 2, … may be a sequence of penalty parameters with c _{ k+1} ≥ c _{ k } ∀ k and .
Accordingly, we converted the considered optimization problem (1) into the single of unconstrained problems based on the penalty function (9) as follows(10)
We should call the problem (10) modified CourantBeltrami penalized optimization problem.
Theorem 2 [13]. Suppose that f, g, and p are continuous functions. Let {x _{ k }}, k = 1, 2, … be a sequence of solutions in the penalized problem (10). Then any limit of the convergent subsequence {x _{ k j }} solves the original nonlinear programming problem (1).
4 KarushKuhnTucker multiplier for the penalized problem
In optimization problems, the first order necessary conditions for a nonlinear optimization problem to be optimal is KarushKuhnTucker (KKT) conditions, or KuhnTucker conditions if some of the constraint qualifications are satisfied. Nevertheless, CourantBeltrami penalty function may not be differentiable at a point g _{ i } (x) = 0 for some i ∈ I. But for the constrained optimization both objective function and constraints may be partially differentiable on R ^{ n } while the penalized problem is not, since the differentiability is not among the properties of max {0, g _{ i } (x)}. According to Proposition 1 [21], some additional assumptions may be imposed on the constraint function g _{ i } (x), i.e. if the constraint g _{ i } (x) has continuous firstorder partial derivatives on R ^{ n }, for this reason, admit the same. Therefore, (11)
Considering equation (11), if p (x): R ^{ n } → R is a modified CourantBeltrami (MCB) penalty function and the constraints g _{ i } (x) has continuous firstorder partial derivative on R ^{ n }, then
Theorem 3. Let x* be an optimal solution to the problem (1) and it satisfies the firstorder necessary optimality conditions of the constrained problem (1). Moreover, a suitable CQ be satisfied at x*, then x* is a solution to the penalized problem (10).
Proof. If x* is a feasible point which satisfies the firstorder necessary optimality conditions of the problem, then
Then (12) can be rewritten aswhere is a vector of KKT multiplier and for all i ∈ I. Therefore, Lagrange multipliers for the penalized problem (10) can be represented as . □
5 Lagrangian duality
We consider the following dual problem for the considered nonlinear optimization problem (1) (13)
Definition 5. A vector ξ is said to be a feasible solution for the dual problem (13) if
Note that if the problem (13) has at least one feasible vector ξ then the problem is called a consistent and any feasible vector satisfying is an optimal solution to the problem (13).
The relationship between primal and dual problem for the invex problem (i.e. if all the functions constituting the problem are invex on R ^{ n } w.r.t. the same function η) is either zero or a positive real number. Moreover, if a feasible solution of the dual problem returned a primal objective value lower bound, then it is referred as the weak duality, while in the absence of duality gap (when their difference is zero), then it is called strong duality.
Let P = inf {f (x) : g _{ i } (x) ≤ 0, i = 1, 2, … , m . x ∈ R ^{ n }} and D = sup {ϕ (ξ) : ξ ∈ R ^{ m }, ξ ≥ 0} be the primal and the dual optimal respectively. The following inequality related both optima: P ≥ D (if the problem (1) and (13) are consistent).
Let G represent the duality gap between the two problems that is, G = P − D ≥ 0.
Theorem 4. (Weak duality theorem): Let the functions constituting the problem (1) be continuously differentiable on R ^{ n }. Suppose that the problem (1) and (13) are consistent, then their optima are finite and G ≥ 0.
Proof. Let problem (1) and (13) has any feasible solution x and ξ, respectively. The feasibility of x implies that g _{ i }(x) ≤ 0, i = 1, 2, … , m. If ξ is feasible to the problem (13). Then ξ _{ i } g _{ i } (x) ≤ 0, i = 1, 2, … , m. So,(14)
The righthand side of the inequality (14) is Lagrange function. Therefore, f(x) ≥ L(x, ξ).
Since both x and ξ are randomly chosen feasible points in (1) and (13) respectively, then(15)
It follows directly from inequality (15) that(16)
Therefore, (16) and (17) justified that P and D are finite if (1) and (13) are consistent and P ≥ D. This verified the relationship between the two problems. i.e. G ≥ 0. □
Definition 6. A continuous function f (x) that is defined on R ^{ n } is coercive if
That is, for any constant M > 0 ∃ R _{ M } > 0 such that f (x) > M whenever x > R _{ M }.
Theorem 5. (Strong duality theorem): Let the functions constituting the problem (1) be continuously differentiable on R ^{ n }. Further, assume that those functions in the problem (1) are invex with respect to the same function η, also an objective function in (1) is coercive. If an optimization problem (1) is consistent then its dual problem (13) is also consistent and their optimal solutions coincide, that is, G = 0.
Proof. To prove Theorem 5, we use the modified CourantBeltrami penalty function method. As described in this approach, we need to construct an unconstrained optimization problem as follows:
Suppose that at the kth iterations, x _{ k } is an optimal solution to the penalized problem (10). Therefore, for every positive integer c _{ k }, P(x _{ k },c _{ k }) = min {P(x, c _{ k }) : x ∈ R ^{ n } } . Moreover, the sequence {x _{ k }} is bounded and of all its convergent subsequences have limits as described in the penalty function method (see Prop. 15 [13]), these limits are the solutions to the original problem (1). By Theorem 3, it follows that
Since x _{ k } is an optimal solution of the problem (10), let assume that is a convergent subsequence of {x _{ k }}, then the condition below is satisfied.(18)
According to KKT multiplier defined in (*), the Lagrange multipliers are given by
Accordingly, for each j,
.
From (**) is nonnegative. Therefore, for each j, Lagrange multipliers satisfy , then, by (18) and (**), we have .
Since the functions constituting the problem (1) are invex functions on R ^{ n } with respect to the same function η. Thus, based on Definition 3, ∀x ∈ R ^{ n }, we have(19)
Then, multiplying the second part of (19) by the Lagrange multiplier vector, we get(20)
Upon combining and rearranging the first part of (19) and (20), we have(21)
Then, according to (21) the Lagrange function is an invex function with respect to the same function η as in Definition 3 with stationary points according to equation , and the points are global minimum of the function L. Therefore,(22)
From (22), it follows that the Lagrange multiplier vector is feasible for the dual problem (13). By hypothesis, the optimization problem (1) is consistent with the coercive objective function. Then, the problem has a solution with primal optimal value P > − ∞. Consequently, for each j (23)
The reason for (23) is based on the assumptions that the limit point of the convergent subsequence is an optimal solution to the original problem (1). By the construction of modified CourantBeltrami penalized problem, we have(24)and considering the definition of g ^{+} (x), it is obvious that(25)
But is not feasible in the original optimization problem, then in (25) can be expanded and rewritten in the following form:(26)
Then, by (**) inequality (26) can be replaced with(27)
Therefore, according to definition of Lagrange function, we have
Since is a convergent subsequence of {x _{ k }}, then by (22)
Consequently, by (23) and (28) (29)
Since the sequence {x _{ k }} converge to , which is optimal to the original problem (1) and f is a continuous function on R ^{ n }, then we conclude that ⇒P ≤ D. Also, by Theorem 4, P ≥ D. Combining the two results (i.e. G ≤ 0 and G ≥ 0), P = D. □
The results established in this paper will be illustrated with some examples, using a modified CourantBeltrami penalty function method. Precisely, for invex optimization problem.
Example 1. [22] Consider the following optimization problem:
This problem can be verified that the objective function f and both constraints functions g _{1} and g _{2} are invex functions on the set X with respect to the same function η defined bywhere
Modified CourantBeltrami penalty function method defined by (10) can be used to solve the problem. Converting the problem in to unconstrained problem, we obtained the following:(30)
Note that, the set of feasible solutions F = {(x _{1}, x _{2}) ∈ X : x _{1} ≥ 0, x _{2} ≥ 0}. The points (0, 0) is an optimal to the problem, since the conditions (2)–(4) are satisfied with Lagrange multipliers , and penalty parameter it is also a minimizer to penalized problem (30).
Furthermore, according to the definition of Lagrange function, the Lagrangian dual for the considered problem can be constructed as follows:
The optimal values of ϕ (ξ) and P (x, c) are the same considering that all the stated assumptions in Theorem 5 are satisfied, the duality gap is 0 (i.e. G = 0).
Let us now consider the case when some assumptions in theorem were not fulfilled. In this case, the duality gap may no longer be equal to 0 (G ≠ 0).
Example 2. [13] Consider the following optimization problem:
Obviously, the objective f is coercive, but it is not invex on R with respect to any function η : R × R → R, due to the facts that the stationary point x = 0 of f is not its global minimum point.
The penalized problem can be constructed as follows:(31)
The feasible point x = − 1 is an optimal in (31) with the optimal value . Thus, the Lagrangian dual as defined by (13) is as follows:(32)
Clearly, the dual problem (32) is consistent since with . Hence which is greater than 0.
6 Conclusion
In this paper, we modified a penalty function that should be called a modified CourantBeltrami (MCB) penalty function method. We combined the concept of Lagrangian dual and penalty function approach to investigate and prove the relationship between an optimization problem (1) and its Lagrangian dual problem (13). The zeroduality gap for invex optimization problem has been established under some moderate assumptions. Some examples were presented which verified that those assumptions are essential to establish a zeroduality gap between the two problems. In the future research, MCB penalty function method will be applied to solve;

Multiobjective constrained optimization problem and

The practical problem from any of the following areas;

Engineering

Decision theory and

The chemical process, etc.

References
 W.I. Zangwill, Nonlinear programming via penalty functions, Manag. Sci. 13, 344 (1967) [CrossRef] [Google Scholar]
 B.W. Kort, D.P. Bertsekas, Combined primaldual and penalty method for convex programming, SIAM J. Cont. Optim. 14, 268 (1976) [CrossRef] [Google Scholar]
 R.V. Rao, Jaya: a simple and new optimization algorithm for solving constrained and unconstrained optimization problems, Int. J. Ind. Eng. Comput. 7, 19 (2016) [Google Scholar]
 A. Cherukuri, E. Mallada, J. Cortes, Asymptotic convergence of constrained primaldual dynamics, Syst. Control Lett. 87, 10 (2016) [CrossRef] [Google Scholar]
 C. Kanzow, D. Steck, Augmented Lagrangian and exact penalty methods for quasivariational inequalities. Comput. Optim. Appl. 69, 801 (2017) [CrossRef] [Google Scholar]
 R.A. Shandiz, E. Tohidi, Decrease of the penalty parameter in differentiable penalty function methods, Theor. Econ. Lett. 1, 8 (2011) [CrossRef] [Google Scholar]
 G.D. Pillo, L. Grippo, A continuously differentiable exact penalty function for nonlinear programming problems with inequality constraints, SIAM J. Cont. Optim. 23, 72 (1985) [CrossRef] [Google Scholar]
 Z. Chen, Y. Dai, A line search exact penalty method with biobject strategy for nonlinear constrained optimization, J. Comput. Appl. Math. 300, 245 (2016) [CrossRef] [Google Scholar]
 X.L. Sun, D. Li, Logarithmicexponential penalty formulation for integer programming, Appl. Math. Lett. 12, 73 (1999) [CrossRef] [Google Scholar]
 M. Bazaraa, H. Sherali, C. Shetty, Nonlinear Programming Theory and Algorithms (John Wiley & Sons, Inc. New Jersey, 2006), 3rd edn. [Google Scholar]
 W. Changyu, X. Yang, X. Yang, Nonlinear lagrange duality theorems and penalty function methods in continuous optimization, J. Glob. Optim. 27, 473 (2003) [CrossRef] [Google Scholar]
 D.Y. Gao, N. Ruan, H.D. Sherali, Solutions and optimality criteria for nonconvex constrained global optimization problems with connections between canonical and Lagrangian duality, J. Glob. Optim. 45, 473 (2009) [CrossRef] [Google Scholar]
 T. Antczak, Penalty function methods and a duality gap for invex optimization problems, Nonlinear Anal. Theory Methods Appl. 71, 3322 (2009) [CrossRef] [Google Scholar]
 M. Hanson, On sufficiency of the KuhnTucker conditions in nondifferentiable programming, J. Math. Anal. Appl. 80, 545 (1981) [CrossRef] [Google Scholar]
 B.D. Craven, Invex functions and constrained local minima, Bull. Aust. Math. Soc. 24, 357 (1981) [CrossRef] [MathSciNet] [Google Scholar]
 E. Ernst, M. Volle, Generalized CourantBeltrami penalty functions and zero duality gap for conic convex programs, Positivity 17, 945 (2013) [CrossRef] [Google Scholar]
 N. Kanzi, M. Soleimanidamaneh, Slater CQ, optimality and duality for quasiconvex semiinfinite optimization problems, J. Math. Anal. Appl. 434, 638 (2016) [CrossRef] [Google Scholar]
 M. Soleimanidamaneh, Penalization for variational inequalities, Appl. Math. Lett. 22, 347 (2009) [CrossRef] [Google Scholar]
 M. Soleimanidamaneh, Nonsmooth optimization using Mordukhovich's subdifferential, SIAM J. Control Optim. 48, 3403 (2010) [CrossRef] [Google Scholar]
 M. Hassan, A. Baharum, A new logarithmic penalty function approach for nonlinear constrained optimization problem, Deci. Sci. Lett. 8, 3 (2019) [Google Scholar]
 D.P. Bertsekas, On penalty and multiplier methods for constrained minimization, SIAM J. Cont. 14, 216 (1976) [CrossRef] [Google Scholar]
 T. Antczak, Exact penalty functions method for mathematical programming problems involving invex functions, Eur. J. Oper. Res. 198, 29 (2009) [CrossRef] [Google Scholar]
Cite this article as: Mansur Hassan, Adam Baharum, Modified courantBeltrami penalty function and a duality gap for invex optimization problem, Int. J. Simul. Multidisci. Des. Optim. 10, A10 (2019)
Current usage metrics show cumulative count of Article Views (fulltext article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 4896 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.