Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

Motivation

In this section, we look for necessary and/or sufficient conditions for optimality of the constrained problem

minimizef0(x)subject tofi(x)0,i=1,,mhi(x)=0,i=1,,p\begin{split} \minimize&\quad f_0(x)\\ \text{subject to}&\quad f_i(x) \leq 0, \quad i = 1, \ldots, m\\ &\quad h_i(x) = 0, \quad i = 1, \ldots, p \end{split}

with Lagrange dual problem

maximizeg(λ,ν):=infxD(f0(x)+i=1mλifi(x)+i=1pνihi(x))subject toλ0\begin{split} \maximize&\quad g(\lambda, \nu) := \inf_{x\in \mathcal{D}} \left(f_0(x) + \sum_{i=1}^m \lambda_i f_i(x) + \sum_{i=1}^p\nu_i h_i(x)\right)\\ \text{subject to}&\quad \lambda \geq 0 \end{split}

Recall that, for unconstrained problems, such necessary and / or sufficient conditions are discussed in Chapter 2.

Complementary Slackness

Let xx^\star and (λ,ν)(\lambda^\star, \nu^\star) be primal and dual optimal, respectively. Then

f0(x)=g(λ,ν)(strong duality)=infx(f0(x)+i=1mλifi(x)+i=1pνihi(x))(def of g)f0(x)+i=1mλifi(x)+i=1pνihi(x)(inf property)f0(x)(x,λ are feasible)\begin{align*} f_0(x^\star) &= g(\lambda^\star, \nu^\star)&\text{(strong duality)}\\ &=\inf_{x} \left(f_0(x)+ \sum_{i=1}^m \lambda_i^\star f_i(x) + \sum_{i=1}^p \nu_i^\star h_i(x)\right)&\text{(def of $g$)}\\ &\leq f_0(x^\star)+ \sum_{i=1}^m \lambda_i^\star f_i(x^\star) + \sum_{i=1}^p \nu_i^\star h_i(x^\star)&\text{($\inf$ property)}\\ &\leq f_0(x^\star)&\text{($x^\star, \lambda^\star$ are feasible)} \end{align*}

Since by definition, i=1mλifi(x)0\sum_{i=1}^m \lambda_i^\star f_i(x^\star) \leq 0, we conclude that i=1mλifi(x)=0\sum_{i=1}^m \lambda_i^\star f_i(x^\star) = 0 and since each term in the sum is negative, one has the complementary slackness property

λifi(x)=0,i=1,,m\lambda_i^\star f_i(x^\star) = 0, \quad i=1, \ldots, m

or in other terms,

λi>0fi(x)=0fi(x)<0λi=0\lambda_i^\star > 0 \Rightarrow f_i(x^\star) = 0 \Longleftrightarrow f_i(x^\star) < 0 \Rightarrow \lambda_i^\star = 0

This means that the ii-th optimal Lagrange multiplier λi\lambda_i^\star is zero as long as the associated inequality constraint is inactive (fi(x)<0f_i(x^\star) < 0). Conversely, if λi>0\lambda_i^\star > 0, this means the constraint is active (fi(x)=0f_i(x^\star) = 0).

KKT conditions

Under the assumptions above, we can formulate two KKT conditions for nonconvex and convex problems, respectively.

Hence, for any (differentiable) optimization problem where strong duality holds, KKT conditions are necessary conditions for optimality of the triplet (x,λ,ν)(x^\star, \lambda^\star, \nu^\star).

For convex problems, the KKT conditions become sufficient. Recall that a general convex problem can be put into the form

minimizef0(x)subject tofi(x)0,i=1,,mAx=b \begin{split} \minimize&\quad f_0(x)\\ \text{subject to}&\quad f_i(x) \leq 0, \quad i = 1, \ldots, m\\ &\quad Ax = b \end{split}

In other words, for a convex problem, if the triplet (x~,λ~,ν~)(\tilde{x}, \tilde\lambda, \tilde\nu) satisfies KKT conditions then we have found optimal points and strong duality holds.

Understanding why KKT works:

In general, for convex problems KKT conditions are only sufficient. However, provided that strong duality holds, they become necessary and sufficient.

In particular, Slater’s condition implies strong duality and that the dual optimum is attained.

Proposition 1 generalizes the necessary and sufficient condition of unconstrained convex problems (f0(x)=0\nabla f_0(x) = 0) to the general constrained convex case.

Summary

Comments