Slater's Condition in General

Slater’s condition is a fundamental constraint qualification in convex optimization that guarantees strong duality. It essentially demands that there’s room to perturb the constraints without the problem becoming infeasible, and as classically stated, imposes interiority conditions on the set of feasible perturbations. However, we don’t actually need to rely on the topological structure to capture the idea behind Slater’s condition. In this post, we’ll review the classical result, then explore a generalization that works in vector spaces without any topological structure.

Classical Slater’s condition

Review of some definitions

Before stating the classical result, let’s recall some key concepts about optimization in vector spaces. Throughout, all vector spaces are real vector spaces (vector spaces over ).

Dual space. For a topological vector space , the topological dual (or simply dual) consists of all continuous linear functionals . A linear functional is continuous if and only if it is bounded on some neighborhood of the origin, i.e., there exists a neighborhood of the origin and such that for all . (In normed spaces, this is equivalent to for all .) In finite dimensions, all linear functionals are continuous.

Positive cones and order. A subset of a vector space is a cone if for all and . (Note that we require .) A convex cone defines a partial order on by declaring (or simply ) if and only if . We call a positive cone when used to define such an order. When is a topological vector space and has nonempty interior, we write (or simply ) if .

For example, in , the standard positive cone is , giving the componentwise order. In function spaces, we might take .

Dual cone. Given a cone in a normed space , the dual cone is

The dual cone consists of continuous linear functionals that are “nonnegative” on . When , we have as well.

Cone-convex functions. A function (where is ordered by cone ) is -convex if for any and ,

Equivalently, is -convex if the set is convex in .

The classical theorem

We can now state the classical form of Slater’s condition for convex optimization problems in normed spaces, as appears in Exercise 8.7 of Luenberger [1].

Theorem: Slater's Condition (Classical)

Consider the convex optimization problem

(1)

with variable , where

  • is a convex subset of a vector space ,
  • is a convex function,
  • is a -convex function into a normed space ordered by positive cone , and
  • is an affine function into the finite-dimensional normed space .

Slater’s condition states that if

  • problem (1) has finite optimal value ,
  • there exists such that and , and
  • (where is the image of under ),

then there are and such that

i.e., strong duality holds and the dual optimum is obtained.

I was assigned this exercise as a homework problem in Winter 2024. I do not know if there is a standard proof for this result, but both the proof that I found and the official solution took the same general strategy, which I outline below.

Proof strategy

The basic idea is to prove this is to apply the Hahn–Banach separation theorem to separate the point from the set given by

The set contains all triples for which there exists a feasible point achieving objective value strictly less than under constraint perturbations . Since , , and are convex and is affine, is a convex set.

The Hahn–Banach separation theorem guarantees a separating hyperplane is has nonempty interior. This hyperplane is given by a nonzero linear functional satisfying

Once and are established (proofs omitted), we can divide by and define and . Equality is obtained due to weak duality.

The tricky part is showing that has nonempty interior, which is necessary to apply the Hahn–Banach separation theorem. This is the part that uses the various interiority conditions in the hypotheses.

Proof that

We construct an explicit open set contained in . Since and , we have room to perturb each of the constraints individually. The most natural thing to try is to see if we can perturb each of the constraints in any direction, and satisfy some uniform bound on . That is, we want to find an open set of the form about , lying in .

Individual perturbations. Let . Since is a normed space, there exists such that the open ball . So, the constraint is still feasible with any perturbation (in particular, take ).

Since and is finite-dimensional, we can choose a basis for and identify (where ) such that the unit cube . So, is still feasible with perturbations in the interior of the unit cube.

The challenge is that if we perturb the equality constraint, moving from to , then we must change . A prior, this might consume the available slack that we have for the inequality constraint, or could blow up as we take close to the boundary of the cube. We need to show that these effects can be controlled.

Quantifying the impact of equality constraint perturbations. To bound the impact of perturbations to the equality constraint, we need only investigate the effect of perturbations in the extreme directions (the vertices of ). This works because is affine and , , and are convex. To make this more precise, let be the vertices. For each , select a representative such that .

Consider the convex hull . This consists of points given by

for with . Since is convex, for each . Furthermore, by affinity of , which generates the entire cube.

Crucially, the behavior of and at is controlled by their values at the representatives . Applying convexity gives

where we define . Note that as long as , then we are guaranteed by our earlier construction.

Controlling the impact of equality constraint perturbations. Since is finite, the values of and at these points are necessarily bounded. We define the maximum costs associated with these extreme directions as

For any , we have and . While the objective is therefore uniformly bounded, it’s possible that , so that and, possibly, .

To remedy this, we scale down the cube . Choose a scaling factor such that . (If , then any works.) Now define the set

It isn’t hard to see that covers the scaled cube . For any given , we certainly have . But we also have

So, and we have some extra space to further perturb the inequality constraints while keeping feasible.

The open set. We now define and , so that we get the open set . We verify that is, indeed, contained in . Let .

Since , there exists such that . For this , we have .

For the inequality constraint, we need , or equivalently, . Since , we have

Since and , the triangle inequality gives . Therefore, . We conclude , hence .

Thus, . Since and is open, .

A more general Slater’s condition

The core idea of Slater’s condition is that we can perturb the constraints in any direction without losing feasibility or blowing up . As it turns out, this idea can be captured without reference to a topology. We can instead use purely algebraic constructs and obtain a more general form of Slater’s condition that applies in arbitrary (real) vector spaces.

More definitions

We first need to introduce some additional concepts that make sense in arbitrary vector spaces, without requiring any topology.

Algebraic dual space. For any vector space , the algebraic dual is the space of all linear functionals . This is in contrast to the topological dual , which you’ll recall consists of continuous linear functionals only. We have in general, with equality when is finite-dimensional.

Algebraic dual cone. For a cone in a vector space , the algebraic dual cone is

A key fact is that if has nonempty interior (when is a topological vector space), then every functional in is automatically continuous, so in this case.

Core. The core of a set in a vector space , denoted , is the set of points such that for every , there exists with for all . This just says that starting from any , you can pick any direction and you’ll be able to travel at least some distance in that direction without leaving .

The core generalizes the notion of interior. In a topological vector space, if has nonempty interior, then . For convex sets in finite-dimensional spaces, (they coincide). For convex sets in infinite-dimensional spaces, the core can be nonempty even when the interior is empty.

The General Result

With these definitions in hand, we can state the generalized Slater condition. I am actually not aware of any sources that provide this result. I will add a reference if I can find one!

Theorem: Slater's Condition (General)

Consider the convex optimization problem

(2)

with variable , where

  • is a convex subset of a vector space ,
  • is a convex function,
  • is a -convex function into a vector space ordered by positive cone , and
  • is an affine function into a vector space .

Define the cone-extended constraint image set

Note that is a convex subset of .

The general Slater’s condition states that if

  • problem (2) has finite optimal value and
  • ,

then there are and such that

i.e., strong duality holds and the dual optimum is obtained.

Proof of the general result

In the proof of the classical result, we spent a lot of time checking that we could jointly perturb the equality and inequality constraints while maintaining feasibilty and a uniform bound on . In this case, however, the core condition permits us to jointly perturb the constraints for free. The proof strategy is then to analyze the sensitivity of the optimal cost to small changes in the constraints. The generalized Slater’s condition guarantees that this sensitivity is well-behaved and finite in every possible direction of perturbation. This allows the construction of a linear underestimator for the cost behavior using the Hahn–Banach theorem, which gives the Lagrange multipliers necessary to prove strong duality.

The value function. Consider the value function given by

This function measures the optimal value of the problem when we perturb the constraints by . Clearly is convex and . Without loss of generality, we can translate so that .

Directional derivative of at . Define the directional derivative of at the origin as

Since is convex and we have assumed , the ratio is nondecreasing in for . This ensures that the limit exists (though a priori it might be ) and equals the infimum, i.e.,

In fact, is finite-valued. Since , for any direction , there exists such that . This means , so

Therefore, is finite-valued on all of .

Verifying that is sublinear. We next verify that is sublinear. It is positively homogeneous because, by a change of variables,

for all and . It is also subadditive since convexity of gives

Thus is a sublinear function (finite-valued, positively homogeneous, and subadditive).

Applying Hahn–Banach. Since is a sublinear function mapping to , the Hahn–Banach theorem guarantees the existence of a linear functional such that for all . Write for some .

is a subgradient. We have for all (take in the infimum defining ). Thus,

Rearranging, we obtain the subgradient inequality

(3)

This shows that is a subgradient of at the origin.

. We must check that for all . For any , we have , so . This holds due to monotonicity: the value function is nonincreasing in its first argument, since the feasible set grows as increases with respect to . Applying (3), we also have . Combining these gives , which implies . Thus, .

Strong duality. For any , we have , so substituting into the subgradient inequality (3) yields

Since by definition, we have

By weak duality, the right side is at most , so we have equality. Thus strong duality holds with Lagrange multipliers .

Use in the classical result

The general Slater’s condition can be used to prove the classical Slater’s condition.

Suppose is a normed space, is finite-dimensional, and we have

  • the existence of with (strict inequality) and , and
  • the condition .

This is the classical Slater’s condition. We can show that this implies the general Slater’s condition, that . We leave this as an exercise. Applying the general theorem, we obtain multipliers for which strong duality holds. Moreover, these multipliers actually lie in . This is because implies (so ) and is finite-dimensional (so ).

References

[1] Luenberger, D. G. (1969). Optimization by Vector Space Methods. John Wiley & Sons.