Theorem
- A finite intersection of halfspaces is convex.
- Cauchy-Schwarz inequality: |aHb| ≤ ||a|| · ||b||
- Each halfspace is a convex set
- The intersection of any number of convex sets is also convex
- The product of two convex functions is convex if one of them is nonnegative.
- For any cvx opt problem, any local solution is also global.
Concept
quasiconvex
Definition: A function
f
f
f is quasiconvex if its domain is convex and for any two points
x
1
x_1
x1 and
x
2
x_2
x2 in this domain, and for any
λ
\lambda
λ in the interval
[
0
,
1
]
[0,1]
[0,1], the following holds:
f
(
λ
x
1
+
(
1
−
λ
)
x
2
)
≤
max
{
f
(
x
1
)
,
f
(
x
2
)
}
f\left(\lambda x_1+(1-\lambda) x_2\right) \leq \max \left\{f\left(x_1\right), f\left(x_2\right)\right\}
f(λx1+(1−λ)x2)≤max{f(x1),f(x2)}
For a function to be quasiconvex, its sublevel sets must be convex.
Jensen’s Inequality
In convex function condition. Let
λ
1
,
λ
2
,
…
,
λ
n
\lambda_1, \lambda_2, \ldots, \lambda_n
λ1,λ2,…,λn be non-negative weights such that
λ
1
+
λ
2
+
⋯
+
λ
n
=
1
\lambda_1+\lambda_2+\cdots+\lambda_n=1
λ1+λ2+⋯+λn=1. Then Jensen’s Inequality states that:
f
(
λ
1
x
1
+
λ
2
x
2
+
⋯
+
λ
n
x
n
)
≤
λ
1
f
(
x
1
)
+
λ
2
f
(
x
2
)
+
⋯
+
λ
n
f
(
x
n
)
f\left(\lambda_1 x_1+\lambda_2 x_2+\cdots+\lambda_n x_n\right) \leq \lambda_1 f\left(x_1\right)+\lambda_2 f\left(x_2\right)+\cdots+\lambda_n f\left(x_n\right)
f(λ1x1+λ2x2+⋯+λnxn)≤λ1f(x1)+λ2f(x2)+⋯+λnf(xn)
Problem
How to determine if a fuction it is convex, concave, quasiconvex, or quasiconcave
For single variable:
Quasiconvexity:
- First Derivative Test (for differentiable functions): A differentiable function f f f is quasiconvex on an interval if its derivative f ′ ( x ) f^{\prime}(x) f′(x) does not change sign from positive to negative.
For furthermore to check convexity:
2. First Derivative Test: For functions of a single variable, if the first derivative
f
′
(
x
)
f^{\prime}(x)
f′(x) is non-decreasing on the interval, then
f
f
f is convex on that interval.
3. Second Derivative Test: For twice differentiable functions of a single variable, if the second derivative
f
′
′
(
x
)
f^{\prime \prime}(x)
f′′(x) is non-negative for all
x
x
x in the domain, then
f
f
f is convex on that domain.
Quasiconcavity:
Opposite to quasiconvexity: its derivative
f
′
(
x
)
f^{\prime}(x)
f′(x) does not change sign from negative to positive
Concavity:
Opposite to convexity: first derivative
f
′
(
x
)
f^{\prime}(x)
f′(x) is non-increasing; second derivative
f
′
′
(
x
)
f^{\prime \prime}(x)
f′′(x) is non-positive for all
x
x
x in the domain
For multi-dimension variables:
See Hessian matrix, if that is negative semidefinite -> concave, positive semidefinite-> convex.
If infinite matrix, then neither convex nor concave.
Determining quasiconvexity (quasiconcavity) directly through the Hessian matrix is generally not possible, as the concept of quasiconvexity is less rigid and more general than convexity. Here we have to check the sublevel (superlevel) sets.
Note, a linear function is always convex and concave as the for every two points equality holds:
f
(
λ
x
+
(
1
−
λ
)
y
)
=
λ
f
(
x
)
+
(
1
−
λ
)
f
(
y
)
f(\lambda x+(1-\lambda) y)=\lambda f(x)+(1-\lambda) f(y)
f(λx+(1−λ)y)=λf(x)+(1−λ)f(y)
Composition: One dimension case:
f
(
x
)
=
h
(
g
(
x
)
)
f(x)=h(g(x))
f(x)=h(g(x))
f
f
f is convex if
h
h
h is convex and nondecreasing, and
g
g
g is convex
f
f
f is convex if
h
h
h is convex and nonincreasing, and
g
g
g is concave,
f
f
f is concave if
h
h
h is concave and nondecreasing, and
g
g
g is concave
f
f
f is concave if
h
h
h is concave and nonincreasing, and
g
g
g is convex
General form to standard form
In the standard form, every constraint is expressed as an equality. slack variable is crucial in converting inequality to equality.
Derive a dual problem
- Using Lagrange multipliers for the inequality and equality constraints to form Lagrangian. The Lagrange multipliers for inequality constraints are typically constrained to be non-negative.
- minimizing the Lagrangian with respect to the primal variables
- The dual problem is then formulated by maximizing the dual function over all dual variables.