[Notes] Convex Optimization 2015.11.25

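Copyright notice: this is an original article by the author, released under the CC 4.0 BY-SA license. When reposting, please include a link to the original source and this notice.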

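These notes explore the relationship between norms and their conjugate functions and give the conjugates of several common functions. They also present the separating hyperplane theorem for convex sets and its proof.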

$\lVert y \rVert _* = \sup \{ x^T y : \lVert x \rVert \le 1 \} \implies x^T y \le \lVert x \rVert \cdot \lVert y \rVert _*$
(because $\frac{x^T}{\lVert x \rVert} y \le \lVert y \rVert _*$ for $x \neq 0$ , since $\frac{x}{\lVert x \rVert}$ has norm 1)
We want an inequality of the form $x^T y \le f(x) + f^*(y)$ for a "general" $f$ (Fenchel's inequality).
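
As a quick numerical sanity check of the inequality $x^T y \le \lVert x \rVert \cdot \lVert y \rVert _*$, the sketch below takes one concrete pair, assuming $\lVert \cdot \rVert = \lVert \cdot \rVert_1$ with dual norm $\lVert \cdot \rVert_\infty$. The helper names (`norm_1`, `norm_inf`, `dual_norm_maximizer`) are made up for this illustration and are not part of the original notes.

```python
import random

def norm_1(x):
    """l1 norm: sum of absolute values."""
    return sum(abs(t) for t in x)

def norm_inf(y):
    """l-infinity norm: largest absolute value (the dual norm of l1)."""
    return max(abs(t) for t in y)

def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

def dual_norm_maximizer(y):
    """Return x with ||x||_1 = 1 attaining sup{x^T y : ||x||_1 <= 1}."""
    k = max(range(len(y)), key=lambda i: abs(y[i]))
    x = [0.0] * len(y)
    x[k] = 1.0 if y[k] >= 0 else -1.0  # signed unit vector on the largest |y_k|
    return x

# Random check of x^T y <= ||x||_1 * ||y||_inf (small slack for rounding).
rng = random.Random(0)
for _ in range(1000):
    x = [rng.uniform(-5, 5) for _ in range(4)]
    y = [rng.uniform(-5, 5) for _ in range(4)]
    assert dot(x, y) <= norm_1(x) * norm_inf(y) + 1e-12
```

The maximizer shows that the supremum in the definition of the dual norm is attained here, so the bound cannot be improved for this pair of norms.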

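Definition: For $f : \mathbb{R}^n \to \mathbb{R}$ , the conjugate $f^*$ of $f$ is defined by $f^*(y) = \underset{x}{\sup} \left( x^T y - f(x) \right)$
with $\text{dom} f^* =$ the set of $y$ for which the supremum is $\lt \infty$ .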
- Example:
  1. $f(x) = a^T x + b (x \in \mathbb{R}^n)$
    $f^*(y) = \underset{x}{\sup} x^T y - a^T x - b = \begin{cases} \infty & \text{if } y \neq a \\ -b & \text{if } y = a \end{cases}$
  2. $f(x) = -\log x (x \gt 0)$
    $(xy + \log x)' = y + \frac{1}{x} = 0 \implies x = -\frac{1}{y}$
    $f^*(y) = \underset{x \gt 0}{\sup} x^T y + \log x = \begin{cases} \infty & \text{if } y \ge 0 \\ -\log(-y) - 1 & \text{if } y \lt 0 \end{cases}$
  3. $f(x) = e^x (x \in \mathbb{R})$
    $(xy - e^x)' = y - e^x = 0 \implies x = \log y$
    $f^*(y) = \underset{x}{\sup} x^T y - e^x = \begin{cases} \infty & \text{if } y \lt 0 \\ y\log y - y & \text{if } y \ge 0 \end{cases}$ (with the convention $0 \log 0 = 0$ , so $f^*(0) = 0$ )
  4. $f(x) = x \log x (x \ge 0)$
    $(xy - x \log x)' = y - \log x - 1 = 0 \implies x = e^{y - 1}$
    $f^*(y) = \underset{x \ge 0}{\sup} x^T y - x \log x = y e^{y - 1} - (y - 1) e^{y - 1} = e^{y - 1}$
  5. $f(x) = \frac{1}{2} x^T Qx$ with $Q \in S_{++}^n$
    $f^*(y) = \underset{x}{\sup} x^T y - \frac{1}{2} x^T Qx = y^T Q^{-1} y - \frac{1}{2} y^T Q^{-1} y = \frac{1}{2} y^T Q^{-1} y$
    (Recall: for symmetric $A \succ 0$ , $\underset{x}{\inf} x^T Ax + x^T b$ is attained at $x = -\frac{1}{2} A^{-1} b$ .)
    Here $A = \frac{1}{2} Q$ and $b = -y$ , so the maximizer is $x = Q^{-1} y$
    $\implies x^T y \le \frac{1}{2} x^T Qx + \frac{1}{2} y^T Q^{-1} y$ , for all $Q \succ 0$
  6. $f(x) = \log \left( \sum _{i = 1}^n e^{x_i} \right)$
    $f^*(y) = \underset{x}{\sup} x^T y - \log \left( \sum _{i = 1}^n e^{x_i} \right)$
    $\frac{\partial}{\partial x_i} \left(x^T y - \log \left( \sum _{j = 1}^n e^{x_j} \right) \right) = y_i - \frac{e^{x_i}}{\sum _{j = 1}^n e^{x_j}} = 0$
    $\implies y_i = \frac{e^{x_i}}{\sum _{j = 1}^n e^{x_j}}, y \succeq 0, 1^T y = 1$
    assume for simplicity, $y \succ 0$
    put $x_i = \log (y_i)$ , then $\sum e^{x_i} = 1^T y = 1$ and optimality conditions hold
    then $f^*(y) = \sum _{i = 1}^n y_i \log (y_i) - \log (1^T y) = \sum _{i = 1}^n y_i \log (y_i)$
  7. $f(x) = \lVert x \rVert$
    $f^*(y) = \underset{x}{\sup} x^T y - \lVert x \rVert = \begin{cases} 0 & \text{if } \lVert y \rVert _* \le 1 \\ \infty & \text{if } \lVert y \rVert _* \gt 1 \end{cases}$
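    $x^T y - \lVert x \rVert \le \lVert x \rVert \cdot \lVert y \rVert _* - \lVert x \rVert = \lVert x \rVert (\lVert y \rVert _* - 1) \le 0$ if $\lVert y \rVert _* - 1 \le 0$ , with equality at $x = 0$
    If $\lVert y \rVert _* \gt 1$ , pick $x$ with $\lVert x \rVert \le 1$ and $x^T y \gt 1$ ; then $(tx)^T y - \lVert tx \rVert \ge t (x^T y - 1) \to \infty$ as $t \to \infty$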
  8. $f(x) = \frac{1}{2} \lVert x \rVert ^2$
    $f^*(y) = \underset{x}{\sup} x^T y - \frac{1}{2} \lVert x \rVert ^2 = \frac{1}{2} \lVert y \rVert _*^2$
    $x^T y - \frac{1}{2} \lVert x \rVert ^2 \le \lVert x \rVert \cdot \lVert y \rVert _* - \frac{1}{2} \lVert x \rVert ^2 \le \frac{1}{2} \lVert y \rVert _*^2$ (the second inequality is tight when $\lVert x \rVert = \lVert y \rVert_*$ )
    $\implies x^T y \le \frac{1}{2} \lVert x \rVert ^2 + \frac{1}{2} \lVert y \rVert _*^2$
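
The scalar closed forms in examples 2 to 4 can be cross-checked numerically. The following is a minimal sketch, not part of the original notes: it assumes the supremum lies inside a hand-picked search interval and approximates it by ternary search on the concave function $x \mapsto xy - f(x)$. The names `argmax_concave` and `conjugate_1d`, as well as the test values of $y$ and the intervals, are hypothetical choices made for this illustration.

```python
import math

def argmax_concave(g, lo, hi, iters=200):
    """Ternary search for the maximizer of a concave function g on [lo, hi]."""
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3
        m2 = hi - (hi - lo) / 3
        if g(m1) < g(m2):
            lo = m1  # maximizer lies to the right of m1
        else:
            hi = m2  # maximizer lies to the left of m2
    return (lo + hi) / 2

def conjugate_1d(f, y, lo, hi):
    """Approximate f*(y) = sup_x (x*y - f(x)), searching only over [lo, hi]."""
    g = lambda x: x * y - f(x)
    return g(argmax_concave(g, lo, hi))

neg_log = lambda x: -math.log(x)     # example 2, defined for x > 0
x_log_x = lambda x: x * math.log(x)  # example 4, evaluated for x > 0

# Pairs of (numerical value, closed form from the examples above).
checks = [
    (conjugate_1d(neg_log, -2.0, 1e-9, 1e3), -math.log(2.0) - 1.0),
    (conjugate_1d(math.exp, 3.0, -50.0, 50.0), 3.0 * math.log(3.0) - 3.0),
    (conjugate_1d(x_log_x, 0.5, 1e-12, 1e3), math.exp(0.5 - 1.0)),
]
for numeric, closed_form in checks:
    print(numeric, closed_form)
```

Ternary search is used only because each objective here is concave in one variable. For $y$ outside $\text{dom} f^*$ the supremum is infinite and this bounded search would just return a value at the edge of the interval.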
- Proof of general hyperplane separation:
  Let $C \subseteq \mathbb{R}^n$ be a convex set, $H \subseteq \mathbb{R}^n$ be the affine subspace of smallest dimension containing $C$ , we write $C_{\varepsilon} = \{ x : B_{\varepsilon} (x) \bigcap H \subseteq C \}$
  then $C_{\varepsilon} \subseteq "\text{relint} (C)" = \underset{\varepsilon \gt 0}{\bigcup} C_{\varepsilon}$ . (relint: relative interior)
  ( $C \subseteq \overline{\text{relint}(C)}$ , $C$ is a subset of closure of $\text{relint}(C)$ )
  Let $C, D$ be disjoint convex sets. Then for every $\varepsilon \gt 0$ the sets $A_{\varepsilon} = \overline{C_{\varepsilon}} \bigcap B_{\frac{1}{\varepsilon}}(0)$ , $\overline{D}$ are closed disjoint convex sets with $\overline{C_{\varepsilon}} \bigcap B_{\frac{1}{\varepsilon}}(0)$ bounded, and $\text{dist}(A_{\varepsilon}, \overline{D}) \ge \varepsilon \gt 0$ .
  So $\exists a_{\varepsilon} \in \mathbb{R}^n$ , $a_{\varepsilon} \neq 0$ , $b_{\varepsilon} \in \mathbb{R}$ s.t. $(a_{\varepsilon}, b_{\varepsilon})$ define a separating hyperplane for $A_{\varepsilon}, \overline{D}$ .
  $a_{\varepsilon}^T x \le b_{\varepsilon} \; \forall x \in A_{\varepsilon}$ , $a_{\varepsilon}^T x \ge b_{\varepsilon} \; \forall x \in \overline{D}$
  WLOG $\lVert a_{\varepsilon} \rVert = 1$
  The sequence $(\vec{a}_\frac{1}{n})_{n = 1}^{\infty}$ is a sequence of unit vectors and so has a convergent subsequence, say WLOG convergent to $a_0 \in \mathbb{R}^n$ .
  We can assume the sequence $b_{\frac{1}{n}}$ is bounded (or else one of the sets $C, D$ is empty)
  and so, passing to a further subsequence, also convergent to some value $b_0 \in \mathbb{R}$ .
  Want to show $(a_0, b_0)$ is a separating hyperplane for $C, D$ , i.e., that
  $a_0^T x \le b_0 \; \forall x \in C, \, a_0^T x \ge b_0 \; \forall x \in D$
  (Assume $C$ is not a single point and argue as above; otherwise assume $D$ is not a single point and swap the roles of $C, D$ .
  If both $C, D$ are single points, the claim is obviously true.)
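
The basic step used in the proof, separating two disjoint closed convex sets when one of them is bounded, can be made concrete in the simplest case. The sketch below assumes both sets are closed Euclidean balls, which is a special case chosen for illustration and not the general construction from the proof. The function names and the sample centers and radii are hypothetical.

```python
import math
import random

def separating_hyperplane_for_balls(c1, r1, c2, r2):
    """Return (a, b) with a.x <= b on ball(c1, r1) and a.x >= b on ball(c2, r2)."""
    diff = [q - p for p, q in zip(c1, c2)]
    gap = math.sqrt(sum(d * d for d in diff))
    if gap <= r1 + r2:
        raise ValueError("closed balls are not disjoint")
    a = [d / gap for d in diff]  # unit normal, so ||a||_2 = 1 as in the proof
    hi1 = sum(ai * ci for ai, ci in zip(a, c1)) + r1  # max of a.x over ball 1
    lo2 = sum(ai * ci for ai, ci in zip(a, c2)) - r2  # min of a.x over ball 2
    return a, (hi1 + lo2) / 2  # offset b in the middle of the gap

def sample_ball(c, r, rng):
    """Rejection-sample a point from the closed ball with center c, radius r."""
    while True:
        p = [rng.uniform(-r, r) for _ in c]
        if sum(t * t for t in p) <= r * r:
            return [ci + t for ci, t in zip(c, p)]

a, b = separating_hyperplane_for_balls([0.0, 0.0], 1.0, [3.0, 1.0], 1.5)
rng = random.Random(1)
for _ in range(500):
    x = sample_ball([0.0, 0.0], 1.0, rng)
    z = sample_ball([3.0, 1.0], 1.5, rng)
    assert sum(ai * xi for ai, xi in zip(a, x)) <= b
    assert sum(ai * zi for ai, zi in zip(a, z)) >= b
```

For balls the normal direction is simply the line through the two centers. For general closed convex sets one would instead use a pair of closest points, which this sketch does not attempt.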
Log-convexity and log-concavity
- Definition: $f : \mathbb{R}^n \to \mathbb{R}_{\gt 0}$ is log-convex (log-concave) if $\log (f)$ is convex (concave).
- Convexity:
$\log (f(\theta x + (1 - \theta) y)) \le \theta \log (f(x)) + (1 - \theta) \log (f(y)) = \log (f(x)^{\theta} f(y)^{1 - \theta})$
$\iff f(\theta x + (1 - \theta)y) \le f(x)^{\theta} f(y)^{1 - \theta}$
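- Remark 2: log-convex $\implies$ convex, since $f(x) = e^{\log f(x)}$ is the composition of the convex nondecreasing function $\exp$ with the convex function $\log f$ . QED
For positive $f$ , concave $\implies$ log-concave, since $\log$ is concave and nondecreasing.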
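
The multiplicative form of the definition, $f(\theta x + (1 - \theta)y) \le f(x)^{\theta} f(y)^{1 - \theta}$, is easy to test on sample points. The sketch below is an illustration under assumed examples that are not taken from the notes: $f(x) = e^{x^2}$ as a log-convex function, and $f(x) = x^2 + 1$ as a function that is convex but not log-convex, which shows the converse of Remark 2 fails. All helper names are hypothetical.

```python
import math
import random

def mix(x, y, theta):
    """Convex combination theta*x + (1 - theta)*y."""
    return theta * x + (1 - theta) * y

def satisfies_log_convexity(f, x, y, theta, rel_tol=1e-9):
    """Check f(mix) <= f(x)^theta * f(y)^(1-theta) for a positive f."""
    rhs = f(x) ** theta * f(y) ** (1 - theta)
    return f(mix(x, y, theta)) <= rhs * (1 + rel_tol)

def satisfies_convexity(f, x, y, theta, rel_tol=1e-9):
    """Check f(mix) <= theta*f(x) + (1-theta)*f(y) for a positive f."""
    rhs = theta * f(x) + (1 - theta) * f(y)
    return f(mix(x, y, theta)) <= rhs * (1 + rel_tol)

exp_square = lambda x: math.exp(x * x)  # log f = x^2 is convex
square_plus_one = lambda x: x * x + 1   # convex, but log f is not convex

rng = random.Random(2)
for _ in range(1000):
    x, y, theta = rng.uniform(-3, 3), rng.uniform(-3, 3), rng.random()
    assert satisfies_log_convexity(exp_square, x, y, theta)
    assert satisfies_convexity(exp_square, x, y, theta)  # as Remark 2 predicts
    assert satisfies_convexity(square_plus_one, x, y, theta)

# One point where x^2 + 1 violates the log-convexity inequality.
print(satisfies_log_convexity(square_plus_one, 1.0, 3.0, 0.5))
```

Passing on random samples is evidence, not a proof. A single failing triple $(x, y, \theta)$, on the other hand, does disprove log-convexity.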
