- Reminder of key notions. High-dimensional parameter space means $p \gg n$. Sparsity: the parameter vector has many zero entries, i.e., it lives in a low-dimensional subspace. When that subspace's dimension is smaller than $n$, the parameter is estimable.
- In a sparse high-dimensional (SHD) problem, the locations of the zeros are unknown; otherwise the parameter could simply be projected onto the lower-dimensional subspace and estimated there.
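A minimal numerical sketch of the point above (the data sizes, support locations, and coefficient values are illustrative, not from the notes): with $p \gg n$, full least squares is underdetermined, but if the zero locations were known, restricting to the small support makes the problem well-posed.

```python
import numpy as np

rng = np.random.default_rng(0)
n, p = 20, 50                         # p >> n
beta = np.zeros(p)
support = [3, 17, 41]                 # hypothetical locations of the nonzeros
beta[support] = [2.0, -1.5, 3.0]

X = rng.standard_normal((n, p))
y = X @ beta + 0.1 * rng.standard_normal(n)

# Full OLS is underdetermined: lstsq returns a minimum-norm interpolating
# solution, which is not the true beta.
beta_full, *_ = np.linalg.lstsq(X, y, rcond=None)

# Knowing the support reduces the problem to 3 parameters with n = 20
# observations, so ordinary least squares recovers the coefficients.
beta_sub, *_ = np.linalg.lstsq(X[:, support], y, rcond=None)
print(np.round(beta_sub, 2))          # close to the true values 2.0, -1.5, 3.0
```

The SHD difficulty is precisely that `support` is not available in practice.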
- Penalized log-likelihood can be interpreted as a posterior log density: the ridge penalty corresponds to a normal prior, the lasso penalty to a Laplace prior. The key insight of this method is to use a zero-inflated prior to shrink noise and a fat-tailed prior to preserve signal.
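A quick numerical check of the penalty/prior correspondence (the symbols `tau`, `b`, and `lam` are illustrative): the ridge penalty $\lambda\beta^2$ with $\lambda = 1/(2\tau^2)$ equals the negative log density of a $N(0,\tau^2)$ prior up to an additive constant, and the lasso penalty $\lambda|\beta|$ with $\lambda = 1/b$ matches a Laplace$(0, b)$ prior the same way.

```python
import numpy as np

betas = np.linspace(-3, 3, 7)

# Ridge <-> normal prior: -log N(beta; 0, tau^2) = beta^2/(2 tau^2) + const
tau = 0.7
lam_ridge = 1.0 / (2 * tau**2)
neg_log_normal = 0.5 * (betas / tau) ** 2 + 0.5 * np.log(2 * np.pi * tau**2)
diff_ridge = neg_log_normal - lam_ridge * betas**2
print(np.allclose(diff_ridge, diff_ridge[0]))   # True: constant offset only

# Lasso <-> Laplace prior: -log Laplace(beta; 0, b) = |beta|/b + const
b = 0.5
lam_lasso = 1.0 / b
neg_log_laplace = np.abs(betas) / b + np.log(2 * b)
diff_lasso = neg_log_laplace - lam_lasso * np.abs(betas)
print(np.allclose(diff_lasso, diff_lasso[0]))   # True: constant offset only
```

Since additive constants do not affect the argmax, maximizing the penalized log-likelihood is the same as finding the posterior mode under the corresponding prior.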
- Scale mixture of normals: $X = Y\sigma$, where $Y$ is standard normal and $\sigma$ is a continuous random variable on $(0, \infty)$ [West, 1987, Biometrika]. Many well-known distributions belong to this family: the t, logistic, and Laplace distributions, and, of course, the instantaneous distribution generated by stochastic-volatility Brownian motion in finance.
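One concrete instance of the family, as a simulation sketch (the rate `lam` is an illustrative symbol): by the well-known representation used in the Bayesian lasso, if $\sigma^2$ is exponential with rate $\lambda^2/2$, then $X = Y\sigma$ is marginally Laplace with density $(\lambda/2)e^{-\lambda|x|}$. Two checkable consequences for $\lambda = 1$: $E|X| = 1/\lambda = 1$ and the 90% quantile is $\ln 5 \approx 1.61$.

```python
import numpy as np

rng = np.random.default_rng(1)
lam, N = 1.0, 200_000

# sigma^2 ~ Exponential(rate = lam^2 / 2), i.e. mean 2 / lam^2
sigma2 = rng.exponential(scale=2 / lam**2, size=N)

# X = Y * sigma with Y standard normal -> marginally Laplace(rate lam)
x = np.sqrt(sigma2) * rng.standard_normal(N)

print(np.mean(np.abs(x)))      # close to 1/lam = 1.0
print(np.quantile(x, 0.9))     # close to ln(5)/lam ~ 1.61
```

The t distribution arises the same way with an inverse-gamma mixing law on $\sigma^2$; the mixing distribution is what controls the tail behavior of the marginal.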