VC-Dimension and Rademacher Complexity-based bounds


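This article gives an overview of the VC dimension and Rademacher complexity, two important theoretical concepts in machine learning. It explains how they measure a model's complexity and learning capacity, states the definitions of the VC dimension for set families and for classification models, and introduces the definition of Rademacher complexity.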


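VC dimension and Rademacher complexity are two measures of complexity that come up often in machine learning. I had only ever looked at them from a distance, so these are my study notes on both concepts.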

VC-Dimension

The full name is Vapnik-Chervonenkis dimension. Here is the definition from Wikipedia:

In Vapnik–Chervonenkis theory, the Vapnik–Chervonenkis (VC) dimension is a measure of the capacity (complexity, expressive power, richness, or flexibility) of a space of functions that can be learned by a statistical classification algorithm. It is defined as the cardinality of the largest set of points that the algorithm can shatter. It was originally defined by Vladimir Vapnik and Alexey Chervonenkis.[1]

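In short: the VC dimension measures the capacity of the function space that a statistical classification algorithm can learn, and it equals the cardinality of the largest set of points that the algorithm can shatter.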

VC dimension of a set family

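Let $H$ be a set family (a set of sets) and let $C$ be a set. Their intersection is defined as the set family

$$H \cap C := \{\, h \cap C \mid h \in H \,\}.$$

We say that the set $C$ is shattered by $H$ if $H \cap C$ contains every subset of $C$, that is:

$$|H \cap C| = 2^{|C|}.$$

The VC dimension of $H$ is the largest cardinality of a set shattered by $H$. If arbitrarily large sets can be shattered, the VC dimension is $\infty$.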
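The set-family definition can be checked by brute force on small finite examples. The sketch below is only an illustration: the names `restrict`, `is_shattered`, and `vc_dimension` are made up for this note, and it assumes a non-empty finite family over a small ground set, since the search is exponential.

```python
from itertools import combinations


def restrict(family, c):
    """Return the restricted family {h ∩ c : h in family} as a set of frozensets."""
    c = frozenset(c)
    return {frozenset(h) & c for h in family}


def is_shattered(family, c):
    """c is shattered when the restricted family contains all 2^|c| subsets of c."""
    return len(restrict(family, c)) == 2 ** len(set(c))


def vc_dimension(family):
    """Largest size of a shattered subset of the ground set (brute force).

    Assumes a non-empty finite family. Stopping at the first size with no
    shattered set is safe because every subset of a shattered set is shattered.
    """
    ground = sorted(set().union(*family))
    best = 0
    for k in range(1, len(ground) + 1):
        if any(is_shattered(family, c) for c in combinations(ground, k)):
            best = k
        else:
            break
    return best


# Hypothetical example family over the ground set {1, 2, 3}.
example_family = [set(), {1}, {2}, {1, 2}, {1, 2, 3}]
print(vc_dimension(example_family))
```

Here `{1, 2}` is shattered because all four of its subsets appear as intersections, while the family has only five members and so cannot produce the eight subsets of `{1, 2, 3}`.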

VC dimension of a classification model

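A classification model $f$ with parameter vector $\theta$ is said to shatter a set of data points $(x_1, x_2, \ldots, x_n)$ if, for every assignment of labels to those points, there exists a $\theta$ such that $f$ makes no errors when evaluated on that set of points.
The VC dimension of the model $f$ is the maximum number of points that can be arranged so that $f$ shatters them.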
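To make the definition concrete, here is a minimal sketch using a model that is not part of the original notes: a hypothetical one-dimensional threshold classifier that predicts 1 when `x >= theta`. Since its predictions only change when `theta` crosses a data point, trying one threshold below all points, one between each adjacent pair, and one above all points covers every labeling the model can produce.

```python
def threshold_predict(x, theta):
    """Hypothetical 1D model: label 1 if x >= theta, else 0."""
    return 1 if x >= theta else 0


def candidate_thetas(points):
    """One threshold per distinct behaviour: below, between, and above the points."""
    xs = sorted(set(points))
    mids = [(a + b) / 2 for a, b in zip(xs, xs[1:])]
    return [xs[0] - 1.0] + mids + [xs[-1] + 1.0]


def shatters(points):
    """True if every one of the 2^n labelings is produced by some theta."""
    achieved = {
        tuple(threshold_predict(x, theta) for x in points)
        for theta in candidate_thetas(points)
    }
    return len(achieved) == 2 ** len(points)


print(shatters([0.0]))       # a single point
print(shatters([0.0, 1.0]))  # two points
```

A single point can receive either label, but with two points the labeling "left point 1, right point 0" is never produced, so under these assumptions this toy model has VC dimension 1.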

Example

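For example, take a straight line as the classifier in the plane. Three points that are not collinear can be shattered, because each of the $2^3 = 8$ labelings is realized by some line. No set of four points can be shattered; for the four corners of a square, for instance, no line separates one diagonal pair from the other. The VC dimension is therefore 3.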
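The line example can be explored numerically. The sketch below enumerates every labeling of a point set and uses a perceptron with a capped number of passes as the separability test. This is a heuristic choice made for this note: convergence proves a labeling is separable, while hitting the cap (`max_epochs`, an arbitrary value here) is only taken as evidence that it is not. Integer coordinates keep the arithmetic exact.

```python
from itertools import product


def linearly_separable(points, labels, max_epochs=1000):
    """Perceptron with bias on 2D points; labels are +1 or -1.

    Returns True once a full pass makes no mistakes (strict separation found).
    Returns False if the epoch cap is reached, which is a heuristic verdict.
    """
    w1 = w2 = b = 0
    for _ in range(max_epochs):
        mistakes = 0
        for (x, y), t in zip(points, labels):
            if t * (w1 * x + w2 * y + b) <= 0:
                w1 += t * x
                w2 += t * y
                b += t
                mistakes += 1
        if mistakes == 0:
            return True
    return False


def line_shatters(points):
    """True if every +1/-1 labeling of the points passes the separability test."""
    return all(
        linearly_separable(points, labels)
        for labels in product((-1, 1), repeat=len(points))
    )


triangle = [(0, 0), (1, 0), (0, 1)]
square = [(0, 0), (1, 0), (0, 1), (1, 1)]
print(line_shatters(triangle), line_shatters(square))
```

The square fails on the labeling that puts the two diagonals in opposite classes. Three collinear points also fail, which is why the definition asks only for some arrangement of the points that can be shattered.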

Rademacher complexity

In computational learning theory (machine learning and theory of computation), Rademacher complexity, named after Hans Rademacher, measures richness of a class of real-valued functions with respect to a probability distribution.

In other words, Rademacher complexity (named after Hans Rademacher) measures the richness of a class of real-valued functions relative to a probability distribution.
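For a fixed sample $x_1, \ldots, x_m$, the standard empirical version of this quantity is $\mathbb{E}_\sigma\big[\sup_{f \in F} \frac{1}{m}\sum_{i=1}^{m} \sigma_i f(x_i)\big]$, where the $\sigma_i$ are independent random signs, each $+1$ or $-1$ with probability 1/2. The notes above do not spell this formula out, so the sketch below is an added illustration. It assumes a finite function class represented by its output vectors on the sample, and it enumerates all $2^m$ sign vectors, which is only feasible for small $m$. The function name `empirical_rademacher` is made up for this note.

```python
from itertools import product


def empirical_rademacher(outputs):
    """Exact empirical Rademacher complexity of a finite function class.

    outputs: list of vectors, one per function f, each (f(x_1), ..., f(x_m)).
    Averages the best achievable correlation with the signs over all 2^m
    sign vectors sigma, then divides by the sample size m.
    """
    m = len(outputs[0])
    total = 0.0
    for sigma in product((-1, 1), repeat=m):
        total += max(sum(s * v for s, v in zip(sigma, vec)) for vec in outputs)
    return total / (2 ** m) / m


# A single function cannot follow random signs; a class containing every
# sign pattern on two points can follow them perfectly.
single = [[1, 1]]
all_patterns = [list(p) for p in product((-1, 1), repeat=2)]
print(empirical_rademacher(single), empirical_rademacher(all_patterns))
```

The richer the class, the better some member correlates with pure noise, so the value grows from 0 for a single function to 1 for a class that realizes every sign pattern on the sample.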