Linear/Logistic/Softmax Regression: A Comparison

License: CC 4.0 BY-SA

Tags: #machine-learning #generalized-linear-models #linear-models #logistic-regression #softmax

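This article compares Linear Regression, Logistic Regression, and Softmax Regression, three common machine learning models that all belong to the family of generalized linear models. Linear Regression is used for regression, Logistic Regression for binary classification, and Softmax Regression for multi-class classification. The three are similar in their model outputs, loss functions, and gradients, and Softmax Regression can be viewed as the multi-class extension of Logistic Regression.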

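Linear, Logistic, and Softmax Regression are common machine learning models. All three are generalized linear models and share many similarities, so this post compares them in detail. For the original post, see Linear/Logistic/Softmax Regression对比.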

Overview

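Linear Regression is a regression model, Logistic Regression is a binary classification model, and Softmax Regression is a multi-class classification model. All three are nevertheless generalized linear models (GLMs), meaning each is built on a linear combination of the inputs.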

Of the three, Softmax Regression can be viewed as the extension of Logistic Regression to multiple classes.

Softmax Regression (synonyms: Multinomial Logistic, Maximum Entropy Classifier, or just Multi-class Logistic Regression) is a generalization of logistic regression that we can use for multi-class classification (under the assumption that the classes are mutually exclusive).
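The relationship can be checked numerically. The sketch below assumes the standard definitions of the sigmoid and softmax functions, which this section does not spell out, and uses made-up values for `x`, `theta_1`, and `theta_2`. It suggests that a two-class softmax with parameter vectors $\theta^{(1)}$ and $\theta^{(2)}$ gives the same class-1 probability as a logistic model whose single parameter vector is $\theta^{(1)} - \theta^{(2)}$.

```python
import math


def sigmoid(z):
    """Logistic function: maps a real score to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def softmax(scores):
    """Softmax over a list of real scores; subtracting the max avoids overflow."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    """Inner product of two equal-length lists."""
    return sum(a * b for a, b in zip(u, v))


# Hypothetical example values, chosen only for illustration.
x = [1.0, 2.0, -0.5]
theta_1 = [0.3, -0.2, 0.8]   # parameters for class 1
theta_2 = [-0.1, 0.4, 0.5]   # parameters for class 2

# Two-class softmax: probability assigned to class 1.
p_softmax = softmax([dot(theta_1, x), dot(theta_2, x)])[0]

# Logistic regression using the difference of the two parameter vectors.
theta_diff = [a - b for a, b in zip(theta_1, theta_2)]
p_logistic = sigmoid(dot(theta_diff, x))

print(p_softmax, p_logistic)  # the two values agree up to rounding
```

The agreement follows from dividing the numerator and denominator of the two-class softmax by $e^{\theta^{(1)T}x}$, which leaves $1 / (1 + e^{-(\theta^{(1)} - \theta^{(2)})^T x})$.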

Notation

Sample: $(x^{(i)}, y^{(i)})$
Number of samples: $m$
Feature dimension: $n$
Linear Regression output: $y^{(i)}$
Logistic Regression class label: $y^{(i)}\in\{0,1\}$
Softmax Regression class label: $y^{(i)}\in\{1,\ldots,K\}$
Number of Softmax Regression classes: $K$
Loss function: $J(\theta)$
Indicator function: $I\{boolean\}$
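As a small illustration of the indicator function, the sketch below evaluates $I\{y = k\}$ for each class $k \in \{1,\ldots,K\}$. The helper name `indicator` and the values of `K` and `y` are hypothetical, picked only for this example.

```python
def indicator(condition):
    """I{condition}: 1 if the condition is true, 0 otherwise."""
    return 1 if condition else 0


K = 3  # hypothetical number of classes
y = 2  # hypothetical label, using the 1-based convention {1, ..., K}

# Evaluate I{y = k} for every class k; exactly one entry is 1.
one_hot = [indicator(y == k) for k in range(1, K + 1)]
print(one_hot)  # [0, 1, 0]
```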

Model Parameters Compared

Linear Regression: a vector of dimension $(n \cdot 1)$

$\theta = \begin{bmatrix} \vert \\ \theta \\ \vert \end{bmatrix}$

Logistic Regression: a vector of dimension $(n \cdot 1)$

$\theta = \begin{bmatrix} \vert \\ \theta \\ \vert \end{bmatrix}$

Softmax Regression: a matrix of dimension $(n \cdot K)$, with one column $\theta^{(k)}$ per class

$\theta = \begin{bmatrix} \vert & \vert & \vert & \vert \\ \theta^{(1)} & \theta^{(2)} & \dots & \theta^{(K)} \\ \vert & \vert & \vert & \vert \\ \end{bmatrix}$
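To make the shapes concrete, here is a minimal sketch in plain Python. It assumes the usual hypothesis functions ($\theta^T x$ for Linear Regression, the sigmoid of $\theta^T x$ for Logistic Regression, and the softmax of the per-class scores $\theta^{(k)T} x$ for Softmax Regression), which are not stated in this section. All numeric values are made up, and the bias term is omitted for brevity.

```python
import math

n, K = 3, 4  # hypothetical feature dimension and number of classes


def dot(u, v):
    """Inner product of two equal-length lists."""
    return sum(a * b for a, b in zip(u, v))


x = [0.5, -1.0, 2.0]  # one sample with n features

# Linear and Logistic Regression: a single vector with n entries.
theta_linear = [0.1, 0.2, 0.3]
theta_logistic = [-0.4, 0.1, 0.2]

# Softmax Regression: n rows and K columns; column k holds theta^(k).
theta_softmax = [
    [0.2, -0.1, 0.0, 0.5],
    [0.1, 0.3, -0.4, 0.0],
    [-0.3, 0.2, 0.1, 0.4],
]

# Linear Regression output: one real number.
y_linear = dot(theta_linear, x)

# Logistic Regression output: one probability in (0, 1).
p_logistic = 1.0 / (1.0 + math.exp(-dot(theta_logistic, x)))

# Softmax Regression output: K probabilities that sum to 1.
columns = [[row[k] for row in theta_softmax] for k in range(K)]
scores = [dot(col, x) for col in columns]
shift = max(scores)  # subtract the max score for numerical stability
exps = [math.exp(s - shift) for s in scores]
total = sum(exps)
p_softmax = [e / total for e in exps]

print(y_linear, p_logistic, p_softmax)
```

The only structural difference is that Softmax Regression keeps one parameter column per class, so its output is a vector of $K$ probabilities rather than a single number.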