The Multiclass SVM loss for the i-th example is then formalized as follows:
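    L_i = \sum_{j \neq y_i} \max\big(0,\; s_j - s_{y_i} + \Delta\big)

where s_j is the score assigned to class j for example x_i (the j-th entry of x_i W), y_i is the correct class, and \Delta is the margin (set to 1.0 in the code below).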
The most common regularization penalty is the L2 norm that discourages large weights through an elementwise quadratic penalty over all parameters:
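    R(W) = \sum_k \sum_l W_{k,l}^2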
The full Multiclass SVM loss becomes:
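    L = \frac{1}{N} \sum_i L_i \;+\; \lambda R(W)

where N is the number of training examples and \lambda is the regularization strength (reg in the code).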
Gradient:
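As derived in the referenced cs231n notes, the per-example loss depends on W only through the scores, and the column-wise gradients are

    \nabla_{w_{y_i}} L_i = -\left( \sum_{j \neq y_i} \mathbb{1}\big(s_j - s_{y_i} + \Delta > 0\big) \right) x_i

    \nabla_{w_j} L_i = \mathbb{1}\big(s_j - s_{y_i} + \Delta > 0\big)\, x_i \qquad (j \neq y_i)

where \mathbb{1}(\cdot) is the indicator function and w_j is the j-th column of W. The vectorized code below implements exactly this: positive margins are turned into ones, the correct-class column receives minus the row sum, and the regularization term adds 2 \lambda W.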
import numpy as np

def svm_loss_vectorized(W, X, y, reg):
    """
    Structured SVM loss function, vectorized implementation.

    Inputs have dimension D, there are C classes, and we operate on
    minibatches of N examples.

    Inputs:
    - W: A numpy array of shape (D, C) containing weights.
    - X: A numpy array of shape (N, D) containing a minibatch of data.
    - y: A numpy array of shape (N,) containing training labels; y[i] = c
      means that X[i] has label c, where 0 <= c < C.
    - reg: (float) regularization strength

    Returns a tuple of:
    - loss as single float
    - gradient with respect to weights W; an array of same shape as W
    """
    num_train = X.shape[0]                                # N
    scores = X.dot(W)                                     # (N, C) class scores
    scores_correct = scores[np.arange(num_train), y]      # (N,) correct-class scores
    scores_correct = np.reshape(scores_correct, (-1, 1))  # (N, 1) for broadcasting
    margins = scores - scores_correct + 1.0               # delta = 1.0
    margins[np.arange(num_train), y] = 0                  # skip j == y_i
    margins[margins < 0] = 0                              # max(0, s_j - s_{y_i} + delta)
    loss = np.sum(margins) / num_train + reg * np.sum(W * W)

    # Backward pass: each positive margin contributes +x_i to column j
    # and -x_i to the correct-class column y_i.
    margins[margins > 0] = 1.0
    row_sum = margins.sum(axis=1)                         # (N,) count of positive margins per example
    margins[np.arange(num_train), y] = -row_sum
    dW = X.T.dot(margins) / num_train + 2 * reg * W       # (D, C); gradient of reg*sum(W^2) is 2*reg*W

    return loss, dW
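As a quick sanity check, the analytic gradient can be compared against a centered finite-difference estimate on a tiny random problem. The sketch below assumes the svm_loss_vectorized defined above; numerical_grad_check is a helper name introduced here only for illustration.

import numpy as np

def numerical_grad_check(W, X, y, reg, num_checks=5, h=1e-5):
    """Compare analytic dW against a centered finite-difference estimate
    at a few randomly chosen entries of W (hypothetical helper)."""
    loss, dW = svm_loss_vectorized(W, X, y, reg)
    for _ in range(num_checks):
        ix = tuple(np.random.randint(n) for n in W.shape)
        old = W[ix]
        W[ix] = old + h
        loss_plus, _ = svm_loss_vectorized(W, X, y, reg)
        W[ix] = old - h
        loss_minus, _ = svm_loss_vectorized(W, X, y, reg)
        W[ix] = old                                       # restore original value
        grad_numeric = (loss_plus - loss_minus) / (2 * h)
        grad_analytic = dW[ix]
        rel_error = abs(grad_numeric - grad_analytic) / (abs(grad_numeric) + abs(grad_analytic) + 1e-12)
        print('numerical: %f  analytic: %f  relative error: %e' % (grad_numeric, grad_analytic, rel_error))

# Example usage on random data:
np.random.seed(0)
N, D, C = 10, 5, 3
X = np.random.randn(N, D)
y = np.random.randint(C, size=N)
W = np.random.randn(D, C) * 0.01
numerical_grad_check(W, X, y, reg=0.1)

Relative errors around 1e-7 or smaller indicate the analytic gradient matches the numerical one.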
Reference: http://cs231n.github.io/optimization-1/