[Beginner Notes] Training a Model with Gradient Descent

Original post, published 2025-12-17 22:24:39 · 120 views

License: CC BY-SA 4.0

import numpy as np

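# Read the number of training samples, k
k = int(input())
# Read the training data: k*4 integers (3 features + 1 label per sample)
data = list(map(int, input().split()))
# Read the number of samples to predict, n
n = int(input())
# Read the data to predict: n*3 integers (3 features per sample)
pred_data = list(map(int, input().split()))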

# Build the training set: x (features) and y (labels)
x = []
y = []
for i in range(k):
    start = i * 4
    feature = data[start:start + 3]      # the first 3 values are the features
    price = data[start + 3]              # the 4th value is the price (label)
    x.append(feature)
    y.append(price)

x = np.array(x, dtype=float)             # convert to float (needed for gradient descent)
x = np.c_[np.ones(x.shape[0]), x]        # prepend a bias column of ones
y = np.array(y, dtype=float)

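# ========== Gradient descent in place of the normal equation ==========
m, p = x.shape                           # m: number of samples, p: number of features (including bias)
theta = np.zeros(p)                      # initialize theta to zeros

learning_rate = 0.01                     # learning rate (tunable)
num_iterations = 10000                   # number of iterations (tunable)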

# Main gradient descent loop
for _ in range(num_iterations):
    # Compute the predictions
    y_pred = x @ theta
    # Compute the gradient: (1/m) * X^T (Xθ - y)
    gradient = (1 / m) * x.T @ (y_pred - y)
    # Update the parameters
    theta = theta - learning_rate * gradient

# ========== Prediction (unchanged) ==========
x_pred = []
for i in range(n):
    start = i * 3
    feature = pred_data[start:start + 3]
    x_pred.append(feature)

x_pred = np.array(x_pred, dtype=float)
x_pred = np.c_[np.ones(x_pred.shape[0]), x_pred]  # prepend a bias column of ones

# Predict with the learned theta
y_pred = x_pred @ theta
y_pred_rounded = [round(i) for i in y_pred]

# Print the results joined into a single string
# (no separator; use ' '.join(...) for space-separated output)
print(''.join(map(str, y_pred_rounded)))
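
One caveat about the script above: with a fixed step size, gradient descent on a least-squares loss only converges when `learning_rate` is below 2 divided by the largest eigenvalue of `(1/m) * x.T @ x`. With raw integer features (for example, areas in the hundreds), `learning_rate = 0.01` may be too large and `theta` can overflow. A common remedy is to standardize the features first. The sketch below is one way to do that, not part of the original solution; the helper names `fit_gd_standardized` and `predict_standardized`, the default hyperparameters, and the synthetic data are all assumptions made for illustration.

```python
import numpy as np

def fit_gd_standardized(features, targets, learning_rate=0.1, num_iterations=5000):
    """Hypothetical helper: batch gradient descent on z-scored features."""
    features = np.asarray(features, dtype=float)
    targets = np.asarray(targets, dtype=float)

    # Z-score each feature column so all columns share a similar scale
    mean = features.mean(axis=0)
    std = features.std(axis=0)
    std[std == 0] = 1.0                      # avoid dividing by zero for constant columns
    scaled = (features - mean) / std

    x = np.c_[np.ones(scaled.shape[0]), scaled]   # prepend a bias column of ones
    m = x.shape[0]
    theta = np.zeros(x.shape[1])
    for _ in range(num_iterations):
        gradient = (1 / m) * x.T @ (x @ theta - targets)
        theta = theta - learning_rate * gradient
    return theta, mean, std


def predict_standardized(features, theta, mean, std):
    """Apply the training-set scaling, then the learned theta."""
    scaled = (np.asarray(features, dtype=float) - mean) / std
    return np.c_[np.ones(scaled.shape[0]), scaled] @ theta


# Synthetic example (made-up numbers): price = 50 + 3*area + 10*rooms - 2*age
rng = np.random.default_rng(0)
train_features = np.c_[
    rng.uniform(50, 200, size=40),           # area
    rng.integers(1, 6, size=40),             # rooms
    rng.uniform(0, 30, size=40),             # age
]
train_prices = (50 + 3 * train_features[:, 0]
                + 10 * train_features[:, 1]
                - 2 * train_features[:, 2])

theta, mean, std = fit_gd_standardized(train_features, train_prices)
new_features = np.array([[120.0, 3.0, 10.0]])
print(predict_standardized(new_features, theta, mean, std))
```

The prediction step must reuse the `mean` and `std` computed on the training set; rescaling the new samples with their own statistics would feed the model inputs on a different scale than it was trained on.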

🔧 Key changes

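| Original code (normal equation) | New code (gradient descent) |
| --- | --- |
| `theta = np.linalg.inv(x.T@x)@x.T@y` | Updates `theta` iteratively in a `for` loop |
| No hyperparameters to set | Requires setting `learning_rate` and `num_iterations` |
| Solves in one step | Approaches the optimal solution over many iterations |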
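
To make the comparison concrete, the following sketch runs both approaches on the same small dataset and checks that they land on nearly the same `theta`. The data is synthetic and noise-free (an assumption for illustration), and the normal equation is solved with `np.linalg.solve` rather than `np.linalg.inv`, a substitution that avoids forming the explicit inverse but computes the same solution when `x.T @ x` is invertible.

```python
import numpy as np

# Synthetic data (made up for illustration): y = 4 + 2*x1 - 3*x2, no noise
rng = np.random.default_rng(0)
raw = rng.uniform(-1.0, 1.0, size=(50, 2))
x = np.c_[np.ones(raw.shape[0]), raw]        # prepend a bias column of ones
y = 4.0 + 2.0 * raw[:, 0] - 3.0 * raw[:, 1]

# Normal equation: solve (X^T X) theta = X^T y in one step
theta_normal = np.linalg.solve(x.T @ x, x.T @ y)

# Gradient descent: approach the same solution over many iterations
m = x.shape[0]
theta_gd = np.zeros(x.shape[1])
learning_rate = 0.1                          # hyperparameter (assumed value)
num_iterations = 5000                        # hyperparameter (assumed value)
for _ in range(num_iterations):
    gradient = (1 / m) * x.T @ (x @ theta_gd - y)
    theta_gd = theta_gd - learning_rate * gradient

# Largest absolute difference between the two parameter vectors
max_gap = np.max(np.abs(theta_normal - theta_gd))
print("normal equation :", theta_normal)
print("gradient descent:", theta_gd)
print("max gap         :", max_gap)
```

The features here are already on a small, uniform scale, which is why a plain fixed learning rate works; on data with large or uneven feature ranges the same loop may need a smaller step or rescaled inputs.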