Python——实现感知机_python实现感知机-优快云博客

本文链接：https://blog.youkuaiyun.com/kouweizhu/article/details/146507211

文章目录

一、准备工作

1. 数据文件

新建txt文件，打开并写入以下数据，保存为images1.csv文件

x1,x2,y
153,432,-1
220,262,-1
118,214,-1
474,384,1
485,411,1
233,430,-1
396,321,1
484,349,1
429,259,1
286,220,1
399,433,-1
403,300,1
252,34,1
497,372,1
379,416,-1
76,163,-1
263,112,1
26,193,-1
61,473,-1
420,253,1

2. 数据展示图

在图中用圆点表示y=1的数据、用叉号表示y=−1的数据

3. 判别式和权重更新表达式

感知机停止学习的标准是根据精度来决定是否停止，这里姑且先重复10次。

4. 绘图部分

使权重向量成为法线向量的直线方程是内积为0的x的集合。所以对它进行移项变形，最终绘出以下表达式的图形

二、完整代码

import numpy as np
import matplotlib.pyplot as plt

# 读入训练数据
train = np.loadtxt('images1.csv', delimiter=',', skiprows=1)
train_x = train[:,0:2]
train_y = train[:,2]

# 权重初始化
w = np.random.rand(2)   # 随机初始化权重向量 w，维度为2（对应两个特征）

# 判别函数
def f(x):
    if np.dot(w, x) >= 0:
        return 1
    else:
        return -1

"""
感知机是在线学习算法，每次发现一个错误样本就立即更新权重，而不是等到所有样本处理完（与批量梯度下降不同）
想象你在考试前复习错题：
epoch=10：表示你把错题本复习10遍。
内层循环：每一遍复习时，你逐个查看错题（x 是题目，y 是正确答案），如果发现自己还是做错了，就调整你的解题思路（更新权重w）
"""

epoch = 10

# 更新次数
count = 0

# 学习权重
for _ in range(epoch):
    # 将特征矩阵 train_x 和标签向量 train_y 按行一一配对，生成 (x, y) 元组。
    # 例如：若 train_x 是 [[1,2], [3,4]]，train_y 是 [1, -1]，则 zip 后得到 ([1,2], 1) 和 ([3,4], -1)。
    for x, y in zip(train_x, train_y):
        if f(x) != y:
            w = w + y * x    #如果预测结果f(x) 与真实标签y不一致，更新权重w

            # 输出日志
            count += 1
            print('第 {} 次 : w = {}'.format(count, w))

# 绘图确认
x1 = np.arange(0, 500)
plt.plot(train_x[train_y ==  1, 0], train_x[train_y ==  1, 1], 'o')
plt.plot(train_x[train_y == -1, 0], train_x[train_y == -1, 1], 'x')
plt.plot(x1, -w[0] / w[1] * x1, linestyle='dashed')     # 绘制分类直线（斜截式）
plt.show()