AI嵌入式K210项目（25）-手写数字识别

K210嵌入式平台上的数字识别实验与教程

原创已于 2024-02-05 17:09:35 修改 · 1.8k 阅读

18 ·

CC 4.0 BY-SA版权

文章标签：

#人工智能 #K210 #嵌入式 #手写数字识别 #AI嵌入式

于 2024-01-30 15:52:36 首次发布

K210开发板专栏收录该内容

30 篇文章

订阅专栏

本文详细描述了如何在K210开发板上使用KPU进行手写和打印数字识别，包括实验准备、过程中的图像预处理和模型运行，以及如何通过IDE连接和调整阈值优化识别准确率。

文章目录

前言
一、实验准备
二、实验过程
三、实验结果
总结

前言

本节课主要学习K210识别数字的功能，能识别手写的数字和打印的数字。

一、实验准备

请先将模型文件导入内存卡上，再将内存卡插入到K210开发板的内存卡插槽上，具体操作步骤请参考：

AI嵌入式K210项目（21）-AI模型文件导入至TF卡

本实验使用/sd/KPU/mnist/uint8_mnist_cnn_model.kmodel模型；

数字识别需要用的内存卡加载模型文件，所以需要提前将模型文件导入内存卡，再将内存卡插入K210开发板的内存卡卡槽里，如果无法读取到内存卡里的模型文件，则会报错。

二、实验过程

导入相关库，并初始化摄像头和LCD显示屏；

import sensor, image, time, lcd
from maix import KPU
import gc

lcd.init(freq=15000000)
sensor.reset()                      # Reset and initialize the sensor. It will
                                    # run automatically, call sensor.run(0) to stop
sensor.set_pixformat(sensor.RGB565) # Set pixel format to RGB565 (or GRAYSCALE)
sensor.set_framesize(sensor.QVGA)   # Set frame size to QVGA (320x240)
sensor.set_windowing((224, 224))
sensor.skip_frames(time = 1000)     # Wait for settings take effect.
clock = time.clock()                # Create a clock object to track the FPS.

初始化KPU相关的参数，kpu需要加载kmodel文件，本次实验需要的模型文件路径为：/sd/KPU/mnist/uint8_mnist_cnn_model.kmodel

kpu = KPU()
kpu.load_kmodel("/sd/KPU/mnist/uint8_mnist_cnn_model.kmodel")

新建while循环读取摄像头画面，然后复制一个112*112大小的画面，对像素进行取反等处理，再将图像传入KPU里进行计算，与模型文件做运算，最终得到最优识别结果和识别分数。

while True:
    gc.collect()
    img = sensor.snapshot()
    img_mnist1=img.to_grayscale(1)        #convert to gray
    img_mnist2=img_mnist1.resize(112,112)
    a=img_mnist2.invert()                 #invert picture as mnist need
    a=img_mnist2.strech_char(1)           #preprocessing pictures, eliminate dark corner
    a=img_mnist2.pix_to_ai()

    out = kpu.run_with_output(img_mnist2, getlist=True)
    max_mnist = max(out)
    index_mnist = out.index(max_mnist)
    #score = KPU.sigmoid(max_mnist)
    display_str = "num: %d" % index_mnist
    print(display_str)
    a=img.draw_string(4,3,display_str,color=(0,0,0),scale=2)
    lcd.display(img)

kpu.deinit()