Inference results of an INT8-quantized ResNet50 on the ImageNet dataset

JachinMa

Article link: https://blog.youkuaiyun.com/JachinMa/article/details/106505953

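This article presents inference results on the ImageNet dataset for a ResNet50 model quantized to 8-bit precision (all layers except the fully connected layer). Running on 1 and 3 CPU cores, the quantized model's fps improved by a factor of 1.x over the full-precision model, and average latency dropped by 30% to 50%. Although the quantized model's fps fluctuates and shows anomalous values at some batch sizes, the performance gain is clear. The inference code is single-threaded; the official samples also provide a multi-threaded approach. The quantization method is Post-Training Optimization, which calibrates the weights and the activations.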

All layers of ResNet50 except the fully connected layer were quantized to 8 bits.
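To make "quantized to 8 bits" concrete, here is a minimal sketch of symmetric per-tensor INT8 quantization. It is only an illustration of the general idea, not the algorithm the toolkit actually applies; the function names `quantize_int8` and `dequantize` and the sample weight values are made up for this example, and plain Python lists are used to keep it dependency-free.

```python
def quantize_int8(values):
    """Symmetric per-tensor quantization: map floats to integers in [-127, 127].

    Illustrative only; real toolkits calibrate scales per layer or per channel.
    """
    max_abs = max(abs(v) for v in values)
    # Guard against an all-zero tensor, where any scale would do.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    quantized = [max(-127, min(127, round(v / scale))) for v in values]
    return quantized, scale


def dequantize(quantized, scale):
    """Map the integers back to approximate float values."""
    return [q * scale for q in quantized]


# Hypothetical weight values, chosen only for demonstration.
weights = [0.42, -1.27, 0.03, 0.90, -0.55]
q_weights, scale = quantize_int8(weights)
restored = dequantize(q_weights, scale)
max_error = max(abs(w - r) for w, r in zip(weights, restored))
print("quantized:", q_weights)
print("scale:", scale, "max abs error:", max_error)
```

The rounding error of each value is bounded by half the scale, which is why a well-chosen scale (found by calibration) matters for accuracy.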

The results are shown in the figures below.
[Figure] Quantized model, using 1 CPU core.

[Figure] Quantized model, using 3 CPU cores.

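For comparison, these are the results for the full-precision model.
[Figure] Full-precision model, 1 CPU core.

[Figure] Full-precision model, 3 CPU cores.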

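Comparing the results, the quantized model's fps is 1.x times that of the full-precision model, while average latency drops by 30% and 50% for the 1-core and 3-core runs respectively.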
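For readers who want to reproduce this kind of comparison from their own measurements, the arithmetic can be sketched as follows. The helper name `compare_runs` is hypothetical, and the numbers passed to it are placeholders, not the measurements from this article.

```python
def compare_runs(fp32_fps, int8_fps, fp32_latency_ms, int8_latency_ms):
    """Return (fps speedup factor, latency reduction in percent)."""
    speedup = int8_fps / fp32_fps
    latency_reduction_pct = 100.0 * (fp32_latency_ms - int8_latency_ms) / fp32_latency_ms
    return speedup, latency_reduction_pct


# Placeholder values for illustration only; substitute your own measurements.
speedup, reduction = compare_runs(
    fp32_fps=20.0, int8_fps=30.0,
    fp32_latency_ms=100.0, int8_latency_ms=70.0,
)
print("fps speedup: %.2fx, latency reduction: %.1f%%" % (speedup, reduction))
```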

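I also ran inference with the converted IR model on the CPU under Ubuntu. The fps matched the DL Workbench results; I did not measure latency because I did not know how to.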
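One possible way to measure latency alongside fps is to time each synchronous inference call with `time.perf_counter`. The sketch below is an assumption about how this could be done, not the method DL Workbench uses; `benchmark` and `fake_infer` are hypothetical names, and `fake_infer` merely sleeps to stand in for a real inference call.

```python
import statistics
import time


def benchmark(infer, num_iterations=100, batch_size=1, warmup=10):
    """Time repeated calls to `infer` and report throughput and latency."""
    # Warm-up runs are excluded so one-time setup costs do not skew the numbers.
    for _ in range(warmup):
        infer()

    latencies_ms = []
    start = time.perf_counter()
    for _ in range(num_iterations):
        t0 = time.perf_counter()
        infer()
        latencies_ms.append((time.perf_counter() - t0) * 1000.0)
    total_s = time.perf_counter() - start

    return {
        "fps": num_iterations * batch_size / total_s,
        "mean_latency_ms": statistics.mean(latencies_ms),
        "median_latency_ms": statistics.median(latencies_ms),
    }


def fake_infer():
    # Hypothetical stand-in for a real inference call.
    time.sleep(0.002)


stats = benchmark(fake_infer, num_iterations=20, warmup=2)
print(stats)
```

In an Inference Engine script, `infer` could wrap the synchronous call, for example `lambda: exec_net.infer(inputs={input_blob: images})`, assuming `exec_net`, `input_blob`, and `images` have been set up as in Intel's classification sample.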

The Python inference code is as follows:

#!/usr/bin/env python
"""
 Copyright (C) 2018-2020 Intel Corporation

 Licensed under the Apache License, Version 2.0 (the "License");
 you may not use this file except in compliance with the License.
 You may obtain a copy of the License at

      http://www.apache.org/licenses/LICENSE-2.0

 Unless required by applicable law or agreed to in writing, software
 distributed under the License is distributed on an "AS IS" BASIS,
 WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
 See the License for the specific language governing permissions and
 limitations under the License.
"""
from __future__ import print_function
import sys
import os
from argparse import ArgumentParser, SUPPRESS
import cv2
import numpy as np
import logging as log
from openvino.inference_engine import IECore
import time

def build_argparser():
    parser = ArgumentParser(add_help=False)
    args = parser.add_argument_group('Options')
    args.add_argument('-h', '--help', action='help', default=SUPPRESS, help='Show this help message and exit.')
    args.add_argument("-m", "--model", help="Required. Path to an .xml file with a trained model.", required=True,
                      type=str)
    args.add_argument("-i", "--input", help="Required. Path to a folder with images or path to an image files",
                      required=True,
                      type=str, nargs="+")
    args.add_argument("-l", "--cpu_extension",
                      help="Optional. Required for CPU custom layers. "
                           "MKLDNN (CPU)-targeted custom layers. Absolute path to a shared library with the"
                           " kernels implementations.", type=str, default=None)
    args.add_argument("-d", "--device"
