目录
NeuroPilot 离线工具(Quantization, Converter etc.)常见问题
**在将模型从Caffe/PyTorch转换为TFLite格式后,因为输入从NCHW更改为NHWC,需要额外添加一个转置节点,有可能避免增加这个额外的操作吗?
**Per-axis Quantization(Per-channel Quantization)和 Per-tensor Quantization两种量化方式解析
**模型量化后精度不达标的情况,MTK建议有什么继续优化的方案? 混合精度在端侧是否有成熟方案?
**如何判断模型OP是symmetric 还是asymmetric minmax?它们之间的误差范围是多少?
Neuron SDK常见问题(包括Tools和Neuron RuntimeAPI)
** TFlite编译dla时,报错不支持float32,如何进一步处理?
**使用neuron runtime API(V1,V2)进行模型推理时的内存分配&释放过程
**如何解读使用ncc-tflite编译模型时下“—show-memory-summary”参数后印出的数据中Static和Temporary的数据?
**使用NP5.x版本的NeuronSDK尝试将Tflite编译为dla档案时遇到“MDLA:data type mismatch for input and filter”问题
**NeuronRuntimeV2_create API参数解析
**NeuronRuntime_setInput/output的内存操作逻辑和和使用规则是什么?
NeuroPilot 离线工具(Quantization, Converter etc.)常见问题
**在将模型从Caffe/PyTorch转换为TFLite格式后,因为输入从NCHW更改为NHWC,需要额外添加一个转置节点,有可能避免增加这个额外的操作吗?
【问题描述】1
在将模型从Caffe/PyTorch转换为TFLite格式后,因为输入从NCHW更改为NHWC,需要额外添加一个转置节点,有可能避免增加这个额外的操作吗?1

【解答】
为了避免添加这个额外的节点,您可以向Caffe/PyTorch模型添加一个wrapper。
对于Caffe,参见NeuroPilot SDK文档:Developer Tools > Model Development > Converter > Converter Tool Examples > Converting from Caffe > Floating-point (with NHWC Input/Output Tensors)。
对于PyTorch,参见NeuroPilot SDK文档:Developer Tools > Model Development > Converter > Converter Tool Examples > Converting from PyTorch > Floating-point (with NHWC Input/Output Tensors)。
html lang="zh-CN">
<head>
<meta charset="utf-8">
<link rel="canonical" href="https://blog.youkuaiyun.com/csdn_te_Blog_002/article/details/144774831"/>
<meta http-equiv="content-type" content="text/html; charset=utf-8">
<meta name="renderer" content="webkit"/>
<meta name="force-rendering" content="webkit"/>
<meta http-equiv="X-UA-Compatible" content="IE=edge,chrome=1"/>
<meta name="viewport" content="width=device-width, initial-scale=1.0, minimum-scale=1.0, maximum-scale=1.0, user-scalable=no">
<meta name="report" content='{"pid":"blog","spm":"1001.2101"}'>
<meta name="referrer" content="always">
<meta http-equiv="Cache-Control" content="no-siteapp" /><link rel="alternate" media="handheld" href="#" />
<meta name="shenma-site-verification" content="5a59773ab8077d4a62bf469ab966a63b_1497598848">
<meta name="applicable-device" content="pc">
<link href="https://g.csdnimg.cn/static/logo/favicon32.ico" rel="shortcut icon" type="image/x-icon" />
<title>NeuroPilot 离线工具-优快云博客</title>
<script type="application/ld+json">
{"datePublished":"2024-12-27T17:57:35+08:00","@type":"Article","author":[{"@type":"Person","name":"csdn_te_blog_002","url":"https://blog.youkuaiyun.com/csdn_te_blog_002"}],"dateModified":"2025-09-26T10:13:21+08:00","@context":"https://schema.org","headline":"NeuroPilot 离线工具"}
</script>
<script>
(function(){
var el = document.createElement("script");
el.src = "https://s3a.pstatp.com/toutiao/push.js?1abfa13dfe74d72d41d83c86d240de427e7cac50c51ead53b2e79d40c7952a23ed7716d05b4a0f683a653eab3e214672511de2457e74e99286eb2c33f4428830";
el.id = "ttzz";
var s = document.getElementsByTagName("script")[0];
s.parentNode.insertBefore(el, s);
})(window)
</script>
<meta name="keywords" content="neuropilot">
<meta name="csdn-baidu-search" content='{"autorun":true,"install":true,"keyword":"neuropilot"}'>
<meta name="description" content="文章浏览阅读980次,点赞13次,收藏11次。(2)如果客户选择的集成方式为offline compile method(dla集成),则将上述一系列sub-models首先通过NeuronSDK编译为dla,然后加载到device端的应用程序(native executor binary,如neuronrt等),输入还使用最开始的那份input,最终得到每层sub-model的output*,将其与切分出来的output进行对比,找到有差异的output,最终定位问题OP(方法为对比两份output的余弦相似度)。两者分别的适宜的使用场景是什么?_neuropilot">
<link rel="stylesheet" type="text/css" href="https://csdnimg.cn/release/blogv2/dist/pc/css/changjiang_detail_enter-1183fd50c0.min.css">
<script type="application/ld+json">{"@context":"https://ziyuan.baidu.com/contexts/cambrian.jsonld","@id":"https://blog.youkuaiyun.com/csdn_te_Blog_002/article/details/144774831","appid":"1638831770136827","pubDate":"2024-12-27T17:57:35","title":"NeuroPilot 离线工具-优快云博客","upDate":"2024-12-27T17:57:35"}</script>
<script src="https://g.csdnimg.cn/lib/jquery/1.12.4/jquery.min.js" type="text/javascript"></script>
<script src="https://g.csdnimg.cn/lib/jquery-migrate/1.4.1/jquery-migrate.js" type="text/javascript"></script>
<script type="text/javascript">
var isCorporate = false;
var username = "csdn_te_blog_002";
var skinImg = "white";
var blog_address = "https://lytest03003.blog.youkuaiyun.com";
var currentUserName = "csdn_te_Blog_003";
var isOwner = false;
var loginUrl = "http://passport.youkuaiyun.com/account/login?from=https://blog.youkuaiyun.com/csdn_te_Blog_002/article/details/144774831";
var blogUrl = "https://blog.youkuaiyun.com/";
var inscodeHost = "https://inscode.youkuaiyun.com";
var appBlogDomain = "https://app-blog.youkuaiyun.com";
var avatar = "https://i-avatar.csdnimg.cn/4764c972b

最低0.47元/天 解锁文章
1161

被折叠的 条评论
为什么被折叠?



