PointNet网络结构详细解析

最新推荐文章于 2024-01-28 19:02:09 发布

原创

最新推荐文章于 2024-01-28 19:02:09 发布 · 8.4k 阅读

CC 4.0 BY-SA版权

文章标签：

PointNet通过学习输入点云的关键点来概括其特征，保持数据的原始特性，避免体素化造成的体积增大和信息损失。网络结构包括input transform和feature transform，使用对称函数确保输入顺序不变性。它在分类和分割任务中表现出色，同时T-net用于空间变换，增强网络的鲁棒性。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

一、重要知识点

Transforming point clouds data to regular 3D voxel grids or collections of images, however, renders(cause to be) data unnecessarily voluminous(length, vast) and introducing quantization artifacts, obscure(conceal) natural invariances of the data.
PointNet learns to summarize an input point cloud by a sparse set of key points, which roughly corresponds to the skeleton of objects.
将点云体素化会改变点云数据的原始特征，造成不必要的数据损失，并且额外增加了工作量，而 PointNet 采用了原始点云的输入方式，最大限度地保留了点云的空间特征，并在最终的测试中取得了很好的效果。
A symmetric function is invariant to the input order. For example, + and * operators are symmetric binary function.
Treat the input as a sequence to train an RNN.
相同的点云在空间中经过一定的刚性变化（旋转或平移），坐标发生变化，但希望网络都能正确的识别出物体（Special Transform Network, STN），但最终实验结果和后续论文PointNet++表示，STN并无多大作用。
基本思想：对输入点云中的每一个点学习其对应的空间编码，之后再利用所有点的特征，得到一个全局的点云特征。
第一次input transform是对空间中点云进行调整，直观上理解是旋转出一个