【论文笔记】CVPR2020 Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition

最新推荐文章于 2022-12-15 15:36:19 发布

原创

最新推荐文章于 2022-12-15 15:36:19 发布 · 1.6k 阅读

8 ·

CC 4.0 BY-SA版权

文章标签：

#人工智能 #计算机视觉 #深度学习

该论文提出了一种解决骨架动作识别中长距离关系学习难题的方法，包括分离的多尺度聚合方案和统一的空间-时间图卷积模块（G3D）。通过创新的图卷积计算方式，有效处理了节点间距离对权重的影响，并增强了时空关系的学习。

使用GCN进行skeleton-based action recognition
在这里插入图片描述

Contribution

提出了两个设计：

a disentangled multi-scale aggregation scheme
a unified spatial-temporal graph convolutional module (G3D)

分别解决了两个问题：

unbiased weight problem: edge weights will be biased towards closer nodes against further nodes，对于距离较远的两个节点，他们之间的feature share的效果比较轻微，由于距离太远，weight很难传过去。学习long-range relationship比较困难。例如：scale = 7，真正到距离为7的节点的几率是很小的（这里没有完全理解）。（原始的multi-scale GCN见paper Actional-Structural Graph Convolutional Networks for Skeleton-based Action Recognition ）
factorised spatial-temporal relationship learning: A typical approach is to extract spatial relationships at each time step and then model temporal dynamics. 这样，在spacetime的三维空间里不存在直接的信息流，只能是先space，再time这样间接的提取关系。

最低0.47元/天解锁文章

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

gitkitten

关注关注

1
点赞
踩
8

收藏

觉得还不错? 一键收藏
4
评论
分享

复制链接

分享到 QQ

分享到新浪微博

扫一扫
举报

举报

专栏目录

Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition

qq_36852840的博客

05-27

2618

上周看了一下Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition（2020CVPR）这篇paper，然后分享一下我对于这篇论文的一些理解吧，当然还有的部分没有看太明白，希望以后大佬可以写一个对于该篇paper的解读。把论文链接和code链接都丢下面。 https://arxiv.org/pdf/2003.14111.pdfarxiv.org https://github.com/ke.

Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Rec.pptx

06-05

Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition 分离与统一的图卷积用于基于骨架的动作识别 CVPR 2020

4 条评论您还未登录，请先登录后发表或查看评论

4 条评论

qq_41618518 2021.02.22
没有完全理解

Dorics 2020.07.16
因为ntu rgb这个数据集是2019年出的，自然而然没什么文章在上面跑过实验。作者可能嫌自己做实验太麻烦了吧，刚好2s-AGCN都是2019年的SOTA，就用它来比对了
- gitkitten回复Dorics 2020.07.17
  [reply]Dorics[/reply]谢谢大佬指导！抱拳.jpg
- Dorics回复Dorics 2020.07.16
  [reply]Dorics[/reply]准确说是ntu rgbd120 这个数据集是2019出的 ntu rgbd60是2016年有的

[论文阅读] Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition

BENULL的博客

08-06

4120

Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition

【论文笔记】MS-G3D：Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition

LoveKKarlie_的博客

12-11

2609

提出一种分解(disentangle)多尺度图卷积的简单方法；提出统一的时空图卷积算子G3D；融合两种方法——MS-G3D，强大的特征提取器，具有跨时空的多尺度感受野。

论文笔记--Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition

Lyndsey的博客

11-07

1101

Hello, 今天是论文笔记计划的第二天啦。今天为大家介绍下这篇“重磅级”论文，目前是该方向SOTA的论文，并且从处理上来看，与之前大家不断改进的ST-GCN的那些论文来看，引入了一些新的视角，还是值得我们学习的。（细心认真的读者借鉴我的论文笔记模版，摸索出一个属于你们最适宜的论文笔记模版。当我阅读一定量之后，我相信我的笔记模版侧重点也会开始发生变化，因为最适合自己的才是最好的。）其实在慢慢做论文的过程中，就有一点发现，你想用什么或者改什么的时候，那些“点”往往不是从普通的论文笔记找到的，而是在经过自

阅读Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition(CVPR2020)

qq_33331451的博客

07-19

1053

Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition(CVPR2020) paper:https://arxiv.org/abs/2003.14111 code:github.com/kenziyuliu/ms-g3d 基于骨架动作识别的图卷积分解与统一AbstractIntroduction Abstract 基于骨架的动作识别算法中，时空图表示已广泛的应用于人体行为动力学的建模。为了.

论文阅读：（MS-G3D）Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition

qq_36627158的博客

05-11

1000

目录 Summary Details 1、多尺度聚合 2、时空图卷积 G3D 算子 3、两者结合（MS-G3D） 4、整体网络框架论文名称：Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition（2020 CVPR）下载地址：https://openaccess.thecvf.com/content_CVPR_2020/papers/Liu_Disentanglin..

Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition——MS-G3D论文解读

qq_44024204的博客

04-08

1311

目录MS-G3D_2020Problem in past researchSolutionSolution for biased weight problemCross-Spacetime Skip ConnectionsMulti-Scale G3DModel ArchitectureExperiment MS-G3D_2020 Author：Ziyu Liu, Hongwen Zhang, Zhenghao Chen, Zhiyong Wang, Wanli Ouyang Paper:https//

MS-G3D代码

LoveKKarlie_的博客

05-23

1241

Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition CVPR2020 论文地址代码地址博客链接 MS-TCN： ms-tcn.py class MultiScale_TemporalConv：

多尺度动态图卷积神经网络----Multi-scale Dynamic Graph Convolutional Network for Hyperspectral Image Classificati

热门推荐

你这个代码我看不懂的博客

04-06

1万+

一、摘要卷积神经网络（CNN）在表示高光谱图像和实现高光谱图像分类方面表现出令人印象深刻的能力。然而，传统的CNN模型只能对固定大小和权重的规则正方形图像区域进行卷积，因此不能普遍适用于具有不同对象分布和几何外观的不同局部区域。因此，它们的分类性能仍有待提高，尤其是在类边界方面。为了缓解这一缺点，我们考虑采用最近提出的图卷积网络（GCN）进行高光谱图像分类，因为它可以对任意结构的非欧几里德数据进行卷积，并且适用于由图拓扑信息表示的不规则图像区域。与常用的GCN模型工作在固定图上不同，我们使图能够随着图卷积

【论文笔记】MGU-Net

小马各的博客

08-13

2326

【论文笔记】Multi-Scale GCN-Assisted Two-Stage Network for Joint Segmentation of Retinal Layers and Disc in Peripapillary OCT Images

【论文笔记】CVPR2020 Skeleton-Based Action Recognition with Shift Graph Convolutional Network

gitcat的博客

06-29

1913

用图卷积做action recogntion Contribution 提出了shift-GCN（spatial shift graph operations & temporal shift graph operations）that exceeds the state-of-the-art methods with more than 10× less computational cost，主要解决了过去工作的两个问题 heavy computational complexity of G

[paper]Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition

vinciblack的博客

03-06

5901

一.基本思想提出了一种新的 ST-GCN，即时空图卷积网络模型，用于解决基于人体骨架关键点的人类动作识别问题。动作是基于时间关系以及不同部位关键点之间的联系的。考虑到以上因素提出Spatial Temporal的卷积网络。 1.空间上：将骨架之间的关键点作为空间关系的输入（存在着不同点之间邻域大小不定的困难，考虑到用基于Graph的CNN网络模型）； 2.时间上：使用视频数据，利用图片之间...

（DGNN读书笔记）Skeleton-Based Action Recognition with Directed Graph Neural Networks-DGNN读书笔记

qq_38959366的博客

05-28

4745

Skeleton-Based Action Recognition with Directed Graph Neural Networks-DGNN读书笔记Skeleton-Based Action Recognition with Directed Graph Neural Networks1.摘要2. 引言3. 理论3.1骨骼信息3.2图构造3.3双向图神经网络（DGNN）3.3.1双向图神经网络 block Skeleton-Based Action Recognition with Directed

读书笔记5：Deep Progressive Reinforcement Learning for Skeleton-based Action Recognition

b224618的博客

07-21

5465

这篇文章开篇就指出，我们的模型是要从人体动作的序列中选取出最informative的那些帧，而丢弃掉用处不大的部分。但是由于对于不同的视频序列，挑出最有代表性的帧的方法是不同的，因此，本文提出用深度增强学习来将帧的选择模拟为一个不断进步的progressive process。这篇文章处理的问题是skeleton based action recognition，提出的模型的示意图如下： ...

Skeleton-Based Action Recognition with Multi-Stream Adaptive Graph Convolutional Networks（一）论文阅读

qq_44024204的博客

04-04

2067

目录MS-AAGCN_2019AbstractIntroductionRelated WorkGRAPH CONVOLUTIONAL NETWORKSA.Graph constructionB. Graph convolutionC. ImplementationMULTI-STREAM ATTENTION-ENHANCED ADAPTIVEA. Adaptive graph convolutional layerThe first sub-graph (BkB_{k}Bk)The second sub-

论文阅读17：Skeleton-Based Action Recognition WithFocusing-Diffusion Graph Convolutional Networks

gaocui883的博客

12-15

321

论文阅读17：Skeleton-Based Action Recognition With Focusing-Diffusion Graph Convolutional Networks

CVPR2020 3D目标检测论文综述：LiDAR-based方法与时空注意力

在2020年的计算机视觉与模式识别（Computer Vision and Pattern Recognition, CVPR）会议上，目标检测领域呈现出显著的发展，特别是3D目标检测技术的研究取得了重大突破。共有63篇论文集中在这一主题上，其中一项...