Paper title: Self-Supervised Video Representation Learning With Odd-One-Out Networks (CVPR 2017)
Authors: Basura Fernando, Hakan Bilen, Efstratios Gavves, Stephen Gould
Contributions
We propose a new self-supervised CNN pre-training technique based on a novel auxiliary task called odd-one-out learning. In this task, we sample subsequences from videos and ask the network to learn to predict the odd video subsequence. The odd video subsequence is sampled such that it has the wrong temporal order of frames, while the even ones have the correct temporal order. Our learning machine is implemented as a multi-stream convolutional neural network, which is learned end-to-end. Using odd-one-out networks, we learn temporal representations for videos that generalize to other related tasks such as action recognition.
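
As a rough illustration of the odd-one-out task described above, the following minimal sketch builds one training example from frame indices alone: several subsequences are drawn from a video, and one of them has its frames shuffled so that it becomes the "odd" element the network must identify. The function name `sample_odd_one_out` and the parameters `num_clips` and `clip_len` are hypothetical, and uniform random sampling is an assumption made for simplicity; it does not reproduce the paper's specific sampling strategies.

```python
import random


def sample_odd_one_out(num_frames, num_clips=6, clip_len=6, rng=None):
    """Build one odd-one-out example from a video with `num_frames` frames.

    Returns (clips, odd_index), where `clips` is a list of `num_clips`
    lists of frame indices. Every clip is in the correct temporal order
    except clips[odd_index], whose frames are shuffled.
    NOTE: names, defaults, and uniform sampling are illustrative assumptions.
    """
    rng = rng or random.Random()
    if clip_len < 2:
        raise ValueError("clip_len must be at least 2 to allow a wrong order")
    if num_frames < clip_len:
        raise ValueError("video is shorter than one clip")

    # Draw correctly ordered subsequences at random start positions.
    clips = []
    for _ in range(num_clips):
        start = rng.randrange(num_frames - clip_len + 1)
        clips.append(list(range(start, start + clip_len)))

    # Pick one clip to be the odd one and shuffle until its order differs.
    odd_index = rng.randrange(num_clips)
    ordered = clips[odd_index]
    shuffled = ordered[:]
    while shuffled == ordered:
        rng.shuffle(shuffled)
    clips[odd_index] = shuffled

    return clips, odd_index


if __name__ == "__main__":
    clips, odd_index = sample_odd_one_out(100, rng=random.Random(0))
    for i, clip in enumerate(clips):
        print("odd " if i == odd_index else "even", clip)
```

In this sketch, `odd_index` plays the role of the classification label: a network with one input branch per clip would be trained to predict which position holds the shuffled subsequence.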

Method
1. Model
O3N is composed of (N+1) input branches, each containing five convolutional layers and weight

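This CVPR 2017 paper introduces a method for unsupervised video feature learning with Odd-One-Out networks. The model is trained by detecting incorrect temporal ordering of frames in a video, and it ultimately produces a general-purpose video representation suited to tasks such as action recognition. The study covers three sampling strategies and three frame-encoding methods.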



