Video Analysis 入门

这篇博客总结了视频分析领域的主要任务,包括Activity Net的五个任务:未修剪视频分类、修剪动作识别、时间动作提案、动作定位和视频事件密集描述。还介绍了常用的数据集如KTH、UCF101,以及视频理解任务如动作分类和时序检测。文章探讨了时间维度的挑战,提出了two stream CNN、C3D等技术,并提到了基于图卷积网络的ST-GCN方法。

摘要生成于 C知道 ,由 DeepSeek-R1 满血版支持, 前往体验 >

主要基于网上一些质量比较高的分享进行总结,以对视频分析领域的常见问题和方法有一个大致的理解。

Activity Net 2017的五个task:
Task 1: Untrimmed Video Classification (ActivityNet)
videos can contain more than one activity, and typically large time lapses of the video are not related with any activity of interest.

Task 2: Trimmed Action Recognition (Kinetics) [New]
videos contain a single activity, and all the clips have a standard duration of ten seconds.

Task 3: Temporal Action Proposals (ActivityNet) [New]
The goal is to produce a set of candidate temporal segments that are likely to contain a human action.

Task 4: Temporal Action Localization (ActivityNet)
This task is intended to evaluate the ability of algorithms to temporally localize activities in untrimmed video sequences. Here, videos can contain more than one activity instance, and mutiple activity categories can appear in the video.

Task 5: Dense-Captioning Events in Videos (Activity

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值