Beyond Caption To Narrative: Video Captioning With Multiple Sentences

本文提出了一种新的视频描述生成方法,通过时间分割视频、定位动作、从多个帧生成多个句子,并利用自然语言处理技术连接这些句子以形成类似故事的描述。这种方法能够生成内容更丰富的视频描述。

摘要生成于 C知道 ,由 DeepSeek-R1 满血版支持, 前往体验 >

Beyond Caption To Narrative: Video Captioning With Multiple Sentences

Recent advances in image captioning task have led to increasing interests in video captioning task. However, most works on video captioning are focused on generating single input of aggregated features, which hardly deviates from image captioning process and does not fully take advantage of dynamic contents present in videos. We attempt to generate video captions that convey richer contents by temporally segmenting the video with action localization, generating multiple captions from multiple frames, and connecting them with natural language processing techniques, in order to generate a story-like caption. We show that our proposed method can generate captions that are richer in contents and can compete with state-of-the-art method without explicitly using video-level features as input.
Comments: accepted to ICIP 2016
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:1605.05440 [cs.CV]
  (or arXiv:1605.05440v1 [cs.CV] for this version)

Submission history

From: Andrew Shin [ view email
[v1] Wed, 18 May 2016 05:00:12 GMT (1186kb,D)
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值