Video tagging systems based on DNNs

本文探讨了手机中大规模视频的利用方式,提出通过视频标签系统实现内容的快速定位及对象信息检索。介绍系统的两步实施流程:关键帧定位与对象分类,并分析了其商业价值与现有竞争对手。

Need:

  1. With the ever-growth large-scale video in the mobile phone, so what will everyone get from these video? There are many videos contain something very interesting like a short comedy video. So if someone find something interesting in the video and want know more about it, they may not search it in the internet and find the information after watching this video due the poor memory. So if the advertiser have put some advertisements in the video ahead of time, it will be more convenient for the user to get some information. That’s very useful for the advertisers and the users.
  2. There are many videos in users’ phone. Maybe most of them are meaningful time mark. So someone want to look for some useful tools to tagging the meaningful object or want to know the object information. Then our video tagging systems will be very efficient for this work.

 

Approach:

  1. The video tagging project can be divided into two steps. The first one is the key frame localization. The second one is the object classification or object detection.
  2. The key frame localization can be realized by some conventional method like the HOG features split or some other method. This is a litter challenge because there is no very efficient way to get the really accuracy key frame. And I think it is a program optimization problem.
  3. The object classification can be realized by the deep convolutional neural network classifier or some other deep learning state-of-the-arts method. The problem is the labels may be not enough. So it can be a research problem.

 

Benefit:

  1. Everyone can be convenient to get some merchandise information by the tagged video which is processed by the mobile end application.
  2. Some people will summarize the meaningful moments and find some meaningful object.

 

Competitors:

There a video tagging system which has been released in the internet after my survey. The Website name is “Clarifai”. They can tag the video and get the object temporal information. And the classification accuracy is very high. So it is our main competitor.

 

 

 

                                                                                                         10/18/2015

  Fuchen Long

转载于:https://www.cnblogs.com/aidoer/p/4892399.html

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值