Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words

本文介绍了一种新颖的无监督学习方法,用于学习人类行动类别。通过提取空间-时间兴趣点将视频序列表示为时空词集合,算法自动学习这些时空词的概率分布以及对应于人类行动类别的中间主题。该方法不仅能够进行不同行动的分类,还能同时在新颖且复杂的视频序列中定位多种行动。

摘要生成于 C知道 ,由 DeepSeek-R1 满血版支持, 前往体验 >

 

Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words

Juan Carlos Niebles1,2, Hongcheng Wang1, Li Fei-Fei1

1University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
2Universidad del Norte, Barranquilla, Colombia

Summary

To automatically classify or localize different actions in video sequences is very useful for a variety of tasks, such as video surveillance, object-level video summarization, video indexing, digital library organization, etc. However, it remains a challenging task for computers to achieve robust action recognition due to cluttered background, camera motion, occlusion, and geometric and photometric variances of objects.

We present a  novel unsupervised learning method for learning human action categories. A video sequence is represented as a collection of spatial-temporal words by extracting space-time interest points. The algorithm learns the probability distributions of the spatial-temporal words and intermediate topics corresponding to human action categories automatically using a probabilistic Latent Semantic Analysis (pLSA) model. The learned model is then used for human action categorization and localization in a novel video, by maximizing the posterior of action category (topic) distributions. The contributions of this work are as follows:

  • Unsupervised learning of actions using 'video words' representation: We deploy a pLSA model with 'bag of video words' representation for video analysis;
  • Multiple action localization and categorization: Our approach is not only able to classify different actions, but also to localize different actions simultaneously in a novel and complex video sequence.

Our Algorithm

Resources

  • Juan Carlos Niebles, Hongcheng Wang and Li Fei-Fei, Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words, Accepted for Oral Presentation At British Machine Vision Conference (BMVC), Edinburgh, 2006.
    Full Text: PDF
  • Juan Carlos Niebles, Hongcheng Wang and Li Fei-Fei, Unsupervised Learning of Human Action Categories, in Video Proceedings, International Conference on Computer Vision and Pattern Recognition (VPCVPR), New York, 2006.
    Full Text: PDF (One Page)
    Video Demo: AVI
  • There is also a poster about this work, presented at IMA Workshop: Visual Learning and Recognition, Minneapolis, 2006.

Selected References

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值