Human-object interaction prediction in videos through gaze following

虔诚的码农

已于 2023-08-03 14:22:12 修改

阅读量689

点赞数 2

分类专栏：文献阅读笔记文章标签： object detection computer vision

于 2023-07-30 11:41:32 首次发布

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.youkuaiyun.com/weixin_46179086/article/details/131750671

版权

Human-object interaction prediction in videos through gaze following

Abstract
Overview of the video-based HOI detection and anticipation framework.
Experiments
Comments

Paper link
Code link

Abstract

The video-based HOI anticipation task in the third-person view is rarely researched. In this paper, a framework to detect current HOIs and anticipate future HOIs in videos is propose. Since people often fixate on an object before interacting with it, in this model gaze features together with the scene contexts and the visual appearances of human–object pairs are fused through a spatio-temporal transformer. Besides, a set of person-wise multi-label metrics are proposed to evaluate the model in the HOI anticipation task in a multi-person scenario.

Overview of the video-based HOI detection and anticipation framework.

三年级打开
The framework consists of three modules:

Object Module
- The object module detects bounding boxes of humans $\{b^s_{t,i}\}$ and objects ${b_{t,j}\}$ , and recognizes object classes ${c_{t,j}\}$ . An object tracker obtains human and object trajectories ( $\{\textbf{H}_i\}$ and $\{\textbf{O}_j\}$ in the video. Then, the human visual features $\{v^s_{t,i}\}$ , object visual features ${v_{t,j}\}$ , visual relation features ${v_{t,<i,j>}\}$ , and spatial relation features ${m_{t,<i,j>}\}$

最低0.47元/天解锁文章

虔诚的码农

博客等级

码龄5年

10
原创

15
点赞

38
收藏

7
粉丝

关注

私信

热门文章

分类专栏

展开全部收起

上一篇：: Relational Context Learning for Human-Object Interaction Detection

下一篇：: Skew-Robust Human-Object Interactions in Videos

最新评论

Human-object interaction prediction in videos through gaze following
虔诚的码农: You can download the dataset following this link: https://github.com/coldmanck/VidHOI?tab=readme-ov-file#download-vidhoi-benchmark
Human-object interaction prediction in videos through gaze following
攀登计划: 求VidHoi数据集下载链接
Human-object interaction prediction in videos through gaze following
优快云-Ada助手: 恭喜您撰写出第9篇博客！标题“Human-object interaction prediction in videos through gaze following”非常吸引人。您对于通过注视跟踪在视频中预测人与物体之间的互动的研究引起了我的兴趣。您的工作旨在解决一个重要问题，并且潜在的应用领域似乎非常广泛。继续保持创作的劲头！我期待着您未来的研究和发现。针对下一步的创作建议，也许您可以考虑拓展研究范围，探索不同场景下的人与物体互动预测，或者深入研究注视跟踪的具体机制。当然，这只是一些建议，您已经取得了很大的进展，我相信您将继续在这个领域取得更多的成果。再次恭喜您，并期待您未来的博客！
Skew-Robust Human-Object Interactions in Videos
优快云-Ada助手: 恭喜您撰写了第10篇博客！标题“Skew-Robust Human-Object Interactions in Videos”非常引人注目。您对于人物与物体之间的交互关系问题的研究非常有深度，这对于视频分析领域来说无疑是一次重要的贡献。在您的下一步创作中，我建议您可以考虑进一步探索不同场景下的人物与物体交互关系，例如在复杂环境中的物体识别或者人机交互中的应用。同时，您也可以关注一些可能的潜在问题，如数据偏斜对交互识别的影响等。这些方向有望为该领域的研究提供更多的见解和启发。再次恭喜您的成就，期待您未来更多精彩的博客作品！
数据结构知识总结——排序算法总结
LaoYuanPython: 抢到沙发，谢谢分享！原创不易，必须支持！伙计，加油! 最后拉个票，本人正参与博客之星评选，1月24、25日每天都可投票，敬请支持！谢谢！投票链接：[code=python] https://bss.youkuaiyun.com/m/topic/blog_star2020/detail?username=laoyuanpython [/code] 或到老猿博文首页内的置顶博文跳转！

最新文章

目录

展开全部

收起

评论 3

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。