MovieQA相关文章及代码链接

最新推荐文章于 2023-03-16 11:35:11 发布

untitled713

最新推荐文章于 2023-03-16 11:35:11 发布

阅读量771

点赞数

文章标签：机器学习深度学习

本文链接：https://blog.youkuaiyun.com/untitled_/article/details/106045904

版权

本文汇总了近年来关于MovieQA的研究论文及其代码实现，包括使用深度学习模型进行电影故事理解的多种方法，如记忆网络、注意力机制等。通过分析和实验，这些方法展示了在理解电影内容和回答相关问题上的进步，并在MovieQA数据集上取得了最佳性能。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

下面这些是我近半年来在arXiv上找的关于MovieQA的论文以及相关代码，如果有其他有关这方面的文章欢迎大家补充~
1、MovieQA: Understanding Stories in Movies through Question-Answering CVPR2016
Abstract：We introduce the MovieQA dataset which aims to evaluate automatic story comprehension from both video and text. The dataset consists of 14,944 questions about 408 movies with high semantic diversity. The questions range from simpler “Who” did “What” to “Whom”, to “Why” and “How” certain events occurred. Each question comes with a set of five possible answers; a correct one and four deceiving answers provided by human annotators. Our dataset is unique in that it contains multiple sources of information – video clips, plots, subtitles, scripts, and DVS. We analyze our data through various statistics and methods. We further extend existing QA techniques to show that question-answering with such open-ended semantics is hard. We make this data set public along with an evaluation benchmark to encourage inspiring work in this challenging domain.
CODE：MovieQA_CVPR2016
2、A Read-Write Memory Network for Movie Story Understanding ICCV2017
Abstract：We propose a novel memory network model named Read-Write Memory Network (RWMN) to perform question and answering tasks for large-scale, multimodal movie story understanding. The key focus of our RWMN model is to design the read network and the write network that consist of multiple convolutional layers, which enable memory read and write operations to have high capacity and flexibility. While existing memory-augmented network models treat each memory slot as an independent block, our use of multi-layered CNNs allows the model to read and write sequential memory cells as chunks, which is more reasonable to represent a sequential story because adjacent mem

最低0.47元/天解锁文章