刚找的综述性文章:这两篇我没怎么看不知道怎么样
Visual Question Answering: Datasets,Algorithms, and Future Challenges
Visual Question Answering: A Survey ofMethods and Datasets
论文:
A multi-world approach to question answeringabout real-world scenes based on uncertain input. NIPS, 2014.
比较早的一篇文章
Ask Your Neurons: A Neural-based Approach toAnswering Questions about Images. ICCV 2015
这篇文章也比较早,方法比较基础,VQA初期采用的方法
Where To Look: Focus Regions for VisualQuestion Answering。
加入attention机制的一篇文章
Image Question Answering using ConvolutionalNeural Network with Dynamic Parame