栗子研 - CSDN Blog

Original [Reading Notes] Large Language Models Know What is Key Visual Entity: An LLM-assisted Multimodal Retrieval for

Title: Large Language Models Know What is Key Visual Entity: An LLM-assisted Multimodal Retrieval for VQA. Venue: EMNLP 2024 (CCF-B conference). Authors: Pu Jian (USTC), Donglei Yu (USTC), Jiajun Zhang (USTC)

2025-01-08 12:04:35 969

Original [Reading Notes] Improving Zero-shot Visual Question Answering via Large Language Models with Reasoning Question

Title: Improving Zero-shot Visual Question Answering via Large Language Models with Reasoning Question Prompts. Venue: ACM MM (ACM International Conference on Multimedia) 2023 (CCF-A conference)

2024-12-22 22:20:43 962

Original [Reading Notes] Prompting large language model with context and pre-answer for knowledge-based VQA

Title: Prompting large language model with context and pre-answer for knowledge-based VQA. Journal: PR 2024 (Pattern Recognition) (CCF-B). Authors: Zhongjian Hu, Peng Yang, Yuanshuang Jiang, Zijian Bai (Southeast University)

2024-12-22 12:02:45 687

Original [Reading Notes] Zero-shot Visual Question Answering using Knowledge Graph

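Title: Zero-shot Visual Question Answering using Knowledge Graph. Venue: ISWC 2021 (B-class conference). Authors: Zhuo Chen (Zhejiang University), Jiaoyan Chen (University of Oxford), Yuxia Geng (Zhejiang University), Jeff Z. Pan (University of Edinburgh), Zonggang Yuan (Huawei), Huajun Chen (Zhejiang University)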

2024-11-04 20:24:48 616
