Question Answering: IR-QA


Copyright: CC 4.0 BY-SA


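Over two months, two research engineers at Cloudera Fast Forward will build a question answering (QA) system based on information retrieval (IR). The system has two parts: a document retriever and a document reader. The retriever acts as a search engine, ranking and returning relevant documents; the reader applies NLP techniques to extract, from the candidate documents, the answer that best matches the question. They plan to experiment with several Transformer architectures (such as BERT) to improve the document reader, and to use off-the-shelf search algorithms for the retriever.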


[Repost] NLP for Question Answering: IR-QA

Over the course of the next two months, two of Cloudera Fast Forward’s Research Engineers,
Melanie Beck and Ryan Micallef, will build a QA system following the information retrieval-based
method, by creating a document retriever and document reader.

We’ll focus our efforts on exploring and experimenting with various Transformer architectures
(like BERT) for the document reader, as well as off-the-shelf search engine algorithms for the retriever.

IR-QA

Below we illustrate the workflow of a generic IR-based QA system. These systems generally
have two main components: the document retriever and the document reader.

The document retriever functions as the search engine, ranking and retrieving relevant
documents to which it has access. It supplies a set of candidate documents that could
answer the question (often with mixed results, per the Google search shown above).
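
To make the retriever's role concrete, here is a minimal sketch of ranking documents against a question. It assumes a plain TF-IDF weighting with cosine similarity, chosen only because it fits in a few lines of standard-library Python; the original post does not say which ranking algorithm is used. The function names (`tokenize`, `build_index`, `retrieve`) and the `top_k` parameter are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(documents):
    """Return per-document term counts and a smoothed IDF table."""
    doc_counts = [Counter(tokenize(doc)) for doc in documents]
    n_docs = len(documents)
    doc_freq = Counter()
    for counts in doc_counts:
        doc_freq.update(counts.keys())
    # Smoothed IDF: terms found in every document still get a positive weight.
    idf = {t: math.log((1 + n_docs) / (1 + df)) + 1.0 for t, df in doc_freq.items()}
    return doc_counts, idf


def tfidf_vector(counts, idf):
    """Weight raw term counts by IDF; terms unknown to the index are dropped."""
    return {t: c * idf[t] for t, c in counts.items() if t in idf}


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(question, documents, top_k=2):
    """Rank documents by similarity to the question; return (doc_index, score) pairs."""
    doc_counts, idf = build_index(documents)
    q_vec = tfidf_vector(Counter(tokenize(question)), idf)
    scored = [(cosine(q_vec, tfidf_vector(c, idf)), i) for i, c in enumerate(doc_counts)]
    # Highest score first; ties are broken by original document order.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [(i, score) for score, i in scored[:top_k]]


if __name__ == "__main__":
    docs = [
        "BERT is a Transformer model used for reading comprehension.",
        "BM25 is a ranking function used by search engines.",
        "The Eiffel Tower is located in Paris.",
    ]
    for doc_index, score in retrieve("Which ranking function do search engines use?", docs):
        print(doc_index, round(score, 3), docs[doc_index])
```

The returned list is the set of candidate documents handed to the reader. A production retriever would more likely delegate this step to an existing search engine than rebuild the index on every query as this sketch does.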

The document reader consists of reading comprehension algorithms built with core
NLP techniques. This component processes the candidate documents and extracts
from one of them an explicit span of text that best satisfies the query. Let’s dive
deeper into each of these components.
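
As a first look at the reader, the sketch below shows only the span-selection step. It assumes a model (for example a BERT-style reader) has already assigned each token of each candidate document a start score and an end score; the reader then picks the span whose start and end scores sum highest, subject to a maximum length. The scores in the example are made-up numbers, not model output, and the names `best_span`, `read`, and `max_answer_len` are hypothetical.

```python
def best_span(start_scores, end_scores, max_answer_len=5):
    """Return (start, end, score) for the highest-scoring span.

    A span is valid when start <= end and it covers at most
    max_answer_len tokens. Returns None if there are no tokens.
    """
    best = None
    for s, s_score in enumerate(start_scores):
        last = min(s + max_answer_len, len(end_scores))
        for e in range(s, last):
            score = s_score + end_scores[e]
            if best is None or score > best[2]:
                best = (s, e, score)
    return best


def read(candidates, max_answer_len=5):
    """Pick the best span across all candidate documents.

    candidates is a list of (tokens, start_scores, end_scores) tuples,
    one per document. Returns (answer_text, doc_index, score) or None.
    """
    best_answer = None
    for doc_index, (tokens, start_scores, end_scores) in enumerate(candidates):
        span = best_span(start_scores, end_scores, max_answer_len)
        if span is None:
            continue
        s, e, score = span
        if best_answer is None or score > best_answer[2]:
            best_answer = (" ".join(tokens[s:e + 1]), doc_index, score)
    return best_answer


if __name__ == "__main__":
    # Hand-written scores standing in for the output of a trained model.
    candidates = [
        (["The", "Eiffel", "Tower", "is", "in", "Paris"],
         [0.1, 0.2, 0.1, 0.0, 0.3, 2.5],
         [0.0, 0.1, 0.3, 0.0, 0.1, 2.8]),
        (["BERT", "is", "a", "Transformer", "model"],
         [1.0, 0.0, 0.0, 0.5, 0.1],
         [0.2, 0.0, 0.0, 0.3, 1.2]),
    ]
    print(read(candidates))
```

Comparing the best span from each document is one simple way to extract an answer "from one of them"; it assumes scores are comparable across documents, which a real system may need to calibrate.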

======================

Ref:

https://experiments.fastforwardlabs.com/
https://qa.fastforwardlabs.com/
Building a QA System with BERT on Wikipedia
https://qa.fastforwardlabs.com/pytorch/hugging%20face/wikipedia/bert/transformers/2020/05/19/Getting_Started_with_QA.html

SQuAD2.0 https://rajpurkar.github.io/SQuAD-explorer/