Reading notes on "Learning to Ask: Neural Question Generation for Reading Comprehension"

This paper introduces an attention-based neural question generation model for reading comprehension that does not rely on a complex NLP pipeline. The study finds that the generated questions are more natural and require reasoning to answer. The model consists of an encoder and a decoder: the encoder is a bidirectional LSTM over the input sentence, and the decoder attends over the encoder's hidden states. Experiments on Stanford's SQuAD dataset show that the model effectively generates questions closely tied to the input sentence.

@(NLP)[Natural Language Generation|LSTM|QA|Attention]

Abstract

To address automatic question generation, the authors propose an attention-based sequence learning model and study the effect of encoding sentence-level versus paragraph-level information. Unlike previous work, their model does not rely on hand-crafted rules or a sophisticated NLP pipeline. In human evaluation, the generated questions are judged more natural and harder to answer; they differ from the source text in syntax and wording, and answering them requires reasoning.

Introduction

Applications of question generation

In addition to the above applications, question generation systems can aid in the development of annotated datasets for natural language processing (NLP) research in reading comprehension and question answering. Indeed, the creation of such datasets typically requires costly manual annotation, which automatic question generation could help reduce.

Example: natural questions and their answers

(Figure 1 in the paper: example sentences with the natural questions asked about them and their answers.)

Natural question features

Vanderwende points out that learning to ask questions is an important problem in NLP research, and that a good question is more than a syntactic transformation of a declarative sentence.
Natural questions often have the following characteristics:
- A natural-sounding question often compresses the sentence on which it is based (e.g., question 3 in Figure 1).
- It uses synonyms for terms in the passage (e.g., "form" for "produce" in question 2 and "get" for "produce" in question 3).
- It refers to entities from preceding sentences or clauses (e.g., the use of "photosynthesis" in question 2).
- Other times, world knowledge is employed to produce a good question (e.g., identifying "photosynthesis" as a "life process" in question 1).

Unlike previous approaches, the model proposed in this paper is fully data-driven and uses no hand-crafted rules.

Task Definition

Goal: to generate a natural question y related to information in the input sentence x.

y can be a sequence of arbitrary length $[y_1, \ldots, y_{|y|}]$. Suppose the length of the input sentence is $M$; x can then be represented as a sequence of tokens $[x_1, \ldots, x_M]$. The QG task is defined as finding $\bar{y}$ such that:

$$\bar{y} = \arg\max_{y} P(y \mid x) \tag{1}$$
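
The argmax in Equation (1) ranges over all possible token sequences and cannot be computed exactly; in practice it is approximated greedily or with beam search. Below is a rough, hypothetical sketch of the greedy variant, not the authors' code; `step_prob`, `bos_id`, and `eos_id` are assumed names, where `step_prob(x, prefix)` returns the next-token distribution $P(y_t \mid x, y_{<t})$ as a list indexed by token id.

```python
# Minimal greedy approximation of Eq. (1); a sketch only, not the paper's decoder.
def greedy_decode(step_prob, x, bos_id, eos_id, max_len=50):
    y = [bos_id]
    for _ in range(max_len):
        probs = step_prob(x, y)                                  # P(. | x, y_<t)
        next_id = max(range(len(probs)), key=probs.__getitem__)  # most likely next token
        y.append(next_id)
        if next_id == eos_id:                                    # stop at end-of-sequence
            break
    return y[1:]  # drop the BOS marker
```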

Model

Decoder

The probability in Equation (1) is factored at the word level:

$$P(y \mid x) = \prod_{t=1}^{|y|} P(y_t \mid x, y_{<t})$$

where the probability of each $y_t$ is predicted based on all the words generated previously (i.e., $y_{<t}$) and the input sentence $x$.

Looking at the formula, the probability of the final question y is simply the product of the probabilities of its individual words, which is easy to understand.
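
As a small illustration of this factorization (not the authors' implementation), scoring a candidate question amounts to summing per-token log-probabilities; `step_prob` follows the same hypothetical interface as in the earlier sketch.

```python
import math

# Sketch only: log P(y | x) = sum_t log P(y_t | x, y_<t).
# `step_prob(x, prefix)` is a hypothetical callable returning the next-token
# probability distribution given the sentence x and the prefix y_<t.
def sequence_log_prob(step_prob, x, y):
    total = 0.0
    for t, token in enumerate(y):
        probs = step_prob(x, y[:t])      # distribution over the vocabulary
        total += math.log(probs[token])  # log P(y_t | x, y_<t)
    return total
```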

$$P(y_t \mid x, y_{<t}) = \mathrm{softmax}\big(W_s \tanh(W_t [h_t; c_t])\big)$$

where $h_t$ is the decoder LSTM hidden state at time step $t$ and $c_t$ is the attention-based context vector computed over the encoder hidden states.
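
The following PyTorch sketch shows one plausible implementation of this readout, assuming a simple bilinear attention over the encoder hidden states; the module name `AttnReadout` and the tensor shapes are my own assumptions, not the authors' released code.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class AttnReadout(nn.Module):
    """Sketch of the attention-based output layer: softmax(W_s tanh(W_t [h_t; c_t]))."""
    def __init__(self, d, vocab_size):
        super().__init__()
        self.W_b = nn.Linear(d, d, bias=False)       # bilinear attention weights
        self.W_t = nn.Linear(2 * d, d, bias=False)   # combines [h_t; c_t]
        self.W_s = nn.Linear(d, vocab_size, bias=False)

    def forward(self, h_t, enc):
        # h_t: decoder state (batch, d); enc: encoder hidden states (batch, src_len, d)
        scores = torch.bmm(enc, self.W_b(h_t).unsqueeze(2)).squeeze(2)  # (batch, src_len)
        alpha = F.softmax(scores, dim=-1)                               # attention weights
        c_t = torch.bmm(alpha.unsqueeze(1), enc).squeeze(1)             # context vector (batch, d)
        logits = self.W_s(torch.tanh(self.W_t(torch.cat([h_t, c_t], dim=-1))))
        return F.softmax(logits, dim=-1)                                # P(y_t | x, y_<t)
```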