Effective Approaches to Attention-based Neural Machine Translation Encoder-Decoder+注意力机制: p(yi∣y1,...,yi−1,X)=g(yi−1,si,ci)p(y_i|y_1,...,y_{i-1},X)=g(y_{i-1},s_i,c_i)p(yi∣y1,...,yi−1,X)=g(yi−1,s