TensorFlow搭建双向LSTM实现时间序列预测（负荷预测）

Cyril_KI

已于 2024-10-09 22:44:58 修改

阅读量4.2k

点赞数 4

分类专栏：时间序列预测 TensorFlow 文章标签： lstm tensorflow 时间序列预测负荷预测

于 2022-08-20 21:51:22 首次发布

本文链接：https://blog.youkuaiyun.com/Cyril_KI/article/details/126444433

版权

TensorFlow 双向LSTM 时间序列预测模型定义 PyTorch比较

关键词由优快云通过智能技术生成

时间序列预测同时被 2 个专栏收录

51 篇文章

订阅专栏

TensorFlow

17 篇文章

订阅专栏

I. 前言

前面几篇文章中介绍的都是单向LSTM，这篇文章讲一下双向LSTM。

系列文章：

II. 原理

如果想利用TensorFlow来实现双向LSTM，则需要用到tf.keras.layers.Bidirectional，关于Bidirectional，官方API描述如下：

tf.keras.layers.Bidirectional(
    layer, merge_mode='concat', weights=None, backward_layer=None,
    **kwargs
)

在这里插入图片描述
其中：

layer：可以为LSTM或者GRU。
merge_mode：如PyTorch搭建双向LSTM实现时间序列预测（负荷预测）中描述，双向LSTM最终会得到两个方向上的输出，输出维度为(batch_size, seq_len, 2 * hidden_size)，我们可以对两个方向上的输出按照多种方式进行组合，但PyTorch需要手动拆分然后实现组合。在TensorFlow中，我们可以通过Bidirectional的merge_model参数定义组合方式，具体有(sum, mul, concat, ave, None)五种方式，默认为concat，也就是将两个输出拼接在一起。如果为None，则不进行组合，而是将两个方向上的输出以列表形式返回，这样可以让使用者自定义其他组合方式。
backward_layer：用于处理向后输入处理的实例。如果未提供，则作为参数传递的图层实例将用于自动生成后向图层。

III. 模型定义

双向LSTM定义如下：

class BiLSTM(keras.Model):
    def __init__(self, args):
        super(BiLSTM, self).__init__()
        self.lstm = Sequential()
        for i in range(args.num_layers):
            self.lstm.add(
                Bidirectional(layers.LSTM(units=args.hidden_size, input_shape=(args.seq_len, args.input_size),
                                          activation='tanh', return_sequences=True)))
        self.fc1 = layers.Dense(64, activation='relu')
        self.fc2 = layers.Dense(args.output_size)

    def call(self, data, training=None, mask=None):
        x = self.lstm(data)
        x = self.fc1(x)
        x = self.fc2(x)

        return x[:, -1:, :]

双向LSTM定义语句：

Bidirectional(layers.LSTM(units=args.hidden_size, input_shape=(args.seq_len, args.input_size),
                          activation='tanh', return_sequences=True)))