Keras 时序模型

最新推荐文章于 2024-02-01 16:29:26 发布

Thinking_boy1992

最新推荐文章于 2024-02-01 16:29:26 发布

阅读量8k

点赞数 4

CC 4.0 BY-SA版权

分类专栏： Keras库 Keras 文章标签： LSTM Keras RNN

本文链接：https://blog.youkuaiyun.com/Thinking_boy1992/article/details/53207177

本文介绍了如何在Keras中构建时序模型，包括LSTM和RNN层的使用，以及如何指定输入形状、合并层和编译模型。通过示例展示了训练过程，如LSTM的文本生成和序列分类。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

本文翻译自
时序模型就是层次的线性叠加。
你能够通过向构造函数传递层实例的列表构建序列模型：

from keras.models import Sequential
from keras.layers import Dense, Activation
model = Sequential([
    Dense(32, input_dim=784),
    Activation('relu'),
    Dense(10),
    Activation('softmax'),
])

也可以简单的通过.add()添加层：

model = Sequential()
model.add(Dense(32, input_dim=784))
model.add(Activation('relu'))

指定输入形状（Specifying the input shape）
模型需要知道输入的形状。正因为如此，序列模型的第一层（只有第一层，下面的层次能自动进行形状推测）需要接受有关输入形状的信息。这里有几种方式来做这些：
通过第一层的一个参数input_shape进行传递。这是一个形状元组（整数元组或None条目，None指示任何可能的整数都可以）。在Input_shape ,批维度没有被包含；
传进batch_input_shape 参数，它包含了批维度；制定一个固定的批大小是有用的；
一些2D层，例如Dense，支持通过参数input_dim指定输入的形状，一些3D时间层支持参数input_dim或input_length
正因为如此，以下三个代码片段完全相同：

model = Sequential()
model.add(Dense(32, input_shape=(784,)))

model = Sequential()
model.add(Dense(32, batch_input_shape=(None, 784)))
# note that batch dimension is "None" here,
# so the model will be able to process batches of any size.

model = Sequential()
model.add(Dense(32, input_dim=784))

下面的三个片段同样相同：

model = Sequential()
model.add(LSTM(32, input_shape=(10, 64)))

model = Sequential()
model.add(LSTM(32, batch_input_shape=(None, 10, 64)))

model = Sequential()
model.add(LSTM(32, input_length=10, input_dim=64))

合并层（The Merge layer）
多个Sequential 实例能够通过Merge层合并为单个的输出；
输出能够在一个新的Sequential模型中像第一层那样进行添加；例如，下面的模型把两个分离的输入分支进行整合；

from keras.layers import Merge

left_branch = Sequential()
left_branch.add(Dense(32, input_dim=784))

right_branch = Sequential()
right_branch.add(Dense(32, input_dim=784))

merged = Merge([left_branch, right_branch], mode='concat')

final_model = Sequential()
final_model.add(merged)
final_model.add(Dense(10, activation='softmax'))

如此的一个两分支模型能够被训练：

final_model.compile(optimizer='rmsprop', loss='categorical_crossentropy')
final_model.fit([input_data_1, input_data_2], targets)  # we pass one data array per model input

合并层支持一些预定义的模式：
sum(default):元素的总和；
concat：向量级联，可以通过concat_axis设置级联轴；
mul：元素相乘
ave:向量平均；
dot:点积；能够通过dot_axes参数指定轴进行减少；
cos:向量间的余弦距离；

还可以通过允许任意变换的函数作为mode参数进行传递；

merged = Merge([left_branch, right_branch], mode=lambda x: x[0] - x[1])

现在已经能够使用Keras来定义任何的模型。对于不能通过Sequential和Merge进行表示的复杂模型。能够使用 the functional API
编译（Compilation）
在训练一个模型之前，需要配置它的学习过程，这是通过compile函数来做的。它接受三个参数：
一个优化器：它可以是现有的优化器的字符串标识符（例如 rmsprop or adagrad），也可以是优化器类的实例；有关优化器
一个损失函数：这是模型想要最小化的目标函数，它可以是一个现存的损失函数的字符串标识符(such as categorical_crossentropy or mse)，也可以是一个目标函数；有关损失函数
一个度量值列表：对于任何的聚类问题你将要把它设置为metrics=[‘accuracy’]，一个度量值可以是一个已存在的度量的字符串标识符或者是一个自定义度量函数。有关度量值

# for a multi-class classification problem
model.compile(optimizer='rmsprop',
              loss='categorical_crossentropy',
              metrics=['accuracy'])

# for a binary classification problem
model.compile(optimizer='rmsprop',
              loss='binary_crossentropy',
              metrics=['accuracy'])

# for a mean squared error regression problem
model.compile(optimizer='rmsprop',
              loss='mse')

# for custom metrics
import keras.backend as K

def mean_pred(y_true, y_pred):
    return K.mean(y_pred)

def false_rates(y_true, y_pred):
    false_neg = ...
    false_pos = ...
    return {
        'false_ne