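Note: when using nn.CrossEntropyLoss() as the loss function, do not add a softmax to the last layer, because nn.CrossEntropyLoss() already includes that operation. If the last layer already applies log-softmax, use nn.NLLLoss() instead of nn.CrossEntropyLoss().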

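This post walks through building a feed-forward neural network in PyTorch: defining the network structure, choosing the loss function, preprocessing the data, and computing the loss. It also shows, by example, how a log-softmax output is paired with the negative log-likelihood loss.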


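from torch import nn

# Build a feed-forward network: 784 inputs -> 128 -> 64 -> 10 raw class scores (logits).
# There is deliberately no softmax after the last Linear layer.
model = nn.Sequential(nn.Linear(784, 128),
                      nn.ReLU(),
                      nn.Linear(128, 64),
                      nn.ReLU(),
                      nn.Linear(64, 10))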

# Define the loss
criterion = nn.CrossEntropyLoss()

# Get one batch of data (trainloader is assumed to be a DataLoader created earlier)
images, labels = next(iter(trainloader))
# Flatten images
images = images.view(images.shape[0], -1)

# Forward pass, get our logits
logits = model(images)
# Calculate the loss with the logits and the labels
loss = criterion(logits, labels)

print(loss)
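
To see why the extra softmax matters, here is a minimal sketch on hand-made data. The tensors `logits` and `labels` are made up for illustration (2 samples, 3 classes) and are not part of the original example. Both predictions are confidently correct, so the loss should be close to zero, yet passing softmax outputs into nn.CrossEntropyLoss() keeps the loss far from zero because softmax is effectively applied twice.

```python
import torch
from torch import nn
import torch.nn.functional as F

# Hypothetical toy data: 2 samples, 3 classes, both predicted confidently and correctly
logits = torch.tensor([[10.0, 0.0, 0.0],
                       [0.0, 10.0, 0.0]])
labels = torch.tensor([0, 1])

criterion = nn.CrossEntropyLoss()

# Correct usage: pass the raw logits
loss_logits = criterion(logits, labels)

# Mistake: apply softmax first, so CrossEntropyLoss normalises a second time
loss_double = criterion(F.softmax(logits, dim=1), labels)

print(loss_logits.item())  # close to 0
print(loss_double.item())  # about 0.55, even though the predictions are correct
```

Because softmax outputs lie between 0 and 1, the second normalisation flattens them towards a uniform distribution, so the loss cannot approach zero and the gradients are distorted.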

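It is often more convenient to build the model with a log-softmax output, using nn.LogSoftmax or F.log_softmax. The actual probabilities can then be recovered by taking the exponential, torch.exp(output). With a log-softmax output, use the negative log-likelihood loss, nn.NLLLoss. nn.CrossEntropyLoss is equivalent to applying log-softmax followed by nn.NLLLoss, which is why it expects raw logits.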
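
The relationship between the two losses can be checked with a short sketch. The random `logits` and `labels` below are stand-ins, not data from the original post; the point is only that nn.NLLLoss on log-softmax outputs matches nn.CrossEntropyLoss on the raw logits, and that torch.exp turns log-probabilities back into probabilities.

```python
import torch
from torch import nn
import torch.nn.functional as F

torch.manual_seed(0)

# Hypothetical batch: 5 samples, 10 classes
logits = torch.randn(5, 10)
labels = torch.randint(0, 10, (5,))

# Log-probabilities per sample (normalise across the class dimension)
log_probs = F.log_softmax(logits, dim=1)

# Exponentiating recovers ordinary probabilities; each row sums to 1
probs = torch.exp(log_probs)
print(probs.sum(dim=1))

# NLLLoss on log-probabilities vs. CrossEntropyLoss on raw logits
nll = nn.NLLLoss()(log_probs, labels)
ce = nn.CrossEntropyLoss()(logits, labels)
print(nll.item(), ce.item())  # the two values agree up to floating-point error
```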

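Exercise: build a model that returns log-softmax output and compute the loss with the negative log-likelihood loss. Note that for nn.LogSoftmax and F.log_softmax you need to set the dim keyword argument appropriately. dim=0 computes the softmax across the rows, so each column sums to 1, while dim=1 computes it across the columns, so each row sums to 1. Think about what you want the output to be and choose dim accordingly.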
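
One possible way to approach the exercise is sketched below. It first shows the effect of dim on a tiny tensor, then builds the same network as above with an nn.LogSoftmax(dim=1) layer appended. The random `images` and `labels` are a stand-in assumption for a real batch from trainloader (64 images of size 1x28x28, 10 classes); with real data you would take the batch from the loader instead.

```python
import torch
from torch import nn
import torch.nn.functional as F

torch.manual_seed(0)

# Effect of dim on a small 2x3 tensor
x = torch.tensor([[1.0, 2.0, 3.0],
                  [1.0, 1.0, 1.0]])
p0 = F.softmax(x, dim=0)  # normalises down the rows: each column sums to 1
p1 = F.softmax(x, dim=1)  # normalises across the columns: each row sums to 1
print(p0.sum(dim=0), p1.sum(dim=1))

# Same network as before, now ending in log-softmax.
# Each row of the output is one sample, so the class dimension is dim=1.
model = nn.Sequential(nn.Linear(784, 128),
                      nn.ReLU(),
                      nn.Linear(128, 64),
                      nn.ReLU(),
                      nn.Linear(64, 10),
                      nn.LogSoftmax(dim=1))

# Log-softmax output pairs with the negative log-likelihood loss
criterion = nn.NLLLoss()

# Stand-in batch (assumption): random tensors shaped like 28x28 grayscale images
images = torch.randn(64, 1, 28, 28)
labels = torch.randint(0, 10, (64,))

# Flatten each image to a 784-element vector
images = images.view(images.shape[0], -1)

# Forward pass returns log-probabilities, then compute the loss
log_probs = model(images)
loss = criterion(log_probs, labels)
print(loss)
```

Choosing dim=1 here means the ten class scores of each sample are normalised together, which is what a per-sample class distribution requires.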

https://classroom.udacity.com/nanodegrees/nd009-cn-advanced/parts/5f4d630c-d15a-412c-aaeb-b57ad61cd03c/modules/3aa9e812-62cd-4ae3-8fc4-593538f08455/lessons/9b014a97-2267-4f1b-af97-284b7dac2a58/concepts/572cd59e-540f-43d9-906f-33d22a4452a6