Abstract
- Trained a large, deep convolutional neural network to classify the 1.2 million high-resolution images.
- Dataset: the ImageNet LSVRC-2010
- Test set: top-1 and top-5 error rates of 37.5% and 17.0%
- Network: 60 million parameters and 650,000 neurons
- five convolutional layers, some of which are followed by max-pooling layers
- three fully-connected layers with a final 1000-way softmax
- non-saturating neurons
- To reduce overfitting: dropout
Introduction
- labeled high-resolution images
- Applying CNNs to large-scale, high-resolution images is expensive.
- Current GPUs, paired with a highly-optimized implementation of 2D convolution, are powerful enough to train such networks.
Model modifications
- removing any convolutional layer (each of which contains no more than 1% of the model's parameters) resulted in inferior performance
Dataset
- ImageNet: 15 million images
- roughly 22,000 categories
- images collected with the Mechanical Turk crowd-sourcing tool
- The ILSVRC dataset:
- 1.2 million training images,
- 50,000 validation images
- 150,000 testing images
Architecture
- eight learned layers
- five convolutional and three fully-connected
ReLU Nonlinearity
f(x) = tanh(x)
f(x) = (1 + e^{-x})^{-1}
f(x) = max(0, x)
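As a quick illustration, here is a minimal NumPy sketch of the three activations above; only the ReLU is non-saturating (its output is not squashed into a bounded range), which is what speeds up gradient-based training.

```python
import numpy as np

def tanh(x):
    # saturating: output bounded in (-1, 1), gradient vanishes for large |x|
    return np.tanh(x)

def sigmoid(x):
    # saturating: output squashed into (0, 1)
    return 1.0 / (1.0 + np.exp(-x))

def relu(x):
    # non-saturating: identity for x > 0, zero otherwise
    return np.maximum(0.0, x)

x = np.linspace(-5, 5, 11)
print(tanh(x), sigmoid(x), relu(x), sep="\n")
```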
Training on Multiple GPUs
Local Response Normalization
Overlapping Pooling
Overall Architecture
- eight layers with weights
- the first five are convolutional
- the remaining three are fully-connected.
- The final output is fed to a 1000-way softmax.
- The ReLU non-linearity is applied to the output of every convolutional and fully-connected layer.
- First convolutional layer: filters the 224×224×3 input image with 96 kernels of size 11×11×3 with a stride of 4 pixels
- The second layer takes the output of the first layer as input and filters it with 256 kernels of size 5×5×48.
- The third convolutional layer has 384 kernels of size 3 × 3 × 256 connected to the (normalized, pooled) outputs of the second convolutional layer.
- Fourth convolutional layer: 384 kernels of size 3×3×192
- Fifth convolutional layer: 256 kernels of size 3×3×192
- The fully-connected layers have 4096 neurons each (the full stack is sketched in code after this list).
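Below is a minimal single-GPU sketch of this eight-layer stack, written with PyTorch's nn module (an assumption; the paper used a custom two-GPU implementation, so kernel depths such as 5×5×48 describe one GPU's half of the channels and are merged here). The padding values are also assumptions, chosen so that a 224×224 input produces the 6×6×256 feature map that feeds the fully-connected layers.

```python
import torch
import torch.nn as nn

class AlexNetSketch(nn.Module):
    def __init__(self, num_classes: int = 1000):
        super().__init__()
        self.features = nn.Sequential(
            # conv1: 96 kernels of 11x11x3, stride 4 (padding=2 is an assumption)
            nn.Conv2d(3, 96, kernel_size=11, stride=4, padding=2), nn.ReLU(inplace=True),
            nn.LocalResponseNorm(size=5, alpha=1e-4, beta=0.75, k=2.0),
            nn.MaxPool2d(kernel_size=3, stride=2),               # overlapping pooling
            # conv2: 256 kernels of 5x5 (5x5x48 per GPU half in the paper)
            nn.Conv2d(96, 256, kernel_size=5, padding=2), nn.ReLU(inplace=True),
            nn.LocalResponseNorm(size=5, alpha=1e-4, beta=0.75, k=2.0),
            nn.MaxPool2d(kernel_size=3, stride=2),
            # conv3-conv5: 3x3 kernels, no pooling or normalization in between
            nn.Conv2d(256, 384, kernel_size=3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(384, 384, kernel_size=3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(384, 256, kernel_size=3, padding=1), nn.ReLU(inplace=True),
            nn.MaxPool2d(kernel_size=3, stride=2),
        )
        self.classifier = nn.Sequential(
            nn.Dropout(p=0.5),
            nn.Linear(256 * 6 * 6, 4096), nn.ReLU(inplace=True),
            nn.Dropout(p=0.5),
            nn.Linear(4096, 4096), nn.ReLU(inplace=True),
            nn.Linear(4096, num_classes),                        # fed to a 1000-way softmax
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = self.features(x)
        x = torch.flatten(x, 1)
        return self.classifier(x)

# one 224x224x3 image -> 1000 class scores
logits = AlexNetSketch()(torch.randn(1, 3, 224, 224))
print(logits.shape)  # torch.Size([1, 1000])
```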
Reducing Overfitting
Data Augmentation
- artificially enlarge the dataset
- altering the intensities of the RGB channels in training images (sketched below)
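A rough NumPy sketch of the RGB-intensity alteration: multiples of the principal components of the RGB pixel values are added, scaled by the corresponding eigenvalues times a Gaussian draw. The per-image PCA, function name, and value range are assumptions for brevity (the paper computes the principal components over the whole ImageNet training set).

```python
import numpy as np

def pca_color_augment(image: np.ndarray, sigma: float = 0.1) -> np.ndarray:
    """image: H x W x 3 float array with values in [0, 1]."""
    pixels = image.reshape(-1, 3)
    cov = np.cov(pixels, rowvar=False)          # 3x3 covariance of the RGB channels
    eigvals, eigvecs = np.linalg.eigh(cov)      # principal components of RGB space
    alphas = np.random.normal(0.0, sigma, 3)    # drawn each time the image is used
    shift = eigvecs @ (alphas * eigvals)        # [p1 p2 p3] [a1*l1, a2*l2, a3*l3]^T
    return np.clip(image + shift, 0.0, 1.0)     # same shift added to every pixel

augmented = pca_color_augment(np.random.rand(224, 224, 3))
```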
Dropout
- Neurons that are “dropped out” in this way do not contribute to the forward pass and do not participate in backpropagation (see the sketch below).
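A minimal NumPy sketch of this behaviour, with a drop probability of 0.5 as in the paper (the function name is made up for illustration); at test time the paper uses all neurons and multiplies their outputs by 0.5.

```python
import numpy as np

def dropout(activations: np.ndarray, p: float = 0.5, train: bool = True) -> np.ndarray:
    if train:
        mask = np.random.rand(*activations.shape) >= p  # keep each unit with prob 1-p
        return activations * mask                        # dropped units output exactly 0
    return activations * (1.0 - p)                       # test-time scaling of all units

h = np.random.randn(4, 8)   # hypothetical hidden-layer activations
print(dropout(h, train=True))
```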
Details of Learning
- stochastic gradient descent
- a batch size of 128 examples
- momentum of 0.9,
- weight decay of 0.0005 (the resulting update rule is sketched below)
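These hyperparameters correspond to the paper's update rule v ← 0.9·v − 0.0005·ε·w − ε·∇w, w ← w + v. A small NumPy sketch follows, with lr = 0.01 taken as the initial learning rate from the paper and made-up tensors standing in for real weights and gradients.

```python
import numpy as np

def sgd_step(w, v, grad, lr=0.01, momentum=0.9, weight_decay=0.0005):
    # v <- 0.9*v - 0.0005*lr*w - lr*grad ; w <- w + v
    v = momentum * v - weight_decay * lr * w - lr * grad
    return w + v, v

w = np.random.randn(10)          # weights
v = np.zeros_like(w)             # momentum buffer
grad = np.random.randn(10)       # would be averaged over a batch of 128 examples
w, v = sgd_step(w, v, grad)
```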
Results
- validation and test error rates are used interchangeably
Qualitative Evaluations
- Image similarity is computed as the Euclidean distance between feature vectors from the last hidden layer (see the sketch below).
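A sketch of that comparison, assuming the 4096-dimensional feature vectors have already been extracted from the last hidden layer; all arrays here are random placeholders.

```python
import numpy as np

def feature_distance(f1: np.ndarray, f2: np.ndarray) -> float:
    # small Euclidean distance between feature vectors => visually similar images
    return float(np.linalg.norm(f1 - f2))

query = np.random.randn(4096)                 # feature vector of a query image
candidates = np.random.randn(100, 4096)       # feature vectors of other images
nearest = np.argsort([feature_distance(query, c) for c in candidates])[:5]
```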
Discussion
Paper Summary
AlexNet builds a convolutional network from five convolutional layers and three fully-connected layers, adds local response normalization, and uses dropout and artificial dataset enlargement to prevent overfitting.
Top-1 and top-5 error rates
For each test image, take the five classes with the highest predicted probabilities and check whether the correct label is among them.
Top-5 accuracy = (number of test images whose correct label is among the five highest-probability predictions) / (total number of test images)
- Top-5 error rate = (number of test images whose correct label is not among the five highest-probability predictions) / (total number of test images)
- Top-1: the class with the highest probability in the output vector is taken as the prediction; the prediction is correct if this class matches the true label, and wrong otherwise (both metrics are computed in the sketch below).
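A NumPy sketch that computes both metrics from a matrix of predicted class probabilities, matching the definitions above; the array names and random inputs are placeholders.

```python
import numpy as np

def top_k_error(probs: np.ndarray, labels: np.ndarray, k: int) -> float:
    """probs: N x C predicted class probabilities, labels: N true class indices."""
    top_k = np.argsort(probs, axis=1)[:, -k:]            # indices of the k largest probabilities
    correct = np.any(top_k == labels[:, None], axis=1)   # is the true label among the top k?
    return 1.0 - correct.mean()                          # error = 1 - accuracy

probs = np.random.rand(1000, 1000)                       # fake predictions for 1000 test images
labels = np.random.randint(0, 1000, size=1000)
print(top_k_error(probs, labels, 1), top_k_error(probs, labels, 5))
```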