kaggle比赛--Dog Breed Identification狗狗品种识别大赛_dog breed identification challenge-优快云博客

本文总结了狗品种识别竞赛中排名前五的解决方案，详细解析了各方案的技术路线，包括特征提取、数据增强、模型构建及训练策略，为深度学习领域的图像识别研究提供了宝贵经验。

对截止至2020.8.1的top5方案进行总结归纳
比赛链接：dog-breed-identification
1、数据形式
2、top1总结
3、top2总结
4、top3总结
……

1、数据形式

120个类别，训练集共10222张图片，测试集10357张

具体：train和test根目录下为图片，labels为每个图片对应label，sample_submission为测试集和最终给出的csv格式范例。具体如下

import pandas as pd
labelfile = os.path.join(dirname,'labels.csv')
dataf = pd.read_csv(labelfile)
print('csv文件长度：', len(dataf))
dataf

在这里插入图片描述

# 载入测试集
df2 = pd.read_csv(dirname + 'sample_submission.csv')
df2.head()

在这里插入图片描述

2、top1方案总结

top1传送门
方案：
1、使用4个提取特征器提取特征（fc层之前的特征）
2、将特征融合
3、使用简单的cnn网络，包括dense层和dropout层进行训练
没有使用数据增强
结果：验证集最高为0.9413，loss最低为0.1982。

# 提取特征网络
def get_features(model_name, data_preprocessor, input_size, data):
    '''
    1- Create a feature extractor to extract features from the data.
    2- Returns the extracted features and the feature extractor.
    '''
    #Prepare pipeline.
    input_layer = Input(input_size)
    preprocessor = Lambda(data_preprocessor)(input_layer)
    base_model = model_name(weights='imagenet', include_top=False,
                            input_shape=input_size)(preprocessor)<