multi-class,multi-label与multi-task的区别

本文详细解释了多分类(Multiclass classification)、多标签分类(Multilabel classification)及多任务分类(Multi-task classification)的区别与联系,并通过实例说明了这些概念的应用场景。

转自:https://blog.youkuaiyun.com/golden1314521/article/details/51251252

一直很纠结Multi-class, Multi-label 以及 Multi-task 各自的区别和联系,最近找到了以下的说明资料:

  • Multiclass classification means a classification task with more than two classes; e.g., classify a set of images of fruits which may be oranges, apples, or pears. Multiclass classification makes the assumption that each sample is assigned to one and only one label: a fruit can be either an apple or a pear but not both at the same time.

  • Multilabel classification assigns to each sample a set of target labels. This can be thought as predicting properties of a data-point that are not mutually exclusive, such as topics that are relevant for a document. A text might be about any of religion, politics, finance or education at the same time or none of these.

  • Multioutput-multiclass classification and multi-task classification means that a single estimator has to handle several joint classification tasks. This is a generalization of the multi-label classification task, where the set of classification problem is restricted to binary classification, and of the multi-class classification task. The output format is a 2d numpy array or sparse matrix.

    The set of labels can be different for each output variable. For instance a sample could be assigned “pear” for an output variable that takes possible values in a finite set of species such as “pear”, “apple”, “orange” and “green” for a second output variable that takes possible values in a finite set of colors such as “green”, “red”, “orange”, “yellow”…

    This means that any classifiers handling multi-output multiclass or multi-task classification task supports the multi-label classification task as a special case. Multi-task classification is similar to the multi-output classification task with different model formulations. For more information, see the relevant estimator documentation.

可以看出:

  • Multiclass classification 就是多分类问题,比如年龄预测中把人分为小孩,年轻人,青年人和老年人这四个类别。Multiclass classification 与 binary classification相对应,性别预测只有男、女两个值,就属于后者。
  • Multilabel classification 是多标签分类,比如一个新闻稿A可以与{政治,体育,自然}有关,就可以打上这三个标签。而新闻稿B可能只与其中的{体育,自然}相关,就只能打上这两个标签。
  • Multioutput-multiclass classification 和 multi-task classification 指的是同一个东西。仍然举前边的新闻稿的例子,定义一个三个元素的向量,该向量第1、2和3个元素分别对应是否(分别取值1或0)与政治、体育和自然相关。那么新闻稿A可以表示为[1,1,1],而新闻稿B可以表示为[0,1,1],这就可以看成是multi-task classification 问题了。 从这个例子也可以看出,Multilabel classification是一种特殊的multi-task classification 问题。之所以说它特殊,是因为一般情况下,向量的元素可能会取多于两个值,比如同时要求预测年龄和性别,其中年龄有四个取值,而性别有两个取值。

这里写图片描述

2.2 Multi-Class CNN This section describes the multi-class implementation used to differentiate and classify four histopathological categories (ductal, lobular, mucinous, and papillary). The implementation followed the same engineering principles as the binary pipeline but used a categorical output and class-aware design choices to address the increased label complexity. 2.2.1 Data and Preprocessing The multi-class dataset was found on “BREAST CANCER HISTOPATHOLOGICAL DATABASE (BreakHisv1).” This dataset is modified to only include four types of malignant breast tumors, which are the four types of breast cancer used in training the model. The data structure is organized by breast cancer types, individual patient histopathology files, and the zoom factor of tissue screening. It is then assembled into a data frame of file paths and labels (class mapping: ductal carcinoma = 0, lobular carcinoma = 1, mucinous carcinoma = 2, papillary carcinoma = 3). The dataset contained 5,429 images distributed unevenly across classes (ductal: 3,451; mucinous: 792; lobular: 626; papillary: 560). An 80/20 train/validation split yielded 4,343 training and 1,086 validation samples. Images were loaded on-the-fly via ImageDataGenerator with rescaling (1./255) and light augmentation (rotation range=15, width/height shifts up to 0.1, horizontal flip). This on-the-fly strategy reduces memory pressure and preserves I/O efficiency for large image collections. Model architecture and training The multi-class CNN uses four convolutional blocks with progressively larger filters (32 → 64 → 128 → 256), each block containing Conv2D→BatchNormalization→MaxPooling2D→Dropout (0.25). After flattening, dense layers of 512 and 256 units with batch normalization and 0.5 dropout are applied; the final layer is Dense. The model is compiled with the Adam optimizer and categorical cross entropy loss to support mutually exclusive class prediction. Two experiment configurations exist in the codebase: (a) the original configuration (Batch Size = 32, Epochs = 30, LR = 0.001) that produced the run log included with this submission, and (b) a speed-optimized variant (Batch Size = 64, Epochs = 15, LR = 0.002) for faster iteration. Evaluation and Diagnostics Model evaluation uses a held-out generator and sklearn.metrics.classification_report and confusion_matrix to report per-class precision, recall, and F1-score; a seaborn heatmap visualizes the confusion matrix and matplotlib plots training/validation accuracy and loss. The run log for the original configuration reports a peak validation accuracy of 0.6529 (epoch 4) and a recorded training accuracy of 0.6967 (epoch 5) during the captured epochs; the full evaluation (per-class metrics and confusion-matrix counts) is then produced and included in the presentation. 2.2.4 Practical observations and improvements The dataset’s class imbalance (ductal dominance) is the main challenge for multi-class discrimination. Remedies to consider include class-weighted loss or oversampling of minority classes, focal loss to mitigate easy negative dominance, stronger augmentation targeted at minority classes, or stratified patch sampling. The model’s dense head (flatten → 512 units) yields ~19.4M parameters and can be prone to overfitting; replacing the flatten + dense stack with a GlobalAveragePooling2D followed by a small dense head or applying more aggressive dropout or L2 regularization may reduce overfitting. The literature suggests further gains from transfer learning (pretrained EfficientNet/ResNet backbones or DenTnet) and patch-level context models (BiLSTM on ordered patches) for cases where contextual arrangement of tissue is diagnostic. Step-by-step processing pipeline: Assemble DataFrame of filepaths and labels and report class counts. Create ImageDataGenerator instances for training (rescale + augmentation) and validation (rescale). Split into train/validation via train_test_split(..., stratify=df['label']) and create flow_from_dataframe generators. Build CNN (Conv blocks → Flatten → Dense → Softmax), compile with Adam + categorical_crossentropy. Train with model.fit using EarlyStopping and ReduceLROnPlateau callbacks; save the best model. Evaluate with a test generator: compute predicted classes, print classification_report, plot confusion matrix, and save accuracy/loss curves.分析multi-CNN模型的作用英文分析优点
09-25
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值