Low-shot Visual Recognition by Shrinking and Hallucinating Features

最新推荐文章于 2021-02-23 13:16:19 发布

翻译最新推荐文章于 2021-02-23 13:16:19 发布 · 1k 阅读

本文介绍了一种少样本学习框架，包含三个阶段：学习者由特征提取器和多类分类器组成；第一训练阶段（表示学习），从大量基础类别中学习特征表达；第二训练阶段（少样本学习），学习区分新类别；测试阶段，在未见过的测试图像上预测标签。核心在于通过类比转换生成新例。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

3个组成

It employs a learner, two training phases, and one testing phase.

1. learner

The learner is assumed to be composed of a feature extractor and a multi-class classifier.

2. traing phased one （representation learning）

During representation learning (training phase one), the
learner receives a fixed set of base categories $C_{base}$ , and a dataset D containing a large number of examples for each category in Cbase. The learner uses D to set the parameters of its feature extractor.

产生新的类别

Any two examples z1 and z2 belonging to the same category represent a plausible transformation. Then, given a novel category example x, we want to apply to x the transformation that sent z1 to z2. That is, we want to complete the transformation “analogy” z1 : z2 :: x : ?.

We do this by training a function G that takes as input the concatenated feature vectors of the three examples [φ(x), φ(z1), φ(z2)].G是MLP

3. second phase （low-shot learning）

the learner is given a set of categories $C_l$ that it must learn to distinguish. $C_l$ = $C_{base}$ $∪\cup$ $C_{novel}$ is a mix of base categories Cbase, and unseen novel categories Cnovel.
对于 $C_{novel}$ ，仅仅有k-shot