Training Protocol
-
backbone:
ResNet-101 or modified aligned Xception -
pretrain:
ImageNet-1K -
dataset:
PASCAL VOC 2012 (20 foreground object classes, 1 background class)
10582 (trainaug) training images, 1449 (val), 1456 (test) -
lr schedule:
“poly” policy (initial lr:
0.007)
initial learning rate is multiplied by (1−itermax_iter)power(1-\frac{iter}{max\_iter})^{power}(1−