際際滷

際際滷Share a Scribd company logo
Takuya Akiba*, Tommi Kerola*, Yusuke Niitani*,
Toru Ogawa*, Shotaro Sano*, and Shuji Suzuki*
*: Equal Contribution
PFDet: 2nd Place Solution to
Open Images Competition
: iwiwi
Kaggle Tokyo Meetup #2
: Shotaro Sano (@g_votte)
PFDet: 2nd Place Solutions to Open Images Competition
? Object Detection
? 500, 1.7M, Bounding Box 12M
? : 2018 7/3 ~ 8/30
Public LB
Private LB
PFDet: 2nd Place Solutions to Open Images Competition
?
? :
? :
C Bounding Boxes
C Bounding Box
?
?
?
Object Detection
Bounding Box
: mean Average Precision
? Average Precision
? Average Precision
C
C
Precision
IoU = >= 0.5
Confidence TP/FP precision@k
0.9 TP 100%
0.8 TP 100%
0.75 FP 66%
0.7 TP 75%
0.67 FP 60%
´ ´ ´
Average Precision
Object Detection
2007-2012 2014- 2018
MS COCO Open Images
# of classes 80 500
# of images 0.12M 1.7M
Increase by more than
x14
Object Detection
Faster RCNN
Bounding Box & Bounding Box
&
Bounding Box (NMS/NMW)
PFDet: 2nd Place Solutions to Open Images Competition
PFDet
Architecture
Faster RCNN
+ Feature Pyramid Networks
+ SE-ResNeXt
+ Pyramid Scene Parsing
+ Context Head
+ Non-maximal Weighted
Training Method
SGD
+ Sigmoid Loss
+ Cosine Annealing
+ Co-occurrence Loss
Faster RCNN
SE-ResNeXt
Feature Pyramid Networks
Expert Models
Sigmoid Loss
Cosine Annealing
Co-occurrence Loss
SE-ResNeXt
Feature Pyramid Networks
Expert Models
SE-ResNeXt
SE-ResNeXt
ResNeXt
SENet
Feature Pyramid Networks
Feature Pyramid Networks
? ResNet (& upsampling) feature map
? Region Proposal Network
Pressure Cooker: 17 images
Person: 800k images
238 classes appear in <1000 images
Image from https://storage.googleapis.com/openimages/web/factsfigures.html
Expert Models
PFDet: 2nd Place Solutions to Open Images Competition
Expert Models
?
?
Suppression
BB & class
BB & class
BB & class
BB & class
BB & class
BB & class
Bounding Box
Model 1
Model 2
´
Concat
Suppression
Sigmoid Loss
Sigmoid Loss
?
? E.g., Bounding Box Football Ball
FC Softmax
volleyball score
football score
ball score
Cross
Entropy
football or ball
volleyball score
football score
ball score
FC
Sigmoid
Sigmoid
Sigmoid
Cross Entropy
football and ballCross Entropy
Cross Entropy
SoftmaxLossSigmoidLoss
Cosine Annealing
Co-occurrence Loss
Cosine Annealing
Co-occurrence Loss
Co-occurrence Loss
?
? Bounding Box Bounding Box
Ignore:
Face
Arm
Negative:
Car, ...
negative
Co-occurrence Loss
+22.7AP improvement on
^Human Parts ̄
on 47 part classes average +9.2AP improvement
Faster RCNN
SE-ResNeXt
Feature Pyramid Networks
Expert Models
Sigmoid Loss
Cosine Annealing
Co-occurrence Loss
Single Best Model
Ensemble
PFDet: 2nd Place Solutions to Open Images Competition
MS COCO Open Images
# of classes 80 500
# of images 0.12M 1.7M
Increase by more than
x14
HW MN-1
V100 (32GB) x512
Infiniband
¢ Best single model 33
¢ 83% (v.s. 8 GPU)
SW ChainerMN
Multi-node Batch Normalization
? BN ghost BN
C mean, var
? segmentation, detection multi-node BN
C BN mean, var
? multi-node BN
? Faster R-CNN + FPN + etc
? Backbone: SE-ResNeXt (vs. NASNet, etc´´)
? LR Schedule: Cosine (vs. step, poly, etc´´)
? Suppression: NMW (vs. NMS, Soft NMS, etc´´)
? PSP, Context head
? : Co-occurrence loss
? : Multi-node BN, Linear LR scaling, warmup
? expert model
1. tech report:
Takuya Akiba, Tommi Kerola, Yusuke Niitani, Toru Ogawa, Shotaro Sano, Shuji Suzuki.
PFDet: 2nd Place Solution to Open Images Challenge 2018 Object Detection Track.
https://arxiv.org/abs/1809.00778
2. Co-occurrence loss tech report:
Yusuke Niitani, Takuya Akiba, Tommi Kerola, Toru Ogawa, Shotaro Sano, Shuji Suzuki.
Sampling Techniques for Large-Scale Object Detection from Sparsely Annotated Objects.
https://arxiv.org/abs/1811.10862

More Related Content

PFDet: 2nd Place Solutions to Open Images Competition