22 May 2021 Balanced-RetinaNet: solving the imbalanced problems in object detection
Yuxin Wang, Weibin Liu, Weiwei Xing
Author Affiliations +
Abstract

Compared with the improvement of model structure in the field of object detection, the imbalanced problems in the training process have received less attention, but it is also one of the main reasons affecting its performance. We mainly analyze the imbalanced problems that occur in different stages of the network training process and propose a more balanced network called Balanced-RetinaNet, which has three improvements. First, in the feature extraction stage, a multi-scale feature balanced module is proposed to settle the problem in terms of imbalanced feature distribution, that is, the high-level feature lacks spatial information and while the low-level feature lacks semantic information; then, in the object regression stage, an interval-based regression loss is proposed to solve the problem of inaccurate localization caused by the different contributions of different samples to the regression loss; finally, in the object classification stage, an adaptive focal loss is proposed to solve the problem of classification errors caused by the loss of a large number of negative samples overwhelming the overall classification loss. Experiments have proved that by solving the imbalanced problems, the detection accuracy has been significantly improved on the MS COCO dataset.

© 2021 SPIE and IS&T 1017-9909/2021/$28.00© 2021 SPIE and IS&T
Yuxin Wang, Weibin Liu, and Weiwei Xing "Balanced-RetinaNet: solving the imbalanced problems in object detection," Journal of Electronic Imaging 30(3), 033009 (22 May 2021). https://doi.org/10.1117/1.JEI.30.3.033009
Received: 18 November 2020; Accepted: 11 May 2021; Published: 22 May 2021
Lens.org Logo
CITATIONS
Cited by 2 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Feature extraction

Classification systems

Convolution

Detection and tracking algorithms

Image resolution

Performance modeling

Sensors

Back to Top