In order to solve the problem of multi-scale in a single image gathering crowd counting, a new crowd counting network based on the fusion of dilating convolution pyramid and context attention mechanism (DCPCANet) is proposed. With the first ten convolutional layers of VGG16 as the front-end network, an dilated convolutional pyramid fusion attention mechanism module (AMP) is proposed, which is introduced into the three-level upsampling feature fusion module to extract fused multi-scale features, and the AMP module stack is used as the back-end network to capture and fuse multiscale features, The context attention module (CAM) is used to generate the feature map with weight, and high-quality crowd density map is output at the same time. Three mainstream public data sets are adopted, ShanghaiTech PartA,ShanghaiTech PartB,UCF_CC_50. Compared with the previous algorithm, the MAE of the UCF_CC_50 dataset is reduced by 11%, which preliminarily verifies the accuracy and robustness of the model.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.