Pre-rotation only at inference-time: a way to rotation invariance

Peng Zhang; Jinsong Tang; Heping Zhong; Mingqiang Ning; Yue Fan

doi:10.1117/12.2644390

12 October 2022 Pre-rotation only at inference-time: a way to rotation invariance

Peng Zhang, Jinsong Tang, Heping Zhong, Mingqiang Ning, Yue Fan

Proceedings Volume 12342, Fourteenth International Conference on Digital Image Processing (ICDIP 2022); 123420O (2022) https://doi.org/10.1117/12.2644390
Event: Fourteenth International Conference on Digital Image Processing (ICDIP 2022), 2022, Wuhan, China

Abstract

Weight sharing across different locations makes Convolutional Neural Networks (CNNs) space shift invariant, i.e., the weights learned in one location can be applied to recognize objects in other locations. However, weight sharing mechanism has been lacked in Rotated Pattern Recognition (RPR) tasks, and CNNs have to learn training samples in different orientations by rote. As such rote-learning strategy has greatly increased the difficulty of training, a new solution for RPR tasks, Pre-Rotation Only At Inference time (PROAI), is proposed to provide CNNs with rotation invariance. The core idea of PROAI is to share CNN weights across multiple rotated versions of the test sample. At the training time, a CNN was trained with samples only in one angle; at the inference-time, test samples were pre-rotated at different angles and then fed into the CNN to calculate classification confidences; at the end both the category and the orientation were predicted using the position of the max value of these confidences. By adopting PROAI, the recognition ability learned at one orientation can be generalized to patterns at any other orientation, and both the number of parameters and the training time of CNN in RPR tasks can be greatly reduced. Experiments show that PROAI enables CNNs with less parameters and training time to achieve state-of-the-art classification and orientation performance on both rotated MNIST and rotated Fashion MNIST datasets.

Citation Download Citation

Peng Zhang, Jinsong Tang, Heping Zhong, Mingqiang Ning, and Yue Fan "Pre-rotation only at inference-time: a way to rotation invariance", Proc. SPIE 12342, Fourteenth International Conference on Digital Image Processing (ICDIP 2022), 123420O (12 October 2022); https://doi.org/10.1117/12.2644390

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available