At present, image processing has widely utilized in tremendous downstream applications including robotics, detection sensors and industrial products. Existing processing methods have achieved considerable accuracy towards complex images. However, existing classification concentrates on the convolution operation and ignore extraordinary information in feature space. In this work, we propose a novel classification based on the extracted features of images and utilized most advanced transformer model to identify the information from diverse images. Initially, the model extracts the features from the feature space and identify the feature information. Subsequently, the transformer model will train the classification mechanism through recognizing the feature information with the target labels. Finally, the trained model can recognize the different objectives in real-world images. From our extensive simulation and comparison analysis, we can conclude that proposed model can achieve the target classification with acceptable accuracy and reasonable training costs.
|