Presentation + Paper
9 August 2023 Scale-invariant semantic segmentation of natural RGB-D images combining decision tree and deep learning models
L. Romeo, R. P. Devanna, R. Marani, G. Matranga, M. Biddoccu, A. Milella
Author Affiliations +
Abstract
In-field sensing systems for automatic yield monitoring are gaining increasing importance as they promise to give a considerable boost in production. The development of artificial intelligence and sensing technologies to assist the human workforce also meets sustainability needs, which impact the ecological goals of current and future agricultural processes. In this context, image acquisition and processing systems are widely adopted to extract useful information for farmers. Although RGB-D cameras have been used in many applications for ground-based proximal sensing, relatively few works can be found that include depth information in image analysis. In this work, both semantic and depth information from RGB-D vineyard images is used in processing pipeline composed of a decision tree algorithm and a deep learning model. The goal is to reach coherent semantic segmentation of a set of natural images acquired at both long and short distances, using a low-cost RGB-D camera in an experimental vineyard. Depth information of each image is fed into a decision tree to predict the distance of the acquired vines from the camera. Before feeding the deep learning models, the images to be segmented are manipulated according to the predicted distance. The results of semantic segmentation with and without using the decision tree are compared, showing how depth information appears to be highly relevant in enhancing the accuracy and precision of the predicted semantic maps.
Conference Presentation
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
L. Romeo, R. P. Devanna, R. Marani, G. Matranga, M. Biddoccu, and A. Milella "Scale-invariant semantic segmentation of natural RGB-D images combining decision tree and deep learning models", Proc. SPIE 12621, Multimodal Sensing and Artificial Intelligence: Technologies and Applications III, 1262111 (9 August 2023); https://doi.org/10.1117/12.2677371
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image segmentation

Semantics

Deep learning

RGB color model

Cameras

Decision trees

Education and training

Back to Top