Lung cancer has both high incidence and mortality rates compared to other cancer types. One important factor for improved patient survival is early detection. Deep learning for lung nodule detection has been extensively studied, as a tool to facilitate clinicians with early nodule detection and classification. Many publications are reporting high detection accuracy and several models have been introduced to clinical practice. However, certain models may have reduced performance in real-world clinical practice. In this study, we introduce a method to assess the robustness of lung nodule detection models. Medically relevant image perturbations are used to assess the robustness of these models. The perturbations include noise and motion perturbations, which have been created in consultation with an expert radiologist to ensure the clinical relevance of the artifacts for thoracic computed tomography (CT) scans. The evaluated models demonstrate robustness to clinically relevant noise simulations, but it shows less resilience to motion artifacts in perturbed CT scans. This robustness evaluation method, incorporating simulated relevant artifacts, can be extended for use in other applications involving the analysis of CT scans.
|