Development of Debiasing Technique for Lung Nodule Chest X-ray Datasets to Generalize Deep Learning Models

Title
Development of Debiasing Technique for Lung Nodule Chest X-ray Datasets to Generalize Deep Learning Models
Publication Date
2023-07-21
Author(s)
Horry, Michael J
Chakraborty, Subrata
( author )
OrcID: https://orcid.org/0000-0002-0102-5424
Email: schakra3@une.edu.au
UNE Id une-id:schakra3
Pradhan, Biswajeet
Paul, Manoranjan
Zhu, Jing
Loh, Hui Wen
Barua, Prabal Datta
Acharya, U Rajendra
Type of document
Journal Article
Language
en
Entity Type
Publication
Publisher
MDPI AG
Place of publication
Switzerland
DOI
10.3390/s23146585
UNE publication id
une:1959.11/59306
Abstract

Screening programs for early lung cancer diagnosis are uncommon, primarily due to the challenge of reaching at-risk patients located in rural areas far from medical facilities. To overcome this obstacle, a comprehensive approach is needed that combines mobility, low cost, speed, accuracy, and privacy. One potential solution lies in combining the chest X-ray imaging mode with federated deep learning, ensuring that no single data source can bias the model adversely. This study presents a pre-processing pipeline designed to debias chest X-ray images, thereby enhancing internal classification and external generalization. The pipeline employs a pruning mechanism to train a deep learning model for nodule detection, utilizing the most informative images from a publicly available lung nodule X-ray dataset. Histogram equalization is used to remove systematic differences in image brightness and contrast. Model training is then performed using combinations of lung field segmentation, close cropping, and rib/bone suppression. The resulting deep learning models, generated through this pre-processing pipeline, demonstrate successful generalization on an independent lung nodule dataset. By eliminating confounding variables in chest X-ray images and suppressing signal noise from the bone structures, the proposed deep learning lung nodule detection algorithm achieves an external generalization accuracy of 89%. This approach paves the way for the development of a low-cost and accessible deep learning-based clinical system for lung cancer screening.

Link
Citation
Sensors, 23(14), p. 1-21
ISSN
1424-8220
1424-8239
Start page
1
End page
21
Rights
Attribution 4.0 International

Files:

NameSizeformatDescriptionLink
openpublished/DevelopmentChakraborty2023JournalArticle.pdf 7505.363 KB application/pdf Published Version View document