Factors determining generalization in deep learning models for scoring COVID-CT images

Horry, Michael James; Chakraborty, Subrata; Pradhan, Biswajeet; Fallahpoor, Maryam; Chegeni, Hossein; Paul, Manoranjan

doi:10.3934/mbe.2021456

Title

Factors determining generalization in deep learning models for scoring COVID-CT images

Publication Date

2021-10-27

Author(s)

Horry, Michael James

Chakraborty, Subrata

( author )
OrcID: https://orcid.org/0000-0002-0102-5424
Email: schakra3@une.edu.au
UNE Id une-id:schakra3

Pradhan, Biswajeet

Fallahpoor, Maryam

Chegeni, Hossein

Paul, Manoranjan

Type of document

Journal Article

Language

en

Entity Type

Publication

Publisher

AIMS Press

Place of publication

United States of America

DOI

10.3934/mbe.2021456

UNE publication id

une:1959.11/43232

Abstract

The COVID-19 pandemic has inspired unprecedented data collection and computer vision modelling efforts worldwide, focused on the diagnosis of COVID-19 from medical images. However, these models have found limited, if any, clinical application due in part to unproven generalization to data sets beyond their source training corpus. This study investigates the generalizability of deep learning models using publicly available COVID-19 Computed Tomography data through cross dataset validation. The predictive ability of these models for COVID-19 severity is assessed using an independent dataset that is stratified for COVID-19 lung involvement . Each inter-dataset study is performed using histogram equalization, and contrast limited adaptive histogram equalization with and without a learning Gabor filter. We show that under certain conditions, deep learning models can generalize well to an external dataset with F1 scores up to 86%. The best performing model shows predictive accuracy of between 75% and 96% for lung involvement scoring against an external expertly stratified dataset. From these results we identify key factors promoting deep learning generalization, being primarily the uniform acquisition of training images, and secondly diversity in CT slice position.

Link

link

Citation

Mathematical Biosciences and Engineering, 18(6), p. 9264-9293

ISSN

1551-0018

1547-1063

Start page

9264

End page

9293

Rights

Attribution 4.0 International

Factors determining generalization in deep learning models for scoring COVID-CT images

Files: