Evaluation of federated learning variations for COVID-19 diagnosis using chest radiographs from 42 US and European hospitals

Peng, Le; Luo, Gaoxiang; Walker, Andrew; Zaiman, Zachary; Jones, Emma K.; Gupta, Hemant; Kersten, Kristopher; Burns, John L.; Harle, Christopher A.; Magoc, Tanja; Shickel, Benjamin; Steenburg, Scott D.; Loftus, Tyler; Melton, Genevieve B.; Wawira Gichoya, Judy; Sun, Ju; Tignanelli, Christopher J.

Evaluation of federated learning variations for COVID-19 diagnosis using chest radiographs from 42 US and European hospitals

dc.contributor.author	Peng, Le
dc.contributor.author	Luo, Gaoxiang
dc.contributor.author	Walker, Andrew
dc.contributor.author	Zaiman, Zachary
dc.contributor.author	Jones, Emma K.
dc.contributor.author	Gupta, Hemant
dc.contributor.author	Kersten, Kristopher
dc.contributor.author	Burns, John L.
dc.contributor.author	Harle, Christopher A.
dc.contributor.author	Magoc, Tanja
dc.contributor.author	Shickel, Benjamin
dc.contributor.author	Steenburg, Scott D.
dc.contributor.author	Loftus, Tyler
dc.contributor.author	Melton, Genevieve B.
dc.contributor.author	Wawira Gichoya, Judy
dc.contributor.author	Sun, Ju
dc.contributor.author	Tignanelli, Christopher J.
dc.contributor.department	Radiology and Imaging Sciences, School of Medicine
dc.date.accessioned	2023-09-27T15:56:36Z
dc.date.available	2023-09-27T15:56:36Z
dc.date.issued	2022
dc.description.abstract	Objective: Federated learning (FL) allows multiple distributed data holders to collaboratively learn a shared model without data sharing. However, individual health system data are heterogeneous. "Personalized" FL variations have been developed to counter data heterogeneity, but few have been evaluated using real-world healthcare data. The purpose of this study is to investigate the performance of a single-site versus a 3-client federated model using a previously described Coronavirus Disease 19 (COVID-19) diagnostic model. Additionally, to investigate the effect of system heterogeneity, we evaluate the performance of 4 FL variations. Materials and methods: We leverage a FL healthcare collaborative including data from 5 international healthcare systems (US and Europe) encompassing 42 hospitals. We implemented a COVID-19 computer vision diagnosis system using the Federated Averaging (FedAvg) algorithm implemented on Clara Train SDK 4.0. To study the effect of data heterogeneity, training data was pooled from 3 systems locally and federation was simulated. We compared a centralized/pooled model, versus FedAvg, and 3 personalized FL variations (FedProx, FedBN, and FedAMP). Results: We observed comparable model performance with respect to internal validation (local model: AUROC 0.94 vs FedAvg: 0.95, P = .5) and improved model generalizability with the FedAvg model (P < .05). When investigating the effects of model heterogeneity, we observed poor performance with FedAvg on internal validation as compared to personalized FL algorithms. FedAvg did have improved generalizability compared to personalized FL algorithms. On average, FedBN had the best rank performance on internal and external validation. Conclusion: FedAvg can significantly improve the generalization of the model compared to other personalization FL algorithms; however, at the cost of poor internal validity. Personalized FL may offer an opportunity to develop both internal and externally validated algorithms.
dc.eprint.version	Final published version
dc.identifier.citation	Peng L, Luo G, Walker A, et al. Evaluation of federated learning variations for COVID-19 diagnosis using chest radiographs from 42 US and European hospitals. J Am Med Inform Assoc. 2022;30(1):54-63. doi:10.1093/jamia/ocac188
dc.identifier.uri	https://hdl.handle.net/1805/35836
dc.language.iso	en_US
dc.publisher	Oxford University Press
dc.relation.isversionof	10.1093/jamia/ocac188
dc.relation.journal	Journal of the American Medical Informatics Association
dc.rights	Publisher Policy
dc.source	PMC
dc.subject	COVID-19
dc.subject	Artificial intelligence
dc.subject	Computer vision
dc.subject	Federated learning
dc.title	Evaluation of federated learning variations for COVID-19 diagnosis using chest radiographs from 42 US and European hospitals
dc.type	Article
ul.alternative.fulltext	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9619688/

Files

Original bundle

Now showing 1 - 1 of 1

Name:: ocac188.pdf
Size:: 828.58 KB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.99 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Open Access Policy Articles
Department of Radiology and Imaging Sciences Works