Predictive utilities of lipid traits, lipoprotein subfractions and other risk factors for incident diabetes: a machine learning approach in the Diabetes Prevention Program

dc.contributor.authorVarga, Tibor V.
dc.contributor.authorLiu, Jinxi
dc.contributor.authorGoldberg, Ronald B.
dc.contributor.authorChen, Guannan
dc.contributor.authorDagogo-Jack, Samuel
dc.contributor.authorLorenzo, Carlos
dc.contributor.authorMather, Kieren J.
dc.contributor.authorPi-Sunyer, Xavier
dc.contributor.authorBrunak, Søren
dc.contributor.authorTemprosa, Marinella
dc.contributor.departmentMedicine, School of Medicineen_US
dc.date.accessioned2022-08-18T11:10:38Z
dc.date.available2022-08-18T11:10:38Z
dc.date.issued2021-03
dc.description.abstractIntroduction: Although various lipid and non-lipid analytes measured by nuclear magnetic resonance (NMR) spectroscopy have been associated with type 2 diabetes, a structured comparison of the ability of NMR-derived biomarkers and standard lipids to predict individual diabetes risk has not been undertaken in larger studies nor among individuals at high risk of diabetes. Research design and methods: Cumulative discriminative utilities of various groups of biomarkers including NMR lipoproteins, related non-lipid biomarkers, standard lipids, and demographic and glycemic traits were compared for short-term (3.2 years) and long-term (15 years) diabetes development in the Diabetes Prevention Program, a multiethnic, placebo-controlled, randomized controlled trial of individuals with pre-diabetes in the USA (N=2590). Logistic regression, Cox proportional hazards model and six different hyperparameter-tuned machine learning algorithms were compared. The Matthews Correlation Coefficient (MCC) was used as the primary measure of discriminative utility. Results: Models with baseline NMR analytes and their changes did not improve the discriminative utility of simpler models including standard lipids or demographic and glycemic traits. Across all algorithms, models with baseline 2-hour glucose performed the best (max MCC=0.36). Sophisticated machine learning algorithms performed similarly to logistic regression in this study. Conclusions: NMR lipoproteins and related non-lipid biomarkers were associated but did not augment discrimination of diabetes risk beyond traditional diabetes risk factors except for 2-hour glucose. Machine learning algorithms provided no meaningful improvement for discrimination compared with logistic regression, which suggests a lack of influential latent interactions among the analytes assessed in this study.en_US
dc.eprint.versionFinal published versionen_US
dc.identifier.citationVarga TV, Liu J, Goldberg RB, et al. Predictive utilities of lipid traits, lipoprotein subfractions and other risk factors for incident diabetes: a machine learning approach in the Diabetes Prevention Program. BMJ Open Diabetes Res Care. 2021;9(1):e001953. doi:10.1136/bmjdrc-2020-001953en_US
dc.identifier.urihttps://hdl.handle.net/1805/29816
dc.language.isoen_USen_US
dc.publisherBMJen_US
dc.relation.isversionof10.1136/bmjdrc-2020-001953en_US
dc.relation.journalBMJ Open Diabetes Research & Careen_US
dc.rightsAttribution-NonCommercial 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by-nc/4.0/*
dc.sourcePMCen_US
dc.subjectLipoproteinsen_US
dc.subjectPrediabetic stateen_US
dc.subjectDiabetes Mellitus, Type 2en_US
dc.subjectLipidsen_US
dc.titlePredictive utilities of lipid traits, lipoprotein subfractions and other risk factors for incident diabetes: a machine learning approach in the Diabetes Prevention Programen_US
dc.typeArticleen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
bmjdrc-2020-001953.pdf
Size:
1.14 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: