Integrative analysis of multi-omics and imaging data with incorporation of biological information via structural Bayesian factor analysis

dc.contributor.authorBao, Jingxuan
dc.contributor.authorChang, Changgee
dc.contributor.authorZhang, Qiyiwen
dc.contributor.authorSaykin, Andrew J.
dc.contributor.authorShen, Li
dc.contributor.authorLong, Qi
dc.contributor.authorAlzheimer’s Disease Neuroimaging Initiative
dc.contributor.departmentRadiology and Imaging Sciences, School of Medicine
dc.date.accessioned2024-02-21T20:20:33Z
dc.date.available2024-02-21T20:20:33Z
dc.date.issued2023
dc.description.abstractMotivation: With the rapid development of modern technologies, massive data are available for the systematic study of Alzheimer's disease (AD). Though many existing AD studies mainly focus on single-modality omics data, multi-omics datasets can provide a more comprehensive understanding of AD. To bridge this gap, we proposed a novel structural Bayesian factor analysis framework (SBFA) to extract the information shared by multi-omics data through the aggregation of genotyping data, gene expression data, neuroimaging phenotypes and prior biological network knowledge. Our approach can extract common information shared by different modalities and encourage biologically related features to be selected, guiding future AD research in a biologically meaningful way. Method: Our SBFA model decomposes the mean parameters of the data into a sparse factor loading matrix and a factor matrix, where the factor matrix represents the common information extracted from multi-omics and imaging data. Our framework is designed to incorporate prior biological network information. Our simulation study demonstrated that our proposed SBFA framework could achieve the best performance compared with the other state-of-the-art factor-analysis-based integrative analysis methods. Results: We apply our proposed SBFA model together with several state-of-the-art factor analysis models to extract the latent common information from genotyping, gene expression and brain imaging data simultaneously from the ADNI biobank database. The latent information is then used to predict the functional activities questionnaire score, an important measurement for diagnosis of AD quantifying subjects' abilities in daily life. Our SBFA model shows the best prediction performance compared with the other factor analysis models. Availability: Code are publicly available at https://github.com/JingxuanBao/SBFA.
dc.eprint.versionFinal published version
dc.identifier.citationBao J, Chang C, Zhang Q, et al. Integrative analysis of multi-omics and imaging data with incorporation of biological information via structural Bayesian factor analysis. Brief Bioinform. 2023;24(2):bbad073. doi:10.1093/bib/bbad073
dc.identifier.urihttps://hdl.handle.net/1805/38583
dc.language.isoen_US
dc.publisherOxford University Press
dc.relation.isversionof10.1093/bib/bbad073
dc.relation.journalBriefings in Bioinformatics
dc.rightsPublisher Policy
dc.sourcePMC
dc.subjectStructural Bayesian factor analysis
dc.subjectMulti-omics
dc.subjectAlzheimer’s disease
dc.subjectBiological network
dc.titleIntegrative analysis of multi-omics and imaging data with incorporation of biological information via structural Bayesian factor analysis
dc.typeArticle
ul.alternative.fulltexthttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC10387302/
Files
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
bbad073.pdf
Size:
1.9 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: