Variational autoencoder-based model improves polygenic prediction in blood cell traits

Date
2025-08-08
Language
American English
Found At
Elsevier
Abstract

Genetic prediction of complex traits, enabled by large-scale genomic studies, has created new ways to understand individual genetic predisposition. Polygenic risk scores (PRSs) aggregate information across the genome, enabling personalized risk prediction for complex traits and diseases. However, conventional PRS calculation methods that rely on linear models are limited in their ability to capture complex patterns and interaction effects in high-dimensional genomic data. In this study, we seek to improve the predictive power of PRS by applying advanced deep learning techniques. We show that a variational autoencoder-based model for PRS construction (VAE-PRS) outperforms current state-of-the-art methods for biobank-level data in 14 out of 16 blood cell traits while remaining computationally efficient. Through comprehensive experiments, we found that the VAE-PRS model can capture interaction effects in high-dimensional data and performs robustly across different pre-screened variant sets. Furthermore, VAE-PRS is readily interpretable: the Shapley additive explanations method quantifies the contribution of each individual marker to the final prediction score, providing potential new insights for identifying trait-associated genetic variants. In summary, VAE-PRS presents an approach to genetic risk prediction for blood cell traits that harnesses the power of deep learning methods given appropriate training sample size, which could further facilitate the development of personalized medicine and genetic research.
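
For readers unfamiliar with the conventional linear approach that the abstract contrasts with VAE-PRS, the following is a minimal sketch of an additive PRS: a weighted sum of allele dosages. It is an illustration only and is not taken from the paper. The function name `linear_prs`, the weights, and the genotypes are all hypothetical toy values.

```python
from typing import Sequence


def linear_prs(dosages: Sequence[float], weights: Sequence[float]) -> float:
    """Conventional additive PRS: sum over variants of dosage * effect weight."""
    if len(dosages) != len(weights):
        raise ValueError("dosages and weights must have the same length")
    return sum(d * w for d, w in zip(dosages, weights))


# Hypothetical per-variant effect weights (assumed, not from the paper).
weights = [0.5, -0.25, 1.0]

# Hypothetical allele dosages (0, 1, or 2 copies of the effect allele).
genotypes = {
    "person_a": [0, 1, 2],
    "person_b": [2, 2, 0],
}

# One score per individual; each variant contributes independently.
scores = {name: linear_prs(g, weights) for name, g in genotypes.items()}
print(scores)
```

Because each variant contributes independently to the sum, a score of this form cannot represent an effect that depends on two variants jointly, which is the kind of limitation the abstract attributes to linear models.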

Cite As
Li X, Kharitonova E, Pang M, et al. Variational autoencoder-based model improves polygenic prediction in blood cell traits. HGG Adv. Published online August 8, 2025. doi:10.1016/j.xhgg.2025.100490
Journal
HGG Advances
Source
PMC
Type
Article
Version
Final published version