WEVar: a novel statistical learning framework for predicting noncoding regulatory variants

dc.contributor.authorWang, Ye
dc.contributor.authorJiang, Yuchao
dc.contributor.authorYao, Bing
dc.contributor.authorHuang, Kun
dc.contributor.authorLiu, Yunlong
dc.contributor.authorWang, Yue
dc.contributor.authorQin, Xiao
dc.contributor.authorSaykin, Andrew J.
dc.contributor.authorChen, Li
dc.contributor.departmentBiostatistics and Health Data Science, School of Medicineen_US
dc.date.accessioned2023-06-15T17:22:58Z
dc.date.available2023-06-15T17:22:58Z
dc.date.issued2021
dc.description.abstractUnderstanding the functional consequence of noncoding variants is of great interest. Though genome-wide association studies or quantitative trait locus analyses have identified variants associated with traits or molecular phenotypes, most of them are located in the noncoding regions, making the identification of causal variants a particular challenge. Existing computational approaches developed for prioritizing noncoding variants produce inconsistent and even conflicting results. To address these challenges, we propose a novel statistical learning framework, which directly integrates the precomputed functional scores from representative scoring methods. It will maximize the usage of integrated methods by automatically learning the relative contribution of each method and produce an ensemble score as the final prediction. The framework consists of two modes. The first 'context-free' mode is trained using curated causal regulatory variants from a wide range of context and is applicable to predict regulatory variants of unknown and diverse context. The second 'context-dependent' mode further improves the prediction when the training and testing variants are from the same context. By evaluating the framework via both simulation and empirical studies, we demonstrate that it outperforms integrated scoring methods and the ensemble score successfully prioritizes experimentally validated regulatory variants in multiple risk loci.en_US
dc.eprint.versionFinal published versionen_US
dc.identifier.citationWang Y, Jiang Y, Yao B, et al. WEVar: a novel statistical learning framework for predicting noncoding regulatory variants. Brief Bioinform. 2021;22(6):bbab189. doi:10.1093/bib/bbab189en_US
dc.identifier.urihttps://hdl.handle.net/1805/33795
dc.language.isoen_USen_US
dc.publisherOxford University Pressen_US
dc.relation.isversionof10.1093/bib/bbab189en_US
dc.relation.journalBriefings in Bioinformaticsen_US
dc.rightsPublisher Policyen_US
dc.sourcePMCen_US
dc.subjectNoncoding variantsen_US
dc.subjectPrioritizationen_US
dc.subjectFunctional scoreen_US
dc.titleWEVar: a novel statistical learning framework for predicting noncoding regulatory variantsen_US
dc.typeArticleen_US
ul.alternative.fulltexthttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8574971/en_US
Files
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
bbab189.pdf
Size:
2.23 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: