Investigating DNA-, RNA-, and protein-based features as a means to discriminate pathogenic synonymous variants

dc.contributor.authorLivingstone, Mark
dc.contributor.authorFolkman, Lukas
dc.contributor.authorYang, Yuedong
dc.contributor.authorZhang, Ping
dc.contributor.authorMort, Matthew
dc.contributor.authorCooper, David N.
dc.contributor.authorLiu, Yunlong
dc.contributor.authorStantic, Bela
dc.contributor.authorZhou, Yaoqi
dc.contributor.departmentDepartment of Medical & Molecular Genetics, IU School of Medicineen_US
dc.date.accessioned2017-08-11T17:58:44Z
dc.date.available2017-08-11T17:58:44Z
dc.date.issued2017
dc.description.abstractSynonymous single-nucleotide variants (SNVs), although they do not alter the encoded protein sequences, have been implicated in many genetic diseases. Experimental studies indicate that synonymous SNVs can lead to changes in the secondary and tertiary structures of DNA and RNA, thereby affecting translational efficiency, cotranslational protein folding as well as the binding of DNA-/RNA-binding proteins. However, the importance of these various features in disease phenotypes is not clearly understood. Here, we have built a support vector machine (SVM) model (termed DDIG-SN) as a means to discriminate disease-causing synonymous variants. The model was trained and evaluated on nearly 900 disease-causing variants. The method achieves robust performance with the area under the receiver operating characteristic curve of 0.84 and 0.85 for protein-stratified 10-fold cross-validation and independent testing, respectively. We were able to show that the disease-causing effects in the immediate proximity to exon–intron junctions (1–3 bp) are driven by the loss of splicing motif strength, whereas the gain of splicing motif strength is the primary cause in regions further away from the splice site (4–69 bp). The method is available as a part of the DDIG server at http://sparks-lab.org/ddig.en_US
dc.eprint.versionAuthor's manuscripten_US
dc.identifier.citationLivingstone, M., Folkman, L., Yang, Y., Zhang, P., Mort, M., Cooper, D. N., Liu, Y., Stantic, B. and Zhou, Y. (), Investigating DNA, RNA and protein-based features as a means to discriminate pathogenic synonymous variants. Human Mutation. Accepted Author Manuscript. doi:10.1002/humu.23283en_US
dc.identifier.urihttps://hdl.handle.net/1805/13801
dc.language.isoenen_US
dc.publisherWileyen_US
dc.relation.isversionof10.1002/humu.23283en_US
dc.relation.journalHuman Mutationen_US
dc.rightsPublisher Policyen_US
dc.sourceAuthoren_US
dc.subjectsynonymous single-nucleotide variantsen_US
dc.subjectsame-sense varianten_US
dc.subjectsilent mutationen_US
dc.titleInvestigating DNA-, RNA-, and protein-based features as a means to discriminate pathogenic synonymous variantsen_US
dc.typeArticleen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Livingstone_2017_investigating.pdf
Size:
1.11 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.88 KB
Format:
Item-specific license agreed upon to submission
Description: