Rapid discrimination between deleterious and benign missense mutations in the CAGI 6 experiment

dc.contributor.author: Faraggi, Eshel
dc.contributor.author: Jernigan, Robert L.
dc.contributor.author: Kloczkowski, Andrzej
dc.contributor.department: Physics, School of Science
dc.date.accessioned: 2024-10-11T08:50:14Z
dc.date.available: 2024-10-11T08:50:14Z
dc.date.issued: 2024-08-27
dc.description.abstract: We describe the machine learning tool that we applied in the CAGI 6 experiment to predict whether single residue mutations in proteins are deleterious or benign. This tool was trained using only single sequences, i.e., without multiple sequence alignments or structural information. Instead, we used global characterizations of the protein sequence. Training and testing data for human gene mutations were obtained from ClinVar (ncbi.nlm.nih.gov/pub/ClinVar/), and for non-human gene mutations from UniProt (www.uniprot.org). Testing was done on post-training data from ClinVar. This testing yielded high AUC and Matthews correlation coefficient (MCC) for well-trained examples but low generalizability. For genes with either sparse or unbalanced training data, the prediction accuracy is poor. The resulting prediction server is available online at http://www.mamiris.com/Shoni.cagi6.
dc.eprint.version: Final published version
dc.identifier.citation: Faraggi E, Jernigan RL, Kloczkowski A. Rapid discrimination between deleterious and benign missense mutations in the CAGI 6 experiment. Hum Genomics. 2024;18(1):89. Published 2024 Aug 27. doi:10.1186/s40246-024-00655-z
dc.identifier.uri: https://hdl.handle.net/1805/43890
dc.language.iso: en_US
dc.publisher: Springer Nature
dc.relation.isversionof: 10.1186/s40246-024-00655-z
dc.relation.journal: Human Genomics
dc.rights: Attribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.uri: http://creativecommons.org/licenses/by-nc-nd/4.0/
dc.source: PMC
dc.subject: Computational biology
dc.subject: Machine learning
dc.subject: Missense mutation
dc.subject: Proteins
dc.subject: Software
dc.title: Rapid discrimination between deleterious and benign missense mutations in the CAGI 6 experiment
dc.type: Article
Files
Original bundle
Name: Faraggi2024Rapid-CCBYNCND.pdf
Size: 808.21 KB
Format: Adobe Portable Document Format
License bundle
Name: license.txt
Size: 2.04 KB
Format: Item-specific license agreed upon to submission