Evaluation of Machine Learning Models for Proteoform Retention and Migration Time Prediction in Top-Down Mass Spectrometry

dc.contributor.authorChen, Wenrong
dc.contributor.authorMcCool, Elijah N.
dc.contributor.authorSun, Liangliang
dc.contributor.authorZang, Yong
dc.contributor.authorNing, Xia
dc.contributor.authorLiu, Xiaowen
dc.contributor.departmentBioHealth Informatics, School of Informatics and Computingen_US
dc.date.accessioned2023-07-17T17:04:45Z
dc.date.available2023-07-17T17:04:45Z
dc.date.issued2022
dc.description.abstractReversed-phase liquid chromatography (RPLC) and capillary zone electrophoresis (CZE) are two primary proteoform separation methods in mass spectrometry (MS)-based top-down proteomics. Proteoform retention time (RT) prediction in RPLC and migration time (MT) prediction in CZE provide additional information for accurate proteoform identification and quantification. While existing methods are mainly focused on peptide RT and MT prediction in bottom-up MS, there is still a lack of methods for proteoform RT and MT prediction in top-down MS. We systematically evaluated eight machine learning models and a transfer learning method for proteoform RT prediction and five models and the transfer learning method for proteoform MT prediction. Experimental results showed that a gated recurrent unit (GRU)-based model with transfer learning achieved a high accuracy (R = 0.978) for proteoform RT prediction and that the GRU-based model and a fully connected neural network model obtained a high accuracy of R = 0.982 and 0.981 for proteoform MT prediction, respectively.en_US
dc.eprint.versionFinal published versionen_US
dc.identifier.citationChen W, McCool EN, Sun L, Zang Y, Ning X, Liu X. Evaluation of Machine Learning Models for Proteoform Retention and Migration Time Prediction in Top-Down Mass Spectrometry. J Proteome Res. 2022;21(7):1736-1747. doi:10.1021/acs.jproteome.2c00124en_US
dc.identifier.urihttps://hdl.handle.net/1805/34430
dc.language.isoen_USen_US
dc.publisherAmerican Chemical Societyen_US
dc.relation.isversionof10.1021/acs.jproteome.2c00124en_US
dc.relation.journalJournal of Proteome Researchen_US
dc.rightsAttribution 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/*
dc.sourcePMCen_US
dc.subjectTop-down mass spectrometryen_US
dc.subjectRetentionen_US
dc.subjectMigration time predictionen_US
dc.subjectMachine learningen_US
dc.titleEvaluation of Machine Learning Models for Proteoform Retention and Migration Time Prediction in Top-Down Mass Spectrometryen_US
dc.typeArticleen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
pr2c00124.pdf
Size:
1.97 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: