Evaluation of Machine Learning Models for Proteoform Retention and Migration Time Prediction in Top-Down Mass Spectrometry
dc.contributor.author | Chen, Wenrong | |
dc.contributor.author | McCool, Elijah N. | |
dc.contributor.author | Sun, Liangliang | |
dc.contributor.author | Zang, Yong | |
dc.contributor.author | Ning, Xia | |
dc.contributor.author | Liu, Xiaowen | |
dc.contributor.department | BioHealth Informatics, School of Informatics and Computing | en_US |
dc.date.accessioned | 2023-07-17T17:04:45Z | |
dc.date.available | 2023-07-17T17:04:45Z | |
dc.date.issued | 2022 | |
dc.description.abstract | Reversed-phase liquid chromatography (RPLC) and capillary zone electrophoresis (CZE) are two primary proteoform separation methods in mass spectrometry (MS)-based top-down proteomics. Proteoform retention time (RT) prediction in RPLC and migration time (MT) prediction in CZE provide additional information for accurate proteoform identification and quantification. While existing methods are mainly focused on peptide RT and MT prediction in bottom-up MS, there is still a lack of methods for proteoform RT and MT prediction in top-down MS. We systematically evaluated eight machine learning models and a transfer learning method for proteoform RT prediction and five models and the transfer learning method for proteoform MT prediction. Experimental results showed that a gated recurrent unit (GRU)-based model with transfer learning achieved a high accuracy (R = 0.978) for proteoform RT prediction and that the GRU-based model and a fully connected neural network model obtained a high accuracy of R = 0.982 and 0.981 for proteoform MT prediction, respectively. | en_US |
dc.eprint.version | Final published version | en_US |
dc.identifier.citation | Chen W, McCool EN, Sun L, Zang Y, Ning X, Liu X. Evaluation of Machine Learning Models for Proteoform Retention and Migration Time Prediction in Top-Down Mass Spectrometry. J Proteome Res. 2022;21(7):1736-1747. doi:10.1021/acs.jproteome.2c00124 | en_US |
dc.identifier.uri | https://hdl.handle.net/1805/34430 | |
dc.language.iso | en_US | en_US |
dc.publisher | American Chemical Society | en_US |
dc.relation.isversionof | 10.1021/acs.jproteome.2c00124 | en_US |
dc.relation.journal | Journal of Proteome Research | en_US |
dc.rights | Attribution 4.0 International | * |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | * |
dc.source | PMC | en_US |
dc.subject | Top-down mass spectrometry | en_US |
dc.subject | Retention | en_US |
dc.subject | Migration time prediction | en_US |
dc.subject | Machine learning | en_US |
dc.title | Evaluation of Machine Learning Models for Proteoform Retention and Migration Time Prediction in Top-Down Mass Spectrometry | en_US |
dc.type | Article | en_US |