Comparison of Supervised Machine Learning and Probabilistic Approaches for Record Linkage
dc.contributor.author | McNutt, Andrew T. | |
dc.contributor.author | Grannis, Shaun J. | |
dc.contributor.author | Bo, Na | |
dc.contributor.author | Xu, Huiping | |
dc.contributor.author | Kasthurirathne, Suranga N. | |
dc.date.accessioned | 2020-07-20T19:43:22Z | |
dc.date.available | 2020-07-20T19:43:22Z | |
dc.date.issued | 2020-03-25 | |
dc.description.abstract | Record linkage is vital to prevent fragmentation of patient data. Machine learning approaches present considerable potential for record linkage. We compared the performance of three machine learning algorithms to an established probabilistic record linkage technique. Machine learning approaches exhibited results that were comparable, or statistically superior to the established probabilistic approach. It is unclear if the cost of manually reviewing datasets for supervised learning is justified by the performance improvements they yield. | en_US |
dc.identifier.citation | McNutt, A.T., Grannis, S.J., Bo, N., Xu, H., Kasthurirathne, S. N.(2020, March). Comparison of Supervised Machine Learning and Probabilistic Approaches for Record Linkage. AMIA Informatics summit 2020 Conference Proceedings. | en_US |
dc.identifier.uri | https://hdl.handle.net/1805/23283 | |
dc.language.iso | en_US | en_US |
dc.publisher | AMIA Informatics summit 2019 Conference Proceedings. | en_US |
dc.subject | record linkage | en_US |
dc.subject | patient matching | en_US |
dc.subject | Neural network | en_US |
dc.subject | Decision trees | en_US |
dc.title | Comparison of Supervised Machine Learning and Probabilistic Approaches for Record Linkage | en_US |
dc.type | Presentation | en_US |