Evaluating Methods for Identifying Cancer in Free-Text Pathology Reports Using Various Machine Learning and Data Preprocessing Approaches
dc.contributor.author | Kasthurirathne, Suranga Nath | |
dc.contributor.author | Dixon, Brian E. | |
dc.contributor.author | Grannis, Shaun J. | |
dc.contributor.department | Department of BioHealth Informatics, School of Informatics and Computing | en_US |
dc.date.accessioned | 2016-07-20T17:10:08Z | |
dc.date.available | 2016-07-20T17:10:08Z | |
dc.date.issued | 2015 | |
dc.description.abstract | Automated detection methods can address delays and incompleteness in cancer case reporting. Existing automated efforts are largely dependent on complex dictionaries and coded data. Using a gold standard of manually reviewed pathology reports, we evaluated the performance of alternative input formats and decision models on a convenience sample of free-text pathology reports. Results showed that the input format significantly impacted performance, and specific algorithms yielded better results for presicion, recall and accuracy. We conclude that our approach is sufficiently accurate for practical purposes and represents a generalized process. | en_US |
dc.eprint.version | Final published version | en_US |
dc.identifier.citation | Kasthurirathne, S. N., Dixon, B. E., & Grannis, S. J. (2015). Evaluating Methods for Identifying Cancer in Free-Text Pathology Reports Using Various Machine Learning and Data Preprocessing Approaches. Studies in health technology and informatics, 216, 1070-1070. | en_US |
dc.identifier.uri | https://hdl.handle.net/1805/10428 | |
dc.language.iso | en | en_US |
dc.publisher | IOS | en_US |
dc.relation.isversionof | 10.3233/978-1-61499-564-7-1070 | en_US |
dc.relation.journal | Studies in health technology and informatics | en_US |
dc.rights | Attribution-NonCommercial 3.0 United States | |
dc.rights.uri | http://creativecommons.org/licenses/by-nc/3.0/us/ | |
dc.source | Publisher | en_US |
dc.subject | public health reporting | en_US |
dc.subject | decision models | en_US |
dc.subject | ontologies | en_US |
dc.title | Evaluating Methods for Identifying Cancer in Free-Text Pathology Reports Using Various Machine Learning and Data Preprocessing Approaches | en_US |
dc.type | Article | en_US |