Natural language processing accurately categorizes findings from colonoscopy and pathology reports

dc.contributor.authorImler, Timothy D.
dc.contributor.authorMorea, Justin
dc.contributor.authorKahi, Charles
dc.contributor.authorImperiale, Thomas F.
dc.contributor.departmentMedicine, School of Medicine
dc.date.accessioned2025-05-28T07:59:19Z
dc.date.available2025-05-28T07:59:19Z
dc.date.issued2013
dc.description.abstractBackground & aims: Little is known about the ability of natural language processing (NLP) to extract meaningful information from free-text gastroenterology reports for secondary use. Methods: We randomly selected 500 linked colonoscopy and pathology reports from 10,798 nonsurveillance colonoscopies to train and test the NLP system. By using annotation by gastroenterologists as the reference standard, we assessed the accuracy of an open-source NLP engine that processed and extracted clinically relevant concepts. The primary outcome was the highest level of pathology. Secondary outcomes were location of the most advanced lesion, largest size of an adenoma removed, and number of adenomas removed. Results: The NLP system identified the highest level of pathology with 98% accuracy, compared with triplicate annotation by gastroenterologists (the standard). Accuracy values for location, size, and number were 97%, 96%, and 84%, respectively. Conclusions: The NLP can extract specific meaningful concepts with 98% accuracy. It might be developed as a method to further quantify specific quality metrics.
dc.eprint.versionAuthor's manuscript
dc.identifier.citationImler TD, Morea J, Kahi C, Imperiale TF. Natural language processing accurately categorizes findings from colonoscopy and pathology reports. Clin Gastroenterol Hepatol. 2013;11(6):689-694. doi:10.1016/j.cgh.2012.11.035
dc.identifier.urihttps://hdl.handle.net/1805/48420
dc.language.isoen_US
dc.publisherElsevier
dc.relation.isversionof10.1016/j.cgh.2012.11.035
dc.relation.journalClinical Gastroenterology and Hepatology
dc.rightsPublisher Policy
dc.sourcePMC
dc.subjectAdenoma detection rate
dc.subjectColon cancer screening
dc.subjectSoftware
dc.subjectComputerized
dc.subjectNatural language procession
dc.subjectColonoscopy
dc.subjectGastroenterology
dc.subjectAdenoma detection rate
dc.subjectMedical informatics
dc.titleNatural language processing accurately categorizes findings from colonoscopy and pathology reports
dc.typeArticle
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Imler2013Natural-AAM.pdf
Size:
255.45 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.04 KB
Format:
Item-specific license agreed upon to submission
Description: