Natural language processing accurately categorizes findings from colonoscopy and pathology reports

Date
2013
Language
American English
Embargo Lift Date
Committee Members
Degree
Degree Year
Department
Grantor
Journal Title
Journal ISSN
Volume Title
Found At
Elsevier
Can't use the file because of accessibility barriers? Contact us with the title of the item, permanent link, and specifics of your accommodation need.
Abstract

Background & aims: Little is known about the ability of natural language processing (NLP) to extract meaningful information from free-text gastroenterology reports for secondary use.

Methods: We randomly selected 500 linked colonoscopy and pathology reports from 10,798 nonsurveillance colonoscopies to train and test the NLP system. By using annotation by gastroenterologists as the reference standard, we assessed the accuracy of an open-source NLP engine that processed and extracted clinically relevant concepts. The primary outcome was the highest level of pathology. Secondary outcomes were location of the most advanced lesion, largest size of an adenoma removed, and number of adenomas removed.

Results: The NLP system identified the highest level of pathology with 98% accuracy, compared with triplicate annotation by gastroenterologists (the standard). Accuracy values for location, size, and number were 97%, 96%, and 84%, respectively.

Conclusions: The NLP can extract specific meaningful concepts with 98% accuracy. It might be developed as a method to further quantify specific quality metrics.

Description
item.page.description.tableofcontents
item.page.relation.haspart
Cite As
Imler TD, Morea J, Kahi C, Imperiale TF. Natural language processing accurately categorizes findings from colonoscopy and pathology reports. Clin Gastroenterol Hepatol. 2013;11(6):689-694. doi:10.1016/j.cgh.2012.11.035
ISSN
Publisher
Series/Report
Sponsorship
Major
Extent
Identifier
Relation
Journal
Clinical Gastroenterology and Hepatology
Source
PMC
Alternative Title
Type
Article
Number
Volume
Conference Dates
Conference Host
Conference Location
Conference Name
Conference Panel
Conference Secretariat Location
Version
Author's manuscript
Full Text Available at
This item is under embargo {{howLong}}