A Systematic Approach to Configuring MetaMap for Optimal Performance

dc.contributor.authorJing, Xia
dc.contributor.authorIndani, Akash
dc.contributor.authorHubig, Nina
dc.contributor.authorMin, Hua
dc.contributor.authorGong, Yang
dc.contributor.authorCimino, James J.
dc.contributor.authorSittig, Dean F.
dc.contributor.authorRennert, Lior
dc.contributor.authorRobinson, David
dc.contributor.authorBiondich, Paul
dc.contributor.authorWright, Adam
dc.contributor.authorNøhr, Christian
dc.contributor.authorLaw, Timothy
dc.contributor.authorFaxvaag, Arild
dc.contributor.authorGimbel, Ronald
dc.contributor.departmentPediatrics, School of Medicine
dc.date.accessioned2023-10-10T16:06:46Z
dc.date.available2023-10-10T16:06:46Z
dc.date.issued2022
dc.description.abstractBackground: MetaMap is a valuable tool for processing biomedical texts to identify concepts. Although MetaMap is highly configurative, configuration decisions are not straightforward. Objective: To develop a systematic, data-driven methodology for configuring MetaMap for optimal performance. Methods: MetaMap, the word2vec model, and the phrase model were used to build a pipeline. For unsupervised training, the phrase and word2vec models used abstracts related to clinical decision support as input. During testing, MetaMap was configured with the default option, one behavior option, and two behavior options. For each configuration, cosine and soft cosine similarity scores between identified entities and gold-standard terms were computed for 40 annotated abstracts (422 sentences). The similarity scores were used to calculate and compare the overall percentages of exact matches, similar matches, and missing gold-standard terms among the abstracts for each configuration. The results were manually spot-checked. The precision, recall, and F-measure (β =1) were calculated. Results: The percentages of exact matches and missing gold-standard terms were 0.6-0.79 and 0.09-0.3 for one behavior option, and 0.56-0.8 and 0.09-0.3 for two behavior options, respectively. The percentages of exact matches and missing terms for soft cosine similarity scores exceeded those for cosine similarity scores. The average precision, recall, and F-measure were 0.59, 0.82, and 0.68 for exact matches, and 1.00, 0.53, and 0.69 for missing terms, respectively. Conclusion: We demonstrated a systematic approach that provides objective and accurate evidence guiding MetaMap configurations for optimizing performance. Combining objective evidence and the current practice of using principles, experience, and intuitions outperforms a single strategy in MetaMap configurations. Our methodology, reference codes, measurements, results, and workflow are valuable references for optimizing and configuring MetaMap.
dc.eprint.versionFinal published version
dc.identifier.citationJing X, Indani A, Hubig N, et al. A Systematic Approach to Configuring MetaMap for Optimal Performance. Methods Inf Med. 2022;61(S 02):e51-e63. doi:10.1055/a-1862-0421
dc.identifier.urihttps://hdl.handle.net/1805/36238
dc.language.isoen_US
dc.publisherThieme
dc.relation.isversionof10.1055/a-1862-0421
dc.relation.journalMethods of Information in Medicine
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.sourcePMC
dc.subjectMetaMap
dc.subjectNatural language processing
dc.subjectClinical decision support system
dc.subjectConfiguration and optimization
dc.subjectPerformance
dc.titleA Systematic Approach to Configuring MetaMap for Optimal Performance
dc.typeArticle
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
10-1055-a-1862-0421.pdf
Size:
1.7 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: