Zero-shot Learning with Minimum Instruction to Extract Social Determinants and Family History from Clinical Notes using GPT Model

dc.contributor.authorBhate, Neel Jitesh
dc.contributor.authorMittal, Ansh
dc.contributor.authorHe, Zhe
dc.contributor.authorLuo, Xiao
dc.contributor.departmentComputer Science, Luddy School of Informatics, Computing, and Engineering
dc.date.accessioned2024-10-11T08:08:23Z
dc.date.available2024-10-11T08:08:23Z
dc.date.issued2023
dc.description.abstractDemographics, social determinants of health, and family history documented in the unstructured text within the electronic health records are increasingly being studied to understand how this information can be utilized with the structured data to improve healthcare outcomes. After the GPT models were released, many studies have applied GPT models to extract this information from the narrative clinical notes. Different from the existing work, our research focuses on investigating the zero-shot learning on extracting this information together by providing minimum information to the GPT model. We utilize de-identified real-world clinical notes annotated for demographics, various social determinants, and family history information. Given that the GPT model might provide text different from the text in the original data, we explore two sets of evaluation metrics, including the traditional NER evaluation metrics and semantic similarity evaluation metrics, to completely understand the performance. Our results show that the GPT-3.5 method achieved an average of 0.975 F1 on demographics extraction, 0.615 F1 on social determinants extraction, and 0.722 F1 on family history extraction. We believe these results can be further improved through model fine-tuning or few-shots learning. Through the case studies, we also identified the limitations of the GPT models, which need to be addressed in future research.
dc.eprint.versionAuthor's manuscript
dc.identifier.citationN. J. Bhate, A. Mittal, Z. He and X. Luo, "Zero-shot Learning with Minimum Instruction to Extract Social Determinants and Family History from Clinical Notes using GPT Model," 2023 IEEE International Conference on Big Data (BigData), Sorrento, Italy, 2023, pp. 1476-1480, doi: 10.1109/BigData59044.2023.10386811.
dc.identifier.urihttps://hdl.handle.net/1805/43887
dc.language.isoen_US
dc.publisherIEEE
dc.relation.isversionof10.1109/BigData59044.2023.10386811
dc.relation.journal2023 IEEE International Conference on Big Data (BigData)
dc.rightsPublisher Policy
dc.sourcePMC
dc.subjectGPT
dc.subjectGenerative AI
dc.subjectClinical text
dc.subjectFamily history
dc.subjectSocial determinants
dc.titleZero-shot Learning with Minimum Instruction to Extract Social Determinants and Family History from Clinical Notes using GPT Model
dc.typeArticle
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Bhate2023Zero-AAM.pdf
Size:
430.62 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.04 KB
Format:
Item-specific license agreed upon to submission
Description: