Analyzing the symptoms in colorectal and breast cancer patients with or without type 2 diabetes using EHR data

dc.contributor.authorLuo, Xiao
dc.contributor.authorStorey, Susan
dc.contributor.authorGandhi, Priyanka
dc.contributor.authorZhang, Zuoyi
dc.contributor.authorMetzger, Megan
dc.contributor.authorHuang, Kun
dc.contributor.departmentComputer Information and Graphics Technology, School of Engineering and Technologyen_US
dc.date.accessioned2023-02-14T17:46:30Z
dc.date.available2023-02-14T17:46:30Z
dc.date.issued2021
dc.description.abstractThis research extracted patient-reported symptoms from free-text EHR notes of colorectal and breast cancer patients and studied the correlation of the symptoms with comorbid type 2 diabetes, race, and smoking status. An NLP framework was developed first to use UMLS MetaMap to extract all symptom terms from the 366,398 EHR clinical notes of 1694 colorectal cancer (CRC) patients and 3458 breast cancer (BC) patients. Semantic analysis and clustering algorithms were then developed to categorize all the relevant symptoms into eight symptom clusters defined by seed terms. After all the relevant symptoms were extracted from the EHR clinical notes, the frequency of the symptoms reported from colorectal cancer (CRC) and breast cancer (BC) patients over three time-periods post-chemotherapy was calculated. Logistic regression (LR) was performed with each symptom cluster as the response variable while controlling for diabetes, race, and smoking status. The results show that the CRC and BC patients with Type 2 Diabetes (T2D) were more likely to report symptoms than CRC and BC without T2D over three time-periods in the cancer trajectory. We also found that current smokers were more likely to report anxiety (CRC, BC), neuropathic symptoms (CRC, BC), anxiety (BC), and depression (BC) than non-smokers.en_US
dc.eprint.versionFinal published versionen_US
dc.identifier.citationLuo, X., Storey, S., Gandhi, P., Zhang, Z., Metzger, M., & Huang, K. (2021). Analyzing the symptoms in colorectal and breast cancer patients with or without type 2 diabetes using EHR data. Health Informatics Journal, 27(1), 14604582211000784. https://doi.org/10.1177/14604582211000785en_US
dc.identifier.urihttps://hdl.handle.net/1805/31234
dc.language.isoenen_US
dc.publisherSageen_US
dc.relation.isversionof10.1177/14604582211000785en_US
dc.relation.journalHealth Informatics Journalen_US
dc.rightsAttribution-NonCommercial 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by-nc/4.0/*
dc.sourcePublisheren_US
dc.subjectdata miningen_US
dc.subjectelectronic health recordsen_US
dc.subjectmachine learningen_US
dc.titleAnalyzing the symptoms in colorectal and breast cancer patients with or without type 2 diabetes using EHR dataen_US
dc.typeArticleen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Luo2021Analyzing-CCBYNC.pdf
Size:
889.28 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: