Leveraging Electronic Dental Record Data to Classify Patients Based on Their Smoking Intensity

dc.contributor.authorPatel, Jay
dc.contributor.authorSiddiqui, Zasim
dc.contributor.authorKrishnan, A.
dc.contributor.authorThyvalikakath, Thankam Paul
dc.contributor.departmentCariology, Operative Dentistry and Dental Public Health, School of Dentistryen_US
dc.date.accessioned2018-12-07T14:53:18Z
dc.date.available2018-12-07T14:53:18Z
dc.date.issued2018
dc.description.abstractBackground Smoking is an established risk factor for oral diseases and, therefore, dental clinicians routinely assess and record their patients' detailed smoking status. Researchers have successfully extracted smoking history from electronic health records (EHRs) using text mining methods. However, they could not retrieve patients' smoking intensity due to its limited availability in the EHR. The presence of detailed smoking information in the electronic dental record (EDR) often under a separate section allows retrieving this information with less preprocessing. Objective To determine patients' detailed smoking status based on smoking intensity from the EDR. Methods First, the authors created a reference standard of 3,296 unique patients’ smoking histories from the EDR that classified patients based on their smoking intensity. Next, they trained three machine learning classifiers (support vector machine, random forest, and naïve Bayes) using the training set (2,176) and evaluated performances on test set (1,120) using precision (P), recall (R), and F-measure (F). Finally, they applied the best classifier to classify smoking status from an additional 3,114 patients’ smoking histories. Results Support vector machine performed best to classify patients into smokers, nonsmokers, and unknowns (P, R, F: 98%); intermittent smoker (P: 95%, R: 98%, F: 96%); past smoker (P, R, F: 89%); light smoker (P, R, F: 87%); smokers with unknown intensity (P: 76%, R: 86%, F: 81%), and intermediate smoker (P: 90%, R: 88%, F: 89%). It performed moderately to differentiate heavy smokers (P: 90%, R: 44%, F: 60%). EDR could be a valuable source for obtaining patients’ detailed smoking information. Conclusion EDR data could serve as a valuable source for obtaining patients' detailed smoking information based on their smoking intensity that may not be readily available in the EHR.en_US
dc.eprint.versionFinal published versionen_US
dc.identifier.citationPatel, J., Siddiqui, Z., Krishnan, A., & Thyvalikakath, T. P. (2018). Leveraging Electronic Dental Record Data to Classify Patients Based on Their Smoking Intensity. Methods of Information in Medicine. https://doi.org/10.1055/s-0038-1675817en_US
dc.identifier.urihttps://hdl.handle.net/1805/17935
dc.language.isoenen_US
dc.publisherThiemeen_US
dc.relation.isversionof10.1055/s-0038-1675817en_US
dc.relation.journalMethods of Information in Medicineen_US
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 United States
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/us/
dc.sourcePublisheren_US
dc.subjectelectronic dental recorden_US
dc.subjectsmoking intensityen_US
dc.subjectinformation extractionen_US
dc.titleLeveraging Electronic Dental Record Data to Classify Patients Based on Their Smoking Intensityen_US
dc.typeArticleen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Patel_2018_leveraging.pdf
Size:
271.12 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: