A practical approach for incorporating dependence among fields in probabilistic record linkage

dc.contributor.authorDaggy, Joanne K.
dc.contributor.authorXu, Huiping
dc.contributor.authorHui, Siu L.
dc.contributor.authorGamache, Roland E.
dc.contributor.authorGrannis, Shaun J.
dc.contributor.departmentBiostatistics and Health Data Science, Richard M. Fairbanks School of Public Health
dc.date.accessioned2025-05-01T12:50:13Z
dc.date.available2025-05-01T12:50:13Z
dc.date.issued2013-08-30
dc.description.abstractBackground: Methods for linking real-world healthcare data often use a latent class model, where the latent, or unknown, class is the true match status of candidate record-pairs. This commonly used model assumes that agreement patterns among multiple fields within a latent class are independent. When this assumption is violated, various approaches, including the most commonly proposed loglinear models, have been suggested to account for conditional dependence. Methods: We present a step-by-step guide to identify important dependencies between fields through a correlation residual plot and demonstrate how they can be incorporated into loglinear models for record linkage. This method is applied to healthcare data from the patient registry for a large county health department. Results: Our method could be readily implemented using standard software (with code supplied) to produce an overall better model fit as measured by BIC and deviance. Finding the most parsimonious model is known to reduce bias in parameter estimates. Conclusions: This novel approach identifies and accommodates conditional dependence in the context of record linkage. The conditional dependence model is recommended for routine use due to its flexibility for incorporating conditional dependence and easy implementation using existing software.
dc.eprint.versionFinal published version
dc.identifier.citationDaggy JK, Xu H, Hui SL, Gamache RE, Grannis SJ. A practical approach for incorporating dependence among fields in probabilistic record linkage. BMC Med Inform Decis Mak. 2013;13:97. Published 2013 Aug 30. doi:10.1186/1472-6947-13-97
dc.identifier.urihttps://hdl.handle.net/1805/47595
dc.language.isoen_US
dc.publisherSpringer Nature
dc.relation.isversionof10.1186/1472-6947-13-97
dc.relation.journalBMC Medical Informatics and Decision Making
dc.rightsAttribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.sourcePMC
dc.subjectHumans
dc.subjectMedical record linkage
dc.subjectStatistical models
dc.titleA practical approach for incorporating dependence among fields in probabilistic record linkage
dc.typeArticle
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Daggy2013Practical-CCBY.pdf
Size:
288.68 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.04 KB
Format:
Item-specific license agreed upon to submission
Description: