ScholarWorksIndianapolis
  • Communities & Collections
  • Browse ScholarWorks
  • English
  • Català
  • Čeština
  • Deutsch
  • Español
  • Français
  • Gàidhlig
  • Italiano
  • Latviešu
  • Magyar
  • Nederlands
  • Polski
  • Português
  • Português do Brasil
  • Suomi
  • Svenska
  • Türkçe
  • Tiếng Việt
  • Қазақ
  • বাংলা
  • हिंदी
  • Ελληνικά
  • Yкраї́нська
  • Log In
    or
    New user? Click here to register.Have you forgotten your password?
  1. Home
  2. Browse by Subject

Browsing by Subject "Medical record linkage"

Now showing 1 - 3 of 3
Results Per Page
Sort Options
  • Loading...
    Thumbnail Image
    Item
    A practical approach for incorporating dependence among fields in probabilistic record linkage
    (Springer Nature, 2013-08-30) Daggy, Joanne K.; Xu, Huiping; Hui, Siu L.; Gamache, Roland E.; Grannis, Shaun J.; Biostatistics and Health Data Science, Richard M. Fairbanks School of Public Health
    Background: Methods for linking real-world healthcare data often use a latent class model, where the latent, or unknown, class is the true match status of candidate record-pairs. This commonly used model assumes that agreement patterns among multiple fields within a latent class are independent. When this assumption is violated, various approaches, including the most commonly proposed loglinear models, have been suggested to account for conditional dependence. Methods: We present a step-by-step guide to identify important dependencies between fields through a correlation residual plot and demonstrate how they can be incorporated into loglinear models for record linkage. This method is applied to healthcare data from the patient registry for a large county health department. Results: Our method could be readily implemented using standard software (with code supplied) to produce an overall better model fit as measured by BIC and deviance. Finding the most parsimonious model is known to reduce bias in parameter estimates. Conclusions: This novel approach identifies and accommodates conditional dependence in the context of record linkage. The conditional dependence model is recommended for routine use due to its flexibility for incorporating conditional dependence and easy implementation using existing software.
  • Loading...
    Thumbnail Image
    Item
    Association of missing paternal demographics on infant birth certificates with perinatal risk factors for childhood obesity
    (Springer (Biomed Central Ltd.), 2016-07-14) Cheng, Erika R.; Hawkins, Summer Sherburne; Rifas-Shiman, Sheryl L.; Gillman, Matthew W.; Taveras, Elsie M.; Department of Pediatrics, IU School of Medicine
    BACKGROUND: The role of fathers in the development of obesity in their offspring remains poorly understood. We evaluated associations of missing paternal demographic information on birth certificates with perinatal risk factors for childhood obesity. METHODS: Data were from the Linked CENTURY Study, a database linking birth certificate and well-child visit data for 200,258 Massachusetts children from 1980-2008. We categorized participants based on the availability of paternal age, education, or race/ethnicity and maternal marital status on the birth certificate: (1) pregnancies missing paternal data; (2) pregnancies involving unmarried women with paternal data; and (3) pregnancies involving married women with paternal data. Using linear and logistic regression, we compared differences in smoking during pregnancy, gestational diabetes, birthweight, breastfeeding initiation, and ever recording a weight for length (WFL) ≥ the 95th percentile or crossing upwards ≥2 WFL percentiles between 0-24 months among the study groups. RESULTS: 11,989 (6.0 %) birth certificates were missing paternal data; 31,323 (15.6 %) mothers were unmarried. In adjusted analyses, missing paternal data was associated with lower birthweight (β -0.07 kg; 95 % CI: -0.08, -0.05), smoking during pregnancy (AOR 4.40; 95 % CI: 3.97, 4.87), non-initiation of breastfeeding (AOR 0.39; 95 % CI: 0.36, 0.42), and with ever having a WFL ≥ 95th percentile (AOR 1.10; 95 % CI: 1.01, 1.20). Similar associations were noted for pregnancies involving unmarried women with paternal data, but differences were less pronounced. CONCLUSIONS: Missing paternal data on the birth certificate is associated with perinatal risk factors for childhood obesity. Efforts to understand and reduce obesity risk factors in early life may need to consider paternal factors.
  • Loading...
    Thumbnail Image
    Item
    Establishing a framework for privacy-preserving record linkage among electronic health record and administrative claims databases within PCORnet®, the National Patient-Centered Clinical Research Network
    (BMC, 2022-10-31) Kiernan, Daniel; Carton, Thomas; Toh, Sengwee; Phua, Jasmin; Zirkle, Maryan; Louzao, Darcy; Haynes, Kevin; Weiner, Mark; Angulo, Francisco; Bailey, Charles; Bian, Jiang; Fort, Daniel; Grannis, Shaun; Krishnamurthy, Ashok Kumar; Nair, Vinit; Rivera, Pedro; Silverstein, Jonathan; Marsolo, Keith; Medicine, School of Medicine
    Objective: The aim of this study was to determine whether a secure, privacy-preserving record linkage (PPRL) methodology can be implemented in a scalable manner for use in a large national clinical research network. Results: We established the governance and technical capacity to support the use of PPRL across the National Patient-Centered Clinical Research Network (PCORnet®). As a pilot, four sites used the Datavant software to transform patient personally identifiable information (PII) into de-identified tokens. We queried the sites for patients with a clinical encounter in 2018 or 2019 and matched their tokens to determine whether overlap existed. We described patient overlap among the sites and generated a "deduplicated" table of patient demographic characteristics. Overlapping patients were found in 3 of the 6 site-pairs. Following deduplication, the total patient count was 3,108,515 (0.11% reduction), with the largest reduction in count for patients with an "Other/Missing" value for Sex; from 198 to 163 (17.6% reduction). The PPRL solution successfully links patients across data sources using distributed queries without directly accessing patient PII. The overlap queries and analysis performed in this pilot is being replicated across the full network to provide additional insight into patient linkages among a distributed research network.
About IU Indianapolis ScholarWorks
  • Accessibility
  • Privacy Notice
  • Copyright © 2025 The Trustees of Indiana University