- Browse by Author
Browsing by Author "Phua, Jasmin"
Now showing 1 - 2 of 2
Results Per Page
Sort Options
Item Establishing a framework for privacy-preserving record linkage among electronic health record and administrative claims databases within PCORnet®, the National Patient-Centered Clinical Research Network(BMC, 2022-10-31) Kiernan, Daniel; Carton, Thomas; Toh, Sengwee; Phua, Jasmin; Zirkle, Maryan; Louzao, Darcy; Haynes, Kevin; Weiner, Mark; Angulo, Francisco; Bailey, Charles; Bian, Jiang; Fort, Daniel; Grannis, Shaun; Krishnamurthy, Ashok Kumar; Nair, Vinit; Rivera, Pedro; Silverstein, Jonathan; Marsolo, Keith; Medicine, School of MedicineObjective: The aim of this study was to determine whether a secure, privacy-preserving record linkage (PPRL) methodology can be implemented in a scalable manner for use in a large national clinical research network. Results: We established the governance and technical capacity to support the use of PPRL across the National Patient-Centered Clinical Research Network (PCORnet®). As a pilot, four sites used the Datavant software to transform patient personally identifiable information (PII) into de-identified tokens. We queried the sites for patients with a clinical encounter in 2018 or 2019 and matched their tokens to determine whether overlap existed. We described patient overlap among the sites and generated a "deduplicated" table of patient demographic characteristics. Overlapping patients were found in 3 of the 6 site-pairs. Following deduplication, the total patient count was 3,108,515 (0.11% reduction), with the largest reduction in count for patients with an "Other/Missing" value for Sex; from 198 to 163 (17.6% reduction). The PPRL solution successfully links patients across data sources using distributed queries without directly accessing patient PII. The overlap queries and analysis performed in this pilot is being replicated across the full network to provide additional insight into patient linkages among a distributed research network.Item Privacy‐preserving record linkage across disparate institutions and datasets to enable a learning health system: The national COVID cohort collaborative (N3C) experience(Wiley, 2024-01-11) Tachinardi, Umberto; Grannis, Shaun J.; Michael, Sam G.; Misquitta, Leonie; Dahlin, Jayme; Sheikh, Usman; Kho, Abel; Phua, Jasmin; Rogovin, Sara S.; Amor, Benjamin; Choudhury, Maya; Sparks, Philip; Mannaa, Amin; Ljazouli, Saad; Saltz, Joel; Prior, Fred; Baghal, Ahmen; Gersing, Kenneth; Embi, Peter J.; Medicine, School of MedicineIntroduction: Research driven by real-world clinical data is increasingly vital to enabling learning health systems, but integrating such data from across disparate health systems is challenging. As part of the NCATS National COVID Cohort Collaborative (N3C), the N3C Data Enclave was established as a centralized repository of deidentified and harmonized COVID-19 patient data from institutions across the US. However, making this data most useful for research requires linking it with information such as mortality data, images, and viral variants. The objective of this project was to establish privacy-preserving record linkage (PPRL) methods to ensure that patient-level EHR data remains secure and private when governance-approved linkages with other datasets occur. Methods: Separate agreements and approval processes govern N3C data contribution and data access. The Linkage Honest Broker (LHB), an independent neutral party (the Regenstrief Institute), ensures data linkages are robust and secure by adding an extra layer of separation between protected health information and clinical data. The LHB's PPRL methods (including algorithms, processes, and governance) match patient records using "deidentified tokens," which are hashed combinations of identifier fields that define a match across data repositories without using patients' clear-text identifiers. Results: These methods enable three linkage functions: Deduplication, Linking Multiple Datasets, and Cohort Discovery. To date, two external repositories have been cross-linked. As of March 1, 2023, 43 sites have signed the LHB Agreement; 35 sites have sent tokens generated for 9 528 998 patients. In this initial cohort, the LHB identified 135 037 matches and 68 596 duplicates. Conclusion: This large-scale linkage study using deidentified datasets of varying characteristics established secure methods for protecting the privacy of N3C patient data when linked for research purposes. This technology has potential for use with registries for other diseases and conditions.