- Browse by Author
Browsing by Author "Grannis, Shaun J."
Now showing 1 - 10 of 50
Results Per Page
Sort Options
Item A framework for a consistent and reproducible evaluation of manual review for patient matching algorithms(Oxford University Press, 2022) Gupta, Agrayan K.; Kasthurirathne, Suranga N.; Xu, Huiping; Li, Xiaochun; Ruppert, Matthew M.; Harle, Christopher A.; Grannis, Shaun J.; Medicine, School of MedicineHealthcare systems are hampered by incomplete and fragmented patient health records. Record linkage is widely accepted as a solution to improve the quality and completeness of patient records. However, there does not exist a systematic approach for manually reviewing patient records to create gold standard record linkage data sets. We propose a robust framework for creating and evaluating manually reviewed gold standard data sets for measuring the performance of patient matching algorithms. Our 8-point approach covers data preprocessing, blocking, record adjudication, linkage evaluation, and reviewer characteristics. This framework can help record linkage method developers provide necessary transparency when creating and validating gold standard reference matching data sets. In turn, this transparency will support both the internal and external validity of recording linkage studies and improve the robustness of new record linkage strategies.Item An Adversorial Approach to Enable Re-Use of Machine Learning Models and Collaborative Research Efforts Using Synthetic Unstructured Free-Text Medical Data(IOS, 2019) Kasthurirathne, Suranga N.; Dexter, Gregory; Grannis, Shaun J.; Epidemiology, School of Public HealthWe leverage Generative Adversarial Networks (GAN) to produce synthetic free-text medical data with low re-identification risk, and apply these to replicate machine learning solutions. We trained GAN models to generate free-text cancer pathology reports. Decision models were trained using synthetic datasets reported performance metrics that were statistically similar to models trained using original test data. Our results further the use of GANs to generate synthetic data for collaborative research and re-use of machine learning models.Item Assessing the capacity of social determinants of health data to augment predictive models identifying patients in need of wraparound social services(Oxford Press, 2018-01) Kasthurirathne, Suranga N.; Vest, Joshua R.; Menachemi, Nir; Halverson, Paul K.; Grannis, Shaun J.; Health Policy and Management, School of Public HealthIntroduction A growing variety of diverse data sources is emerging to better inform health care delivery and health outcomes. We sought to evaluate the capacity for clinical, socioeconomic, and public health data sources to predict the need for various social service referrals among patients at a safety-net hospital. Materials and Methods We integrated patient clinical data and community-level data representing patients’ social determinants of health (SDH) obtained from multiple sources to build random forest decision models to predict the need for any, mental health, dietitian, social work, or other SDH service referrals. To assess the impact of SDH on improving performance, we built separate decision models using clinical and SDH determinants and clinical data only. Results Decision models predicting the need for any, mental health, and dietitian referrals yielded sensitivity, specificity, and accuracy measures ranging between 60% and 75%. Specificity and accuracy scores for social work and other SDH services ranged between 67% and 77%, while sensitivity scores were between 50% and 63%. Area under the receiver operating characteristic curve values for the decision models ranged between 70% and 78%. Models for predicting the need for any services reported positive predictive values between 65% and 73%. Positive predictive values for predicting individual outcomes were below 40%. Discussion The need for various social service referrals can be predicted with considerable accuracy using a wide range of readily available clinical and community data that measure socioeconomic and public health conditions. While the use of SDH did not result in significant performance improvements, our approach represents a novel and important application of risk predictive modeling.Item Automating Provider Reporting of Communicable Disease Cases using Health Information Technology(Office of the Vice Chancellor for Research, 2014-04-11) Dixon, Brian E.; Lai, Patrick T. S.; Kirbiyik, Uzay; Grannis, Shaun J.Introduction Disease surveillance is a core public health (PH) function, which enables PH authorities to monitor disease outbreak and develop programs and policies to reduce disease burden. To manage and adjudicate cases of suspected communicable disease, PH workers gather data elements about persons, clinical care, and providers from various clinical sources, including providers, laboratories, among others. Current processes are paper-based and often yield incomplete and untimely reporting across different diseases requiring time-consuming follow-up by PH authorities to get needed information. Health information technology (HIT) refers to a wide range of technologies used in health care settings, including electronic health records and laboratory information systems. Health information exchange (HIE) involves electronic sharing of data and information between HIT systems, including those used in PH. Previous research has shown that using HIE to electronically report laboratory results to PH can improve surveillance practice, yet there has been little utilization of HIE for improving provider-based disease reporting [1]. Methods Our study uses an intervention to electronically pre-populate provider-based communicable disease case reporting forms with existing clinical, laboratory and patient data available through one of the largest and oldest HIE infrastructures in the U.S., the Indiana Network for Patient Care. Evaluation of the intervention will be conducted utilizing mixed methods in a concurrent design framework in which qualitative methods are embedded within the quantitative methods. Quantitative data will include reporting rates, timeliness and burden and report completeness and accuracy, analyzed using interrupted time-series and other pre-post comparisons. Qualitative data regarding pre-post provider perceptions of report completeness, accuracy, and timeliness, reporting burden, data quality, benefits, utility, adoption, utilization and impact on reporting workflow will be collected using semi-structured interviews and open-ended survey items. Data will be triangulated to find convergence or agreement by cross-validating results to produce a contextualized portrayal of the facilitators and barriers to implementation and use of the intervention. Results The intervention has been implemented in seven primary care clinics in the metropolitan Indianapolis area plus one rural clinic in Edinburgh. Analysis of baseline data shows that provider-based reports vary in their completeness, yet they contain critical information not available from laboratory information systems [2]. Furthermore, PH workers access a range of sources to gather the data they need to investigate disease cases [3]. Discussion and Conclusion By applying mixed research methods and measuring context, facilitators and barriers, and individual, organizational and data quality factors that may impact adoption and utilization of the intervention, we will document whether and how the intervention streamlines provider-based manual reporting workflows, lowers barriers to reporting, increases data completeness, improves reporting timeliness and captures a greater portion of communicable disease burden in the community. Early results are promising, and continued evaluation will be completed over the next 24 months.Item Capturing COVID-19–Like Symptoms at Scale Using Banner Ads on an Online News Platform: Pilot Survey Study(JMIR, 2021-05-20) Dixon, Brian E.; Mukherjee, Sumit; Wiensch, Ashley; Gray, Mary L.; Ferres, Juan M. Lavista; Grannis, Shaun J.; Epidemiology, School of Public HealthBackground: Identifying new COVID-19 cases is challenging. Not every suspected case undergoes testing, because testing kits and other equipment are limited in many parts of the world. Yet populations increasingly use the internet to manage both home and work life during the pandemic, giving researchers mediated connections to millions of people sheltering in place. Objective: The goal of this study was to assess the feasibility of using an online news platform to recruit volunteers willing to report COVID-19–like symptoms and behaviors. Methods: An online epidemiologic survey captured COVID-19–related symptoms and behaviors from individuals recruited through banner ads offered through Microsoft News. Respondents indicated whether they were experiencing symptoms, whether they received COVID-19 testing, and whether they traveled outside of their local area. Results: A total of 87,322 respondents completed the survey across a 3-week span at the end of April 2020, with 54.3% of the responses from the United States and 32.0% from Japan. Of the total respondents, 19,631 (22.3%) reported at least one symptom associated with COVID-19. Nearly two-fifths of these respondents (39.1%) reported more than one COVID-19–like symptom. Individuals who reported being tested for COVID-19 were significantly more likely to report symptoms (47.7% vs 21.5%; P<.001). Symptom reporting rates positively correlated with per capita COVID-19 testing rates (R2=0.26; P<.001). Respondents were geographically diverse, with all states and most ZIP Codes represented. More than half of the respondents from both countries were older than 50 years of age. Conclusions: News platforms can be used to quickly recruit study participants, enabling collection of infectious disease symptoms at scale and with populations that are older than those found through social media platforms. Such platforms could enable epidemiologists and researchers to quickly assess trends in emerging infections potentially before at-risk populations present to clinics and hospitals for testing and/or treatment.Item Characterizing clinical findings of Sjögren's Disease patients in community practices using matched electronic dental-health record data(Public Library of Science, 2023-07-31) Felix Gomez, Grace Gomez; Hugenberg, Steven T.; Zunt, Susan; Patel, Jay S.; Wang, Mei; Rajapuri, Anushri Singh; Lembcke, Lauren R.; Rajendran, Divya; Smith, Jonas C.; Cheriyan, Biju; Boyd, LaKeisha J.; Eckert, George J.; Grannis, Shaun J.; Srinivasan, Mythily; Zero, Domenick T.; Thyvalikakath, Thankam P.; Cariology, Operative Dentistry and Dental Public Health, School of DentistryEstablished classifications exist to confirm Sjögren's Disease (SD) (previously referred as Sjögren's Syndrome) and recruit patients for research. However, no established classification exists for diagnosis in clinical settings causing delayed diagnosis. SD patients experience a huge dental disease burden impairing their quality of life. This study established criteria to characterize Indiana University School of Dentistry (IUSD) patients' SD based on symptoms and signs in the electronic health record (EHR) data available through the state-wide Indiana health information exchange (IHIE). Association between SD diagnosis, and comorbidities including other autoimmune conditions, and documentation of SD diagnosis in electronic dental record (EDR) were also determined. The IUSD patients' EDR were linked with their EHR data in the IHIE and queried for SD diagnostic ICD9/10 codes. The resulting cohorts' EHR clinical findings were characterized and classified using diagnostic criteria based on clinical experts' recommendations. Descriptive statistics were performed, and Chi-square tests determined the association between the different SD presentations and comorbidities including other autoimmune conditions. Eighty-three percent of IUSD patients had an EHR of which 377 patients had a SD diagnosis. They were characterized as positive (24%), uncertain (20%) and negative (56%) based on EHR clinical findings. Dry eyes and mouth were reported for 51% and positive Anti-Ro/SSA antibodies and anti-nuclear antibody (ANA) for 17% of this study cohort. One comorbidity was present in 98% and other autoimmune condition/s were present in 53% respectively. Significant differences were observed between the three SD clinical characteristics/classifications and certain medical and autoimmune conditions (p<0.05). Sixty-nine percent of patients' EDR did not mention SD, highlighting the huge gap in reporting SD during dental care. This study of SD patients diagnosed in community practices characterized three different SD clinical presentations, which can be used to generate SD study cohorts for longitudinal studies using EHR data. The results emphasize the heterogenous SD clinical presentations and the need for further research to diagnose SD early in community practice settings where most people seek care.Item Characterizing Informatics Roles and Needs of Public Health Workers: Results from the Public Health Workforce Interests and Needs Survey(Lippincott Williams & Wilkins, 2015-11) Dixon, Brian E.; McFarlane, Timothy D.; Dearth, Shandy; Grannis, Shaun J.; Gibson, P. Joseph; Department of Epidemiology, Richard M. Fairbanks School of Public HealthObjective: To characterize public health workers who specialize in informatics and to assess informatics-related aspects of the work performed by the public health workforce. Methods (Design, Setting, Participants): Using the nationally representative Public Health Workforce Interests and Needs Survey (PH WINS), we characterized and compared responses from informatics, information technology (IT), clinical and laboratory, and other public health science specialists working in state health agencies. Main Outcome Measures: Demographics, income, education, and agency size were analyzed using descriptive statistics. Weighted medians and interquartile ranges were calculated for responses pertaining to job satisfaction, workplace environment, training needs, and informatics-related competencies. Results: Of 10 246 state health workers, we identified 137 (1.3%) informatics specialists and 419 (4.1%) IT specialists. Overall, informatics specialists are younger, but share many common traits with other public health science roles, including positive attitudes toward their contributions to the mission of public health as well as job satisfaction. Informatics specialists differ demographically from IT specialists, and the 2 groups also differ with respect to salary as well as their distribution across agencies of varying size. All groups identified unmet public health and informatics competency needs, particularly limited training necessary to fully utilize technology for their work. Moreover, all groups indicated a need for greater future emphasis on leveraging electronic health information for public health functions. Conclusions: Findings from the PH WINS establish a framework and baseline measurements that can be leveraged to routinely monitor and evaluate the ineludible expansion and maturation of the public health informatics workforce and can also support assessment of the growth and evolution of informatics training needs for the broader field. Ultimately, such routine evaluations have the potential to guide local and national informatics workforce development policy.Item Comparison of Supervised Machine Learning and Probabilistic Approaches for Record Linkage(AMIA Informatics summit 2019 Conference Proceedings., 2020-03-25) McNutt, Andrew T.; Grannis, Shaun J.; Bo, Na; Xu, Huiping; Kasthurirathne, Suranga N.Record linkage is vital to prevent fragmentation of patient data. Machine learning approaches present considerable potential for record linkage. We compared the performance of three machine learning algorithms to an established probabilistic record linkage technique. Machine learning approaches exhibited results that were comparable, or statistically superior to the established probabilistic approach. It is unclear if the cost of manually reviewing datasets for supervised learning is justified by the performance improvements they yield.Item Completeness and timeliness of notifiable disease reporting: a comparison of laboratory and provider reports submitted to a large county health department(Springer Nature, 2017-06-23) Dixon, Brian E.; Zhang, Zuoyi; Lai, Patrick T. S.; Kirbiyik, Uzay; Williams, Jennifer; Hills, Rebecca; Revere, Debra; Gibson, P. Joseph; Grannis, Shaun J.; BioHealth Informatics, School of Informatics and ComputingBACKGROUND: Most public health agencies expect reporting of diseases to be initiated by hospital, laboratory or clinic staff even though so-called passive approaches are known to be burdensome for reporters and produce incomplete as well as delayed reports, which can hinder assessment of disease and delay recognition of outbreaks. In this study, we analyze patterns of reporting as well as data completeness and timeliness for traditional, passive reporting of notifiable disease by two distinct sources of information: hospital and clinic staff versus clinical laboratory staff. Reports were submitted via fax machine as well as electronic health information exchange interfaces. METHODS: Data were extracted from all submitted notifiable disease reports for seven representative diseases. Reporting rates are the proportion of known cases having a corresponding case report from a provider, a faxed laboratory report or an electronic laboratory report. Reporting rates were stratified by disease and compared using McNemar's test. For key data fields on the reports, completeness was calculated as the proportion of non-blank fields. Timeliness was measured as the difference between date of laboratory confirmed diagnosis and the date the report was received by the health department. Differences in completeness and timeliness by data source were evaluated using a generalized linear model with Pearson's goodness of fit statistic. RESULTS: We assessed 13,269 reports representing 9034 unique cases. Reporting rates varied by disease with overall rates of 19.1% for providers and 84.4% for laboratories (p < 0.001). All but three of 15 data fields in provider reports were more often complete than those fields within laboratory reports (p <0.001). Laboratory reports, whether faxed or electronically sent, were received, on average, 2.2 days after diagnosis versus a week for provider reports (p <0.001). CONCLUSIONS: Despite growth in the use of electronic methods to enhance notifiable disease reporting, there still exists much room for improvement.Item The COVID-19 health equity twindemic: Statewide epidemiologic trends of SARS-CoV-2 outcomes among racial minorities and in rural America(Cold Spring Harbor Laboratory Press, 2021) Dixon, Brian E.; Grannis, Shaun J.; Lembcke, Lauren; Roberts, Anna; Embi, Peter J.; Epidemiology, School of Public HealthBackground Early studies on COVID-19 identified unequal patterns in hospitalization and mortality in urban environments for racial and ethnic minorities. These studies were primarily single center observational studies conducted within the first few weeks or months of the pandemic. We sought to examine trends in COVID-19 morbidity and mortality over time for minority and rural populations, especially during the U.S. fall surge. Methods Statewide cohort of all adult residents in Indiana tested for SARS-CoV-2 infection between March 1 and December 31, 2020, linked to electronic health records. Primary measures were per capita rates of infection, hospitalization, and death. Age adjusted rates were calculated for multiple time periods corresponding to public health mitigation efforts. Results Morbidity and mortality increased over time with notable differences among sub-populations. Initially, per capita hospitalizations among racial minorities were 3-4 times higher than whites, and per capita deaths among urban residents were twice those of rural residents. By fall 2020, per capita hospitalizations and deaths in rural areas surpassed those of urban areas, and gaps between black/brown and white populations narrowed. Cumulative morbidity and mortality were highest among minority groups and in rural communities. Conclusions Burden of COVID-19 morbidity and mortality shifted over time, creating a twindemic involving disparities in outcomes based on race and geography. Health officials should explicitly measure disparities and adjust mitigation and vaccination strategies to protect vulnerable sub-populations with greater disease burden.