- Browse by Author
Browsing by Author "Bian, Jiang"
Now showing 1 - 8 of 8
Results Per Page
Sort Options
Item A large language model for electronic health records(Springer Nature, 2022-12-26) Yang, Xi; Chen, Aokun; PourNejatian, Nima; Shin, Hoo Chang; Smith, Kaleb E.; Parisien, Christopher; Compas, Colin; Martin, Cheryl; Costa, Anthony B.; Flores, Mona G.; Zhang, Ying; Magoc, Tanja; Harle, Christopher A.; Lipori, Gloria; Mitchell, Duane A.; Hogan, William R.; Shenkman, Elizabeth A.; Bian, Jiang; Wu, Yonghui; Health Policy and Management, Richard M. Fairbanks School of Public HealthThere is an increasing interest in developing artificial intelligence (AI) systems to process and interpret electronic health records (EHRs). Natural language processing (NLP) powered by pretrained language models is the key technology for medical AI systems utilizing clinical narratives. However, there are few clinical language models, the largest of which trained in the clinical domain is comparatively small at 110 million parameters (compared with billions of parameters in the general domain). It is not clear how large clinical language models with billions of parameters can help medical AI systems utilize unstructured EHRs. In this study, we develop from scratch a large clinical language model—GatorTron—using >90 billion words of text (including >82 billion words of de-identified clinical text) and systematically evaluate it on five clinical NLP tasks including clinical concept extraction, medical relation extraction, semantic textual similarity, natural language inference (NLI), and medical question answering (MQA). We examine how (1) scaling up the number of parameters and (2) scaling up the size of the training data could benefit these NLP tasks. GatorTron models scale up the clinical language model from 110 million to 8.9 billion parameters and improve five clinical NLP tasks (e.g., 9.6% and 9.5% improvement in accuracy for NLI and MQA), which can be applied to medical AI systems to improve healthcare delivery. The GatorTron models are publicly available at: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/clara/models/gatortron_ogItem A New Era of Data-Driven Cancer Research and Care: Opportunities and Challenges(American Association for Cancer Research, 2024) Gomez, Felicia; Danos, Arpad M.; Del Fiol, Guilherme; Madabhushi, Anant; Tiwari, Pallavi; McMichael, Joshua F.; Bakas, Spyridon; Bian, Jiang; Davatzikos, Christos; Fertig, Elana J.; Kalpathy-Cramer, Jayashree; Kenney, Johanna; Savova, Guergana K.; Yetisgen, Meliha; Van Allen, Eliezer M.; Warner, Jeremy L.; Prior, Fred; Griffith, Malachi; Griffith, Obi L.; Pathology and Laboratory Medicine, School of MedicinePeople diagnosed with cancer and their formal and informal caregivers are increasingly faced with a deluge of complex information, thanks to rapid advancements in the type and volume of diagnostic, prognostic, and treatment data. This commentary discusses the opportunities and challenges that the society faces as we integrate large volumes of data into regular cancer care.Item Disparities in Pediatric Patient Portal Activation and Feature Us(Oxford University Press, 2021-09-29) LeLaurin, Jennifer H.; Nguyen, Oliver T.; Thompson, Lindsay A.; Hall, Jaclyn; Bian, Jiang; Cho, Hee Deok; Acharya, Ratna; Harle, Christopher A.; Salloum, Ramzi G.; Biostatistics and Health Data Science, Richard M. Fairbanks School of Public HealthObjective: Disparities in adult patient portal adoption are well-documented; however, less is known about disparities in portal adoption in pediatrics. This study examines the prevalence and factors associated with patient portal activation and the use of specific portal features in general pediatrics. Materials and methods: We analyzed electronic health record data from 2012 to 2020 in a large academic medical center that offers both parent and adolescent portals. We summarized portal activation and use of select portal features (messaging, records access and management, appointment management, visit/admissions summaries, and interactive feature use). We used logistic regression to model factors associated with patient portal activation among all patients along with feature use and frequent feature use among ever users (ie, ≥1 portal use). Results: Among 52 713 unique patients, 39% had activated the patient portal, including 36% of patients aged 0-11, 41% of patients aged 12-17, and 62% of patients aged 18-21 years. Among activated accounts, ever use of specific features ranged from 28% for visit/admission summaries to 92% for records access and management. Adjusted analyses showed patients with activated accounts were more likely to be adolescents or young adults, white, female, privately insured, and less socioeconomically vulnerable. Individual feature use among ever users generally followed the same pattern. Conclusions: Our findings demonstrate that important disparities persist in portal adoption in pediatric populations, highlighting the need for strategies to promote equitable access to patient portals.Item Establishing a framework for privacy-preserving record linkage among electronic health record and administrative claims databases within PCORnet®, the National Patient-Centered Clinical Research Network(BMC, 2022-10-31) Kiernan, Daniel; Carton, Thomas; Toh, Sengwee; Phua, Jasmin; Zirkle, Maryan; Louzao, Darcy; Haynes, Kevin; Weiner, Mark; Angulo, Francisco; Bailey, Charles; Bian, Jiang; Fort, Daniel; Grannis, Shaun; Krishnamurthy, Ashok Kumar; Nair, Vinit; Rivera, Pedro; Silverstein, Jonathan; Marsolo, Keith; Medicine, School of MedicineObjective: The aim of this study was to determine whether a secure, privacy-preserving record linkage (PPRL) methodology can be implemented in a scalable manner for use in a large national clinical research network. Results: We established the governance and technical capacity to support the use of PPRL across the National Patient-Centered Clinical Research Network (PCORnet®). As a pilot, four sites used the Datavant software to transform patient personally identifiable information (PII) into de-identified tokens. We queried the sites for patients with a clinical encounter in 2018 or 2019 and matched their tokens to determine whether overlap existed. We described patient overlap among the sites and generated a "deduplicated" table of patient demographic characteristics. Overlapping patients were found in 3 of the 6 site-pairs. Following deduplication, the total patient count was 3,108,515 (0.11% reduction), with the largest reduction in count for patients with an "Other/Missing" value for Sex; from 198 to 163 (17.6% reduction). The PPRL solution successfully links patients across data sources using distributed queries without directly accessing patient PII. The overlap queries and analysis performed in this pilot is being replicated across the full network to provide additional insight into patient linkages among a distributed research network.Item How the clinical research community responded to the COVID-19 pandemic: an analysis of the COVID-19 clinical studies in ClinicalTrials.gov(AMIA, 2021-04-01) He, Zhe; Erdengasileng, Arslan; Luo, Xiao; Xing, Aiwen; Charness, Neil; Bian, Jiang; Computer Information and Graphics Technology, School of Engineering and TechnologyIn the past few months, a large number of clinical studies on the novel coronavirus disease (COVID-19) have been initiated worldwide to find effective therapeutics, vaccines, and preventive strategies for COVID-19. In this study, we aim to understand the landscape of COVID-19 clinical research and identify the issues that may cause recruitment difficulty or reduce study generalizability.We analyzed 3765 COVID-19 studies registered in the largest public registry—ClinicalTrials.gov, leveraging natural language processing (NLP) and using descriptive, association, and clustering analyses. We first characterized COVID-19 studies by study features such as phase and tested intervention. We then took a deep dive and analyzed their eligibility criteria to understand whether these studies: (1) considered the reported underlying health conditions that may lead to severe illnesses, and (2) excluded older adults, either explicitly or implicitly, which may reduce the generalizability of these studies to the older adults population.Our analysis included 2295 interventional studies and 1470 observational studies. Most trials did not explicitly exclude older adults with common chronic conditions. However, known risk factors such as diabetes and hypertension were considered by less than 5% of trials based on their trial description. Pregnant women were excluded by 34.9% of the studies.Most COVID-19 clinical studies included both genders and older adults. However, risk factors such as diabetes, hypertension, and pregnancy were under-represented, likely skewing the population that was sampled. A careful examination of existing COVID-19 studies can inform future COVID-19 trial design towards balanced internal validity and generalizability.Item Real-world Effectiveness of BNT162b2 Against Infection and Severe Diseases in Children and Adolescent(medRxiv, 2023-11-13) Wu, Qiong; Tong, Jiayi; Zhang, Bingyu; Zhang, Dazheng; Chen, Jiajie; Lei, Yuqing; Lu, Yiwen; Wang, Yudong; Li, Lu; Shen, Yishan; Xu, Jie; Bailey, L. Charles; Bian, Jiang; Christakis, Dimitri A.; Fitzgerald, Megan L.; Hirabayashi, Kathryn; Jhaveri, Ravi; Khaitan, Alka; Lyu, Tianchen; Rao, Suchitra; Razzaghi, Hanieh; Schwenk, Hayden T.; Wang, Fei; Witvliet, Margot I.; Tchetgen Tchetgen, Eric J.; Morris, Jeffrey S.; Forrest, Christopher B.; Chen, Yong; Pediatrics, School of MedicineBackground: The efficacy of the BNT162b2 vaccine in pediatrics was assessed by randomized trials before the Omicron variant's emergence. The long-term durability of vaccine protection in this population during the Omicron period remains limited. Objective: To assess the effectiveness of BNT162b2 in preventing infection and severe diseases with various strains of the SARS-CoV-2 virus in previously uninfected children and adolescents. Design: Comparative effectiveness research accounting for underreported vaccination in three study cohorts: adolescents (12 to 20 years) during the Delta phase, children (5 to 11 years) and adolescents (12 to 20 years) during the Omicron phase. Setting: A national collaboration of pediatric health systems (PEDSnet). Participants: 77,392 adolescents (45,007 vaccinated) in the Delta phase, 111,539 children (50,398 vaccinated) and 56,080 adolescents (21,180 vaccinated) in the Omicron period. Exposures: First dose of the BNT162b2 vaccine vs. no receipt of COVID-19 vaccine. Measurements: Outcomes of interest include documented infection, COVID-19 illness severity, admission to an intensive care unit (ICU), and cardiac complications. The effectiveness was reported as (1-relative risk)*100% with confounders balanced via propensity score stratification. Results: During the Delta period, the estimated effectiveness of BNT162b2 vaccine was 98.4% (95% CI, 98.1 to 98.7) against documented infection among adolescents, with no significant waning after receipt of the first dose. An analysis of cardiac complications did not find an increased risk after vaccination. During the Omicron period, the effectiveness against documented infection among children was estimated to be 74.3% (95% CI, 72.2 to 76.2). Higher levels of effectiveness were observed against moderate or severe COVID-19 (75.5%, 95% CI, 69.0 to 81.0) and ICU admission with COVID-19 (84.9%, 95% CI, 64.8 to 93.5). Among adolescents, the effectiveness against documented Omicron infection was 85.5% (95% CI, 83.8 to 87.1), with 84.8% (95% CI, 77.3 to 89.9) against moderate or severe COVID-19, and 91.5% (95% CI, 69.5 to 97.6)) against ICU admission with COVID-19. The effectiveness of the BNT162b2 vaccine against the Omicron variant declined after 4 months following the first dose and then stabilized. The analysis revealed a lower risk of cardiac complications in the vaccinated group during the Omicron variant period. Limitations: Observational study design and potentially undocumented infection. Conclusions: Our study suggests that BNT162b2 was effective for various COVID-19-related outcomes in children and adolescents during the Delta and Omicron periods, and there is some evidence of waning effectiveness over time.Item Study protocol for a type III hybrid effectiveness-implementation trial to evaluate scaling interoperable clinical decision support for patient-centered chronic pain management in primary care(Springer Nature, 2022-07-15) Salloum, Ramzi G.; Bilello, Lori; Bian, Jiang; Diiulio, Julie; Gonzalez Paz, Laura; Gurka, Matthew J.; Gutierrez, Maria; Hurley, Robert W.; Jones, Ross E.; Martinez‑Wittinghan, Francisco; Marcial, Laura; Masri, Ghania; McDonnell, Cara; Militello, Laura G.; Modave, François; Nguyen, Khoa; Rhodes, Bryn; Siler, Kendra; Willis, David; Harle, Christopher A.; Biostatistics and Health Data Science, Richard M. Fairbanks School of Public HealthBackground: The US continues to face public health crises related to both chronic pain and opioid overdoses. Thirty percent of Americans suffer from chronic noncancer pain at an estimated yearly cost of over $600 billion. Most patients with chronic pain turn to primary care clinicians who must choose from myriad treatment options based on relative risks and benefits, patient history, available resources, symptoms, and goals. Recently, with attention to opioid-related risks, prescribing has declined. However, clinical experts have countered with concerns that some patients for whom opioid-related benefits outweigh risks may be inappropriately discontinued from opioids. Unfortunately, primary care clinicians lack usable tools to help them partner with their patients in choosing pain treatment options that best balance risks and benefits in the context of patient history, resources, symptoms, and goals. Thus, primary care clinicians and patients would benefit from patient-centered clinical decision support (CDS) for this shared decision-making process. Methods: The objective of this 3-year project is to study the adaptation and implementation of an existing interoperable CDS tool for pain treatment shared decision making, with tailored implementation support, in new clinical settings in the OneFlorida Clinical Research Consortium. Our central hypothesis is that tailored implementation support will increase CDS adoption and shared decision making. We further hypothesize that increases in shared decision making will lead to improved patient outcomes, specifically pain and physical function. The CDS implementation will be guided by the Exploration, Preparation, Implementation, Sustainment (EPIS) framework. The evaluation will be organized by the Reach, Effectiveness, Adoption, Implementation, and Maintenance (RE-AIM) framework. We will adapt and tailor PainManager, an open source interoperable CDS tool, for implementation in primary care clinics affiliated with the OneFlorida Clinical Research Consortium. We will evaluate the effect of tailored implementation support on PainManager's adoption for pain treatment shared decision making. This evaluation will establish the feasibility and obtain preliminary data in preparation for a multi-site pragmatic trial targeting the effectiveness of PainManager and tailored implementation support on shared decision making and patient-reported pain and physical function. Discussion: This research will generate evidence on strategies for implementing interoperable CDS in new clinical settings across different types of electronic health records (EHRs). The study will also inform tailored implementation strategies to be further tested in a subsequent hybrid effectiveness-implementation trial. Together, these efforts will lead to important new technology and evidence that patients, clinicians, and health systems can use to improve care for millions of Americans who suffer from pain and other chronic conditions.Item The OneFlorida Data Trust: a centralized, translational research data infrastructure of statewide scope(Oxford University Press, 2022) Hogan, William R.; Shenkman, Elizabeth A.; Robinson, Temple; Carasquillo, Olveen; Robinson, Patricia S.; Essner, Rebecca Z.; Bian, Jiang; Lipori, Gigi; Harle, Christopher; Magoc, Tanja; Manini, Lizabeth; Mendoza, Tona; White, Sonya; Loiacono, Alex; Hall, Jackie; Nelson, Dave; Biostatistics and Health Data Science, Richard M. Fairbanks School of Public HealthThe OneFlorida Data Trust is a centralized research patient data repository created and managed by the OneFlorida Clinical Research Consortium (“OneFlorida”). It comprises structured electronic health record (EHR), administrative claims, tumor registry, death, and other data on 17.2 million individuals who received healthcare in Florida between January 2012 and the present. Ten healthcare systems in Miami, Orlando, Tampa, Jacksonville, Tallahassee, Gainesville, and rural areas of Florida contribute EHR data, covering the major metropolitan regions in Florida. Deduplication of patients is accomplished via privacy-preserving entity resolution (precision 0.97–0.99, recall 0.75), thereby linking patients’ EHR, claims, and death data. Another unique feature is the establishment of mother-baby relationships via Florida vital statistics data. Research usage has been significant, including major studies launched in the National Patient-Centered Clinical Research Network (“PCORnet”), where OneFlorida is 1 of 9 clinical research networks. The Data Trust’s robust, centralized, statewide data are a valuable and relatively unique research resource.