Using transfer learning-based causality extraction to mine latent factors for Sjögren’s syndrome from biomedical literature

VanSchaik, Jack T.; Jain, Palak; Rajapuri, Anushri; Cheriyan, Biju; Thyvalikakath, Thankam P.; Chakraborty, Sunandan

dc.contributor.author	VanSchaik, Jack T.
dc.contributor.author	Jain, Palak
dc.contributor.author	Rajapuri, Anushri
dc.contributor.author	Cheriyan, Biju
dc.contributor.author	Thyvalikakath, Thankam P.
dc.contributor.author	Chakraborty, Sunandan
dc.contributor.department	Human-Centered Computing, School of Informatics and Computing
dc.date.accessioned	2024-02-14T20:25:30Z
dc.date.available	2024-02-14T20:25:30Z
dc.date.issued	2023-09
dc.description.abstract	Understanding causality is a longstanding goal across many different domains. Articles, such as those published in medical journals, disseminate newly discovered knowledge that is often causal. In this paper, we use this intuition to build a model that leverages causal relations to unearth factors related to Sjögren's syndrome from biomedical literature. Sjögren's syndrome is an autoimmune disease affecting up to 3.1 million Americans. Because the illness is uncommon, its symptoms span different specialties, and many of them overlap with those of other autoimmune conditions such as rheumatoid arthritis, it is difficult for clinicians to diagnose the disease in a timely manner. Because no dedicated dataset of causal relationships built from biomedical literature exists, we propose a transfer learning-based approach, where the relationship extraction model is trained on a wide variety of datasets. We conduct an empirical analysis of numerous neural network architectures and data transfer strategies for causal relation extraction. By conducting experiments with various contextual embedding layers and architectural components, we show that an ELECTRA-based sentence-level relation extraction model generalizes better than other architectures across varying web-based sources and annotation strategies. We use this empirical observation to create a pipeline for identifying causal sentences from literature text, extracting the causal relationships from causal sentences, and building a causal network consisting of latent factors related to Sjögren's syndrome. We show that our approach can retrieve such factors with high precision and recall values. Comparative experiments show that this approach leads to a 25% improvement in retrieval F1-score compared to several state-of-the-art biomedical models, including BioBERT and Gram-CNN.
We apply this model to a corpus of Sjögren's syndrome research articles collected from PubMed to create a causal network for the disease. The proposed causal network could give clinicians a holistic knowledge base to support faster diagnosis.
dc.eprint.version	Final published version
dc.identifier.citation	VanSchaik, J. T., Jain, P., Rajapuri, A., Cheriyan, B., Thyvalikakath, T. P., & Chakraborty, S. (2023). Using transfer learning-based causality extraction to mine latent factors for Sjögren’s syndrome from biomedical literature. Heliyon, 9(9), e19265. https://doi.org/10.1016/j.heliyon.2023.e19265
dc.identifier.uri	https://hdl.handle.net/1805/38529
dc.language.iso	en_US
dc.publisher	Cell Press
dc.relation.isversionof	10.1016/j.heliyon.2023.e19265
dc.relation.journal	Heliyon
dc.rights	Attribution-NonCommercial-NoDerivatives 4.0 International	en
dc.rights.uri	https://creativecommons.org/licenses/by-nc-nd/4.0
dc.source	Publisher
dc.subject	Text mining
dc.subject	Causal relationships
dc.subject	Relationship extraction
dc.subject	Sjögren's syndrome
dc.title	Using transfer learning-based causality extraction to mine latent factors for Sjögren’s syndrome from biomedical literature
dc.type	Article

Files

Original bundle

Name: VanSchaik2023Using-CCBYNCND.pdf
Size: 1.51 MB
Format: Adobe Portable Document Format

Collections

Open Access Policy Articles
Human-Centered Computing Works
Open Access Publishing Fund