Using transfer learning-based causality extraction to mine latent factors for Sjögren’s syndrome from biomedical literature

dc.contributor.author: VanSchaik, Jack T.
dc.contributor.author: Jain, Palak
dc.contributor.author: Rajapuri, Anushri
dc.contributor.author: Cheriyan, Biju
dc.contributor.author: Thyvalikakath, Thankam P.
dc.contributor.author: Chakraborty, Sunandan
dc.contributor.department: Human-Centered Computing, School of Informatics and Computing
dc.date.accessioned: 2024-02-14T20:25:30Z
dc.date.available: 2024-02-14T20:25:30Z
dc.date.issued: 2023-09
dc.description.abstract: Understanding causality is a longstanding goal across many different domains. Articles, such as those published in medical journals, disseminate newly discovered knowledge that is often causal. In this paper, we use this intuition to build a model that leverages causal relations to unearth factors related to Sjögren's syndrome from biomedical literature. Sjögren's syndrome is an autoimmune disease affecting up to 3.1 million Americans. Because the illness is uncommon, and because its symptoms span different specialties and overlap with those of other autoimmune conditions such as rheumatoid arthritis, it is difficult for clinicians to diagnose the disease in a timely manner. Because no dedicated dataset of causal relationships built from biomedical literature exists, we propose a transfer learning-based approach in which the relationship extraction model is trained on a wide variety of datasets. We conduct an empirical analysis of numerous neural network architectures and data transfer strategies for causal relation extraction. By conducting experiments with various contextual embedding layers and architectural components, we show that an ELECTRA-based sentence-level relation extraction model generalizes better than other architectures across varying web-based sources and annotation strategies. We use this empirical observation to create a pipeline for identifying causal sentences in literature text, extracting the causal relationships from those sentences, and building a causal network consisting of latent factors related to Sjögren's syndrome. We show that our approach can retrieve such factors with high precision and recall values. Comparative experiments show that this approach leads to a 25% improvement in retrieval F1-score compared to several state-of-the-art biomedical models, including BioBERT and Gram-CNN. We apply this model to a corpus of research articles related to Sjögren's syndrome collected from PubMed to create a causal network for Sjögren's syndrome. The proposed causal network could provide clinicians with a holistic knowledge base for faster diagnosis.
dc.eprint.version: Final published version
dc.identifier.citation: VanSchaik, J. T., Jain, P., Rajapuri, A., Cheriyan, B., Thyvalikakath, T. P., & Chakraborty, S. (2023). Using transfer learning-based causality extraction to mine latent factors for Sjögren’s syndrome from biomedical literature. Heliyon, 9(9), e19265. https://doi.org/10.1016/j.heliyon.2023.e19265
dc.identifier.uri: https://hdl.handle.net/1805/38529
dc.language.iso: en_US
dc.publisher: Cell Press
dc.relation.isversionof: 10.1016/j.heliyon.2023.e19265
dc.relation.journal: Heliyon
dc.rights: Attribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.uri: https://creativecommons.org/licenses/by-nc-nd/4.0
dc.source: Publisher
dc.subject: Text mining
dc.subject: Causal relationships
dc.subject: Relationship extraction
dc.subject: Sjögren's syndrome
dc.title: Using transfer learning-based causality extraction to mine latent factors for Sjögren’s syndrome from biomedical literature
dc.type: Article
Files
Original bundle
Name: VanSchaik2023Using-CCBYNCND.pdf
Size: 1.51 MB
Format: Adobe Portable Document Format
License bundle
Name: license.txt
Size: 1.99 KB
Format: Item-specific license agreed upon to submission