Novel Approach to Cluster Patient-Generated Data Into Actionable Topics: Case Study of a Web-Based Breast Cancer Forum

dc.contributor.authorJones, Josette
dc.contributor.authorPradhan, Meeta
dc.contributor.authorHosseini, Masoud
dc.contributor.authorKulanthaivel, Anand
dc.contributor.authorHosseini, Mahmood
dc.contributor.departmentBiohealth Informatics, School of Informatics and Computingen_US
dc.date.accessioned2019-01-09T18:48:15Z
dc.date.available2019-01-09T18:48:15Z
dc.date.issued2018
dc.description.abstractBackground: The increasing use of social media and mHealth apps has generated new opportunities for health care consumers to share information about their health and well-being. Information shared through social media contains not only medical information but also valuable information about how the survivors manage disease and recovery in the context of daily life. Objective: The objective of this study was to determine the feasibility of acquiring and modeling the topics of a major online breast cancer support forum. Breast cancer patient support forums were selected to discover the hidden, less obvious aspects of disease management and recovery. Methods: First, manual topic categorization was performed using qualitative content analysis (QCA) of each individual forum board. Second, we requested permission from the Breastcancer.org Community for a more in-depth analysis of the postings. Topic modeling was then performed using open source software Machine Learning Language Toolkit, followed by multiple linear regression (MLR) analysis to detect highly correlated topics among the different website forums. Results: QCA of the forums resulted in 20 categories of user discussion. The final topic model organized >4 million postings into 30 manageable topics. Using qualitative analysis of the topic models and statistical analysis, we grouped these 30 topics into 4 distinct clusters with similarity scores of ≥0.80; these clusters were labeled Symptoms & Diagnosis, Treatment, Financial, and Family & Friends. A clinician review confirmed the clinical significance of the topic clusters, allowing for future detection of actionable items within social media postings. To identify the most significant topics across individual forums, MLR demonstrated that 6 topics—based on the Akaike information criterion values ranging from −642.75 to −412.32—were statistically significant. Conclusions: The developed method provides an insight into the areas of interest and concern, including those not ascertainable in the clinic. Such topics included support from lay and professional caregivers and late side effects of therapy that consumers discuss in social media and may be of interest to clinicians. The developed methods and results indicate the potential of social media to inform the clinical workflow with regards to the impact of recovery on daily life. [JMIR Med Inform 2018;6(4):e45]en_US
dc.eprint.versionFinal published versionen_US
dc.identifier.citationJones, J., Pradhan, M., Hosseini, M., Kulanthaivel, A., & Hosseini, M. (2018). Novel Approach to Cluster Patient-Generated Data Into Actionable Topics: Case Study of a Web-Based Breast Cancer Forum. JMIR Medical Informatics, 6(4), e45. https://doi.org/10.2196/medinform.9162en_US
dc.identifier.urihttps://hdl.handle.net/1805/18117
dc.language.isoen_USen_US
dc.publisherJMIRen_US
dc.relation.isversionof10.2196/medinform.9162en_US
dc.relation.journalJMIR Medical Informaticsen_US
dc.rightsAttribution 3.0 United States
dc.rights.urihttps://creativecommons.org/licenses/by/3.0/us
dc.sourcePublisheren_US
dc.subjectData interpretationen_US
dc.subjectInfodemiologyen_US
dc.subjectNatural language processingen_US
dc.subjectPatient-generated informationen_US
dc.subjectSocial mediaen_US
dc.subjectStatistical analysisen_US
dc.titleNovel Approach to Cluster Patient-Generated Data Into Actionable Topics: Case Study of a Web-Based Breast Cancer Forumen_US
dc.typeArticleen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
250eed30-89de-41c9-973f-32dee40708ac.pdf
Size:
1.29 MB
Format:
Adobe Portable Document Format
Description:
Article
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: