Point process modeling of drug overdoses with heterogeneous and missing data
dc.contributor.author | Liu, Xueying | |
dc.contributor.author | Carter, Jeremy | |
dc.contributor.author | Ray, Brad | |
dc.contributor.author | Mohler, George | |
dc.contributor.department | Computer and Information Science, School of Science | |
dc.date.accessioned | 2024-03-20T07:52:54Z | |
dc.date.available | 2024-03-20T07:52:54Z | |
dc.date.issued | 2021 | |
dc.description.abstract | Opioid overdose rates have increased in the United States over the past decade and reflect a major public health crisis. Modeling and prediction of drug and opioid hotspots, where a high percentage of events fall in a small percentage of space–time, could help better focus limited social and health services. In this work we present a spatial-temporal point process model for drug overdose clustering. The data input into the model comes from two heterogeneous sources: (1) high volume emergency medical calls for service (EMS) records containing location and time but no information on the type of nonfatal overdose, and (2) fatal overdose toxicology reports from the coroner containing location and high-dimensional information from the toxicology screen on the drugs present at the time of death. We first use nonnegative matrix factorization to cluster toxicology reports into drug overdose categories, and we then develop an EM algorithm for integrating the two heterogeneous data sets, where the mark corresponding to overdose category is inferred for the EMS data and the high volume EMS data is used to more accurately predict drug overdose death hotspots. We apply the algorithm to drug overdose data from Indianapolis, showing that the point process defined on the integrated data out-performs point processes that use only coroner data (AUC improvement 0.81 to 0.85). We also investigate the extent to which overdoses are contagious, as a function of the type of overdose, while controlling for exogenous fluctuations in the background rate that might also contribute to clustering. We find that drug and opioid overdose deaths exhibit significant excitation with branching ratio ranging from 0.72 to 0.98. | |
dc.eprint.version | Author's manuscript | |
dc.identifier.citation | Liu X, Carter J, Ray B, Mohler G. Point process modeling of drug overdoses with heterogeneous and missing data. The Annals of Applied Statistics. 2021;15(1):88-101. doi:10.1214/20-AOAS1384 | |
dc.identifier.uri | https://hdl.handle.net/1805/39350 | |
dc.language.iso | en_US | |
dc.publisher | Institute of Mathematical Statistics | |
dc.relation.isversionof | 10.1214/20-AOAS1384 | |
dc.relation.journal | The Annals of Applied Statistics | |
dc.rights | Publisher Policy | |
dc.source | ArXiv | |
dc.subject | Point process | |
dc.subject | Expectation maximization algorithm | |
dc.subject | Semi-supervised learning | |
dc.subject | Nonnegative matrix factorization | |
dc.subject | Opioid overdose | |
dc.title | Point process modeling of drug overdoses with heterogeneous and missing data | |
dc.type | Article |