Text Mining for Social Harm and Criminal Justice Applications

dc.contributor.advisorMohler, George
dc.contributor.authorPandey, Ritika
dc.contributor.otherHasan, Mohammad Al
dc.contributor.otherMukhopadhyay, Snehasis
dc.date.accessioned2020-07-23T10:52:09Z
dc.date.available2020-07-23T10:52:09Z
dc.date.issued2020-08
dc.degree.date2020en_US
dc.degree.grantorPurdue Universityen_US
dc.degree.levelM.S.en_US
dc.descriptionIndiana University-Purdue University Indianapolis (IUPUI)en_US
dc.description.abstractIncreasing rates of social harm events and plethora of text data demands the need of employing text mining techniques not only to better understand their causes but also to develop optimal prevention strategies. In this work, we study three social harm issues: crime topic models, transitions into drug addiction and homicide investigation chronologies. Topic modeling for the categorization and analysis of crime report text allows for more nuanced categories of crime compared to official UCR categorizations. This study has important implications in hotspot policing. We investigate the extent to which topic models that improve coherence lead to higher levels of crime concentration. We further explore the transitions into drug addiction using Reddit data. We proposed a prediction model to classify the users’ transition from casual drug discussion forum to recovery drug discussion forum and the likelihood of such transitions. Through this study we offer insights into modern drug culture and provide tools with potential applications in combating opioid crises. Lastly, we present a knowledge graph based framework for homicide investigation chronologies that may aid investigators in analyzing homicide case data and also allow for post hoc analysis of key features that determine whether a homicide is ultimately solved. For this purpose we perform named entity recognition to determine witnesses, detectives and suspects from chronology, use keyword expansion to identify various evidence types and finally link these entities and evidence to construct a homicide investigation knowledge graph. We compare the performance over several choice of methodologies for these sub-tasks and analyze the association between network statistics of knowledge graph and homicide solvability.en_US
dc.identifier.urihttps://hdl.handle.net/1805/23348
dc.identifier.urihttp://dx.doi.org/10.7912/C2/2375
dc.language.isoen_USen_US
dc.subjectText miningen_US
dc.subjectMachine learningen_US
dc.subjectData miningen_US
dc.subjectComputer Scienceen_US
dc.subjectSocial Harmen_US
dc.subjectCriminal Justiceen_US
dc.subjectInformation retrievalen_US
dc.subjectNatural Language Processingen_US
dc.subjectCrime topic modelingen_US
dc.subjectaddiction analysisen_US
dc.subjecthomicide investigation analysisen_US
dc.titleText Mining for Social Harm and Criminal Justice Applicationsen_US
dc.typeThesisen
thesis.degree.disciplineComputer & Information Scienceen
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Text_Mining_for_social_harm_and_criminal_justice_applications.pdf
Size:
1.26 MB
Format:
Adobe Portable Document Format
Description:
Thesis
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: