Natural Language Processing of Stories

dc.contributor.advisor: Mukhopadhyay, Snehasis
dc.contributor.author: Rittichier, Kaley J.
dc.contributor.other: Durresi, Arjan
dc.contributor.other: Mohler, George
dc.date.accessioned: 2022-05-27T14:11:12Z
dc.date.available: 2022-05-27T14:11:12Z
dc.date.issued: 2022-05
dc.degree.date: 2022 [en_US]
dc.degree.discipline: Computer & Information Science
dc.degree.grantor: Purdue University [en_US]
dc.degree.level: M.S. [en_US]
dc.description: Indiana University-Purdue University Indianapolis (IUPUI) [en_US]
dc.description.abstract: In this thesis, I deal with the task of computationally processing stories with a focus on multidisciplinary ends, specifically in Digital Humanities and Cultural Analytics. In the process, I collect, clean, investigate, and predict from two datasets. The first is a dataset of 2,302 open-source literary works categorized by the time period they are set in. These works were all collected from Project Gutenberg. The classification of the time period in which the work is set was discovered by collecting and inspecting Library of Congress subject classifications, Wikipedia Categories, and literary factsheets from SparkNotes. The second is a dataset of 6,991 open-source literary works categorized by the hierarchical location the work is set in; these labels were constructed from Library of Congress subject classifications and SparkNotes factsheets. These datasets are the first of their kind and can help move forward an understanding of 1) the presentation of settings in stories and 2) the effect the settings have on our understanding of the stories. [en_US]
dc.identifier.uri: https://hdl.handle.net/1805/29175
dc.identifier.uri: http://dx.doi.org/10.7912/C2/2925
dc.language.iso: en_US [en_US]
dc.rights: Attribution-NonCommercial-NoDerivatives 4.0 International [*]
dc.rights: Attribution 4.0 International [*]
dc.rights.uri: http://creativecommons.org/licenses/by/4.0/ [*]
dc.subject: Natural Language Processing [en_US]
dc.subject: Stories [en_US]
dc.subject: Story Setting [en_US]
dc.subject: Digital Humanities [en_US]
dc.subject: Cultural Analytics [en_US]
dc.title: Natural Language Processing of Stories [en_US]
dc.type: Thesis [en]
thesis.degree.discipline: Computer & Information Science [en]
Files
Original bundle
Name: Rittichier_Thesis_Final.pdf
Size: 348.52 KB
Format: Adobe Portable Document Format
Description: Thesis