Text mining of online book reviews for non-trivial clustering of books and users

dc.contributor.advisor: Fang, Shiaofen
dc.contributor.author: Lin, Eric
dc.contributor.other: Mukhopadhyay, Snehasis
dc.contributor.other: Du, Yingzi, 1975-
dc.date.accessioned: 2013-08-14T16:06:56Z
dc.date.available: 2013-08-14T16:06:56Z
dc.date.issued: 2013-08-14
dc.degree.date: 2012 (en_US)
dc.degree.discipline: Department of Computer and Information Science (en_US)
dc.degree.grantor: Purdue University (en_US)
dc.degree.level: M.S. (en_US)
dc.description: Indiana University-Purdue University Indianapolis (IUPUI) (en_US)
dc.description.abstract: The classification of consumable media by mining relevant text for their identifying features is a subjective process. Previous attempts to perform this type of feature mining have generally been limited in scope due to limited access to user data. Many of these studies used human domain knowledge to evaluate the accuracy of features extracted using these methods. In this thesis, we mine book review text to identify nontrivial features of a set of similar books. We make comparisons between books by looking for books that share characteristics, ultimately performing clustering on the books in our data set. We use the same mining process to identify a corresponding set of characteristics in users. Finally, we evaluate the quality of our methods by examining the correlation between our similarity metric and user ratings. (en_US)
dc.identifier.uri: https://hdl.handle.net/1805/3421
dc.identifier.uri: http://dx.doi.org/10.7912/C2/2301
dc.language.iso: en_US (en_US)
dc.subject: mining (en_US)
dc.subject: data (en_US)
dc.subject: analysis (en_US)
dc.subject: recommendation (en_US)
dc.subject: sentiment (en_US)
dc.subject.lcsh: End-user computing (en_US)
dc.subject.lcsh: Web usage mining (en_US)
dc.subject.lcsh: Knowledge management (en_US)
dc.subject.lcsh: Information behavior -- Research (en_US)
dc.subject.lcsh: Cluster analysis -- Data processing (en_US)
dc.subject.lcsh: System analysis -- Data processing (en_US)
dc.subject.lcsh: Information retrieval -- Book reviews (en_US)
dc.title: Text mining of online book reviews for non-trivial clustering of books and users (en_US)
Files
Original bundle
Name: Elin thesis (FINAL with forms).pdf
Size: 1.18 MB
Format: Adobe Portable Document Format