Multilingual Cyberbullying Detection System

dc.contributor.advisorRaje, Rajeev R.
dc.contributor.authorPawar, Rohit S.
dc.contributor.otherTuceryan, Mihran
dc.contributor.otherDurresi, Arjan
dc.date.accessioned2019-04-25T14:09:12Z
dc.date.available2019-04-25T14:09:12Z
dc.date.issued2019-05
dc.degree.date2019en_US
dc.degree.grantorPurdue Universityen_US
dc.degree.levelM.S.en_US
dc.descriptionIndiana University-Purdue University Indianapolis (IUPUI)en_US
dc.description.abstractSince the use of social media has evolved, the ability of its users to bully others has increased. One of the prevalent forms of bullying is Cyberbullying, which occurs on the social media sites such as Facebook©, WhatsApp©, and Twitter©. The past decade has witnessed a growth in cyberbullying – is a form of bullying that occurs virtually by the use of electronic devices, such as messaging, e-mail, online gaming, social media, or through images or mails sent to a mobile. This bullying is not only limited to English language and occurs in other languages. Hence, it is of the utmost importance to detect cyberbullying in multiple languages. Since current approaches to identify cyberbullying are mostly focused on English language texts, this thesis proposes a new approach (called Multilingual Cyberbullying Detection System) for the detection of cyberbullying in multiple languages (English, Hindi, and Marathi). It uses two techniques, namely, Machine Learning-based and Lexicon-based, to classify the input data as bullying or non-bullying. The aim of this research is to not only detect cyberbullying but also provide a distributed infrastructure to detect bullying. We have developed multiple prototypes (standalone, collaborative, and cloud-based) and carried out experiments with them to detect cyberbullying on different datasets from multiple languages. The outcomes of our experiments show that the machine-learning model outperforms the lexicon-based model in all the languages. In addition, the results of our experiments show that collaboration techniques can help to improve the accuracy of a poor-performing node in the system. Finally, we show that the cloud-based configurations performed better than the local configurations.en_US
dc.identifier.urihttps://hdl.handle.net/1805/18942
dc.identifier.urihttp://dx.doi.org/10.7912/C2/2364
dc.language.isoen_USen_US
dc.rightsAttribution 3.0 United States
dc.rights.urihttp://creativecommons.org/licenses/by/3.0/us/
dc.subjectDistributed Computingen_US
dc.subjectNatural Language Processingen_US
dc.subjectMachine Learningen_US
dc.subjectIndian Languagesen_US
dc.subjectClouden_US
dc.titleMultilingual Cyberbullying Detection Systemen_US
dc.typeThesisen
thesis.degree.disciplineComputer & Information Scienceen
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
MULTILINGUAL CYBERBULLYING DETECTION SYSTEM.pdf
Size:
1.13 MB
Format:
Adobe Portable Document Format
Description:
Thesis Report
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: