Multilingual Cyberbullying Detection System

Pawar, Rohit S.

Multilingual Cyberbullying Detection System

dc.contributor.advisor	Raje, Rajeev R.
dc.contributor.author	Pawar, Rohit S.
dc.contributor.other	Tuceryan, Mihran
dc.contributor.other	Durresi, Arjan
dc.date.accessioned	2019-04-25T14:09:12Z
dc.date.available	2019-04-25T14:09:12Z
dc.date.issued	2019-05
dc.degree.date	2019	en_US
dc.degree.grantor	Purdue University	en_US
dc.degree.level	M.S.	en_US
dc.description	Indiana University-Purdue University Indianapolis (IUPUI)	en_US
dc.description.abstract	Since the use of social media has evolved, the ability of its users to bully others has increased. One of the prevalent forms of bullying is Cyberbullying, which occurs on the social media sites such as Facebook©, WhatsApp©, and Twitter©. The past decade has witnessed a growth in cyberbullying – is a form of bullying that occurs virtually by the use of electronic devices, such as messaging, e-mail, online gaming, social media, or through images or mails sent to a mobile. This bullying is not only limited to English language and occurs in other languages. Hence, it is of the utmost importance to detect cyberbullying in multiple languages. Since current approaches to identify cyberbullying are mostly focused on English language texts, this thesis proposes a new approach (called Multilingual Cyberbullying Detection System) for the detection of cyberbullying in multiple languages (English, Hindi, and Marathi). It uses two techniques, namely, Machine Learning-based and Lexicon-based, to classify the input data as bullying or non-bullying. The aim of this research is to not only detect cyberbullying but also provide a distributed infrastructure to detect bullying. We have developed multiple prototypes (standalone, collaborative, and cloud-based) and carried out experiments with them to detect cyberbullying on different datasets from multiple languages. The outcomes of our experiments show that the machine-learning model outperforms the lexicon-based model in all the languages. In addition, the results of our experiments show that collaboration techniques can help to improve the accuracy of a poor-performing node in the system. Finally, we show that the cloud-based configurations performed better than the local configurations.	en_US
dc.identifier.uri	https://hdl.handle.net/1805/18942
dc.identifier.uri	http://dx.doi.org/10.7912/C2/2364
dc.language.iso	en_US	en_US
dc.rights	Attribution 3.0 United States
dc.rights.uri	https://creativecommons.org/licenses/by/3.0/us
dc.subject	Distributed Computing	en_US
dc.subject	Natural Language Processing	en_US
dc.subject	Machine Learning	en_US
dc.subject	Indian Languages	en_US
dc.subject	Cloud	en_US
dc.title	Multilingual Cyberbullying Detection System	en_US
dc.type	Thesis	en
thesis.degree.discipline	Computer & Information Science	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: MULTILINGUAL CYBERBULLYING DETECTION SYSTEM.pdf
Size:: 1.13 MB
Format:: Adobe Portable Document Format
Description:: Thesis Report

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.99 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Computer & Information Science Department Theses and Dissertations