LEVERAGING A BILSTM-BASED EMOTION  RECOGNITION TRANSFER LEARNING MODEL TO  IDENTIFY ABUSIVE LANGUAGE PATTERNS FOR  COMPLEX PHRASAL ANALYSIS IN CYBERBULLYING  DETECTION

Matta, Marita

AUB ScholarWorks Home
→
Students Publications
→
AUB Students' Theses, Dissertations, and Projects
→
View Item

dc.contributor.advisor	Azad, Bijan
dc.contributor.advisor	Zablith, Fouad
dc.contributor.author	Matta, Marita
dc.date.accessioned	2024-02-07T13:59:26Z
dc.date.available	2024-02-07T13:59:26Z
dc.date.issued	2024-02-07
dc.date.submitted	2024-02-07
dc.identifier.uri	http://hdl.handle.net/10938/24328
dc.description.abstract	In the last two decades, the penetration of Social Networking Sites and Social Media (SNS/SM) platforms has risen to include more than one-third of the global population. However, the use of SNS/SM has produced both positive and negative results so much so that there have been calls to researchers pay immediate and far greater attention to these contradictory effects of SNS/SM capabilities. A key negative impact is the fast and significant rise of cyberbullying. Cyberbullying has emerged as a serious act on social networking sites/social media (SNS/SM) platforms in today’s digital society. Statistics underscore this whereby 42% of individuals indicate they have experienced cyberbullying, more specifically 38% of females and 54% of males have also experienced some form of bullying. This form of dysfunctional social act is expressed via aggression, harassment, and toxic behavior poses severe consequences to increasing penetration of SNS/SM. Simultaneously, there has been a great awareness of the need for moderating contents on (SNS/SM) to detect and reduce cyberbullying contents. It is also recognized that human content moderation on SNS/SM is impractical and too costly. Therefore, there is an increasing need for accurate methods of content moderation that are less reliant on human judgement and instead employ sophisticated machine learning methods. A key shortcoming of the current machine learning approaches to cyberbullying detection is that their accuracy needs to be improved significantly to be relied upon for practical deployment of non-human content moderation on SNS/SM. Problem of using advanced data analytics and AI-based techniques in cyberbullying detection in the service of reducing the latter has been extensively studied. However, existing research method have faced challenges with the issue of false positives whereby many identified instances may not be cyberbullying. For instance, “I hate you” and “I hate thinking about the future” both contain the word “hate”, yet the second sentence is a non-cyberbully phrase/sentence that contains the word hate as a metaphor for personal discomfort with the uncertainty associated with one’s future. Indeed, more accurate detection can come from a deeper contextually sensitive detection methods going beyond identifying simple hate words, whereby distinguishing 3 between harmful and innocuous expressions becomes the focal research problem. To address this, we will conduct a complex phrasal analysis to identify cyberbullying through the detection of the underlying emotion behind the seemingly toxic phrases. This approach aims to mitigate false positive predictions, which can occur when relying on hate keywords or an online hate corpus. To achieve our goal, we employ an emotion recognition transfer learning model to comprehend the underlying emotion trigger of cyberbullying and enhance its detection. To accomplish this, we use a Bi-LSTM pre-trained emotion detection model with a 92% accuracy on the training set. Then, we adapt the knowledge gained from the first model to improve learning on a different but related task which is improving cyberbullying detection by integrating elements of context based on emotion identification within the expressed phrase. Based on extensive testing and training of the model on the data, we propose that LSTM and CNN multi-label-based classification model which is exhibiting an 80% accuracy on the training as superior approach to cyberbullying detection. To evaluate and validate our results, we randomly split our dataset into training and validation sets, testing against an unseen dataset. A comparative analysis with (Gencoglu, 2021) model reveals a significant improvement in performance using McNemar’s test and the Paired T-test. Notably, our model shows an 18% improvement on (Gencoglu, 2021) model. In summary, this research has addressed the limitations inherent in keyword-based cyberbullying detection methodologies, thereby making a meaningful contribution to the field of cyberbullying detection in SNS/SM research.
dc.language.iso	en_US
dc.subject	social networking sites
dc.subject	social media
dc.subject	machine learning
dc.subject	complex phrasal analysis
dc.subject	cyberbullying detection
dc.title	LEVERAGING A BILSTM-BASED EMOTION RECOGNITION TRANSFER LEARNING MODEL TO IDENTIFY ABUSIVE LANGUAGE PATTERNS FOR COMPLEX PHRASAL ANALYSIS IN CYBERBULLYING DETECTION
dc.type	Thesis
dc.contributor.department	Suliman S. Olayan School of Business
dc.contributor.faculty	Suliman S. Olayan School of Business
dc.contributor.commembers	Taleb, Sirine
dc.contributor.commembers	Nasr, Walid
dc.contributor.degree	MSBA
dc.contributor.AUBidnumber	202370289

Files in this item

Name: Marita Matta thesis ...

Size: 2.261Mb

Format: PDF

View/Open

This item appears in the following Collection(s)

AUB Students' Theses, Dissertations, and Projects [12719]

Show simple item record

Search AUB ScholarWorks

Browse

All of AUB ScholarWorks
This Collection
- By Issue Date
- Authors
- Titles
- Subjects

My Account

Copyright Statement

All materials included in the institutional repository are protected by copyright laws and are the property of their respective copyright holders. Materials may be used for non-commercial, educational, or research purposes only, and must be cited or attributed to the original source. Permission for any other use must be obtained from the copyright holder(s) directly. The American University of Beirut Libraries does not assume responsibility for any infringement of copyright laws that may occur as a result of the use of materials in the repository. If you believe that your copyright has been infringed upon in the repository, please contact the AUB Libraries immediately.

For further information, please contact us at scholarworks@aub.edu.lb

LEVERAGING A BILSTM-BASED EMOTION RECOGNITION TRANSFER LEARNING MODEL TO IDENTIFY ABUSIVE LANGUAGE PATTERNS FOR COMPLEX PHRASAL ANALYSIS IN CYBERBULLYING DETECTION