A MULTI-PLATFORM APPROACH USING HYBRID DEEP LEARNING MODELS FOR AUTOMATIC DETECTION OF HATE SPEECH ON SOCIAL MEDIAHate speech on online social networks is a general problem across social media platforms that has the potential of causing physical harm to t

Authors

  • YELLAMADA SIMON Department of Computer Science, Federal Polytechnic Mubi, Adamawa, Nigeria
  • BENSON YUSUF BAHA Department of Information Technology, Modibbo Adama University Yola, Adamawa, Nigeria
  • ETEMI JOSHUA GARBA Department of Computer Science, Modibbo Adama University Yola, Adamawa, Nigeria

DOI:

https://doi.org/10.56892/bima.v6i02.363

Keywords:

Hate Speech, Deep Learning, Classification, Word Embedding, Social Media,

Abstract

Hate speech on online social networks is a general problem across social media platforms that
has the potential of causing physical harm to the society. The growing number of hateful
comments on the Internet and the rate at which tweets and posts are published per second on
social media make it a challenging task to manually identify and remove the hateful comments
from such posts. Although numerous publications have proposed machine learning approaches to
detect hate speech and other antisocial online behaviours without concentrating on blocking the
hate speech from being published on social media. Similarly, prior publications on deep learning
and multi-platform approaches did not work on the topic of detecting hate speech in English
language comments on Twitter and Facebook. This paper proposed a deep learning approach
based on a hybrid of convolutional neural network (CNN) and long short-term memory (LSTM)
with pre-trained GloVe words embedding to automatically detect and block hate speech on
multiple social media platforms including Twitter and Facebook. Thus, datasets were collected
from Twitter and Facebook which were annotated as hateful and non-hateful. A set of features
were extracted from the datasets based on word embedding mechanism, and the word
embeddings were fed into our deep learning framework. The experiment was carried out as a
three independent tasks approach. The results show that our hybrid CNN-LSTM approach in
Task 1 achieved an f1-score of 0.91, Task 2 obtained an f1-score of 0.92, and Task 3 achieved an
f1-score of 0.87. Thus, there is outstanding performance in classifying text as Hate speech or
non-hate speech in all the considered metrics. Based on the findings, we conclude that hate
speech can be detected and blocked on social media before it can reach the public.

Downloads

Published

2022-08-30

How to Cite

SIMON, Y. ., YUSUF BAHA, B. ., & JOSHUA GARBA, E. (2022). A MULTI-PLATFORM APPROACH USING HYBRID DEEP LEARNING MODELS FOR AUTOMATIC DETECTION OF HATE SPEECH ON SOCIAL MEDIAHate speech on online social networks is a general problem across social media platforms that has the potential of causing physical harm to t. BIMA JOURNAL OF SCIENCE AND TECHNOLOGY (2536-6041), 6(02), 77-90. https://doi.org/10.56892/bima.v6i02.363