Multimodal Hate Speech Detection in Greek Social Media release_guiihz4zlna5bcawgckgnvsara

by Konstantinos Perifanos, Dionysis Goutsos

Published in Multimodal Technologies and Interaction by MDPI AG.

2021   p34

Abstract

Hateful and abusive speech presents a major challenge for all online social media platforms. Recent advances in Natural Language Processing and Natural Language Understanding allow for more accurate detection of hate speech in textual streams. This study presents a new multimodal approach to hate speech detection by combining Computer Vision and Natural Language processing models for abusive context detection. Our study focuses on Twitter messages and, more specifically, on hateful, xenophobic, and racist speech in Greek aimed at refugees and migrants. In our approach, we combine transfer learning and fine-tuning of Bidirectional Encoder Representations from Transformers (BERT) and Residual Neural Networks (Resnet). Our contribution includes the development of a new dataset for hate speech classification, consisting of tweet IDs, along with the code to obtain their visual appearance, as they would have been rendered in a web browser. We have also released a pre-trained Language Model trained on Greek tweets, which has been used in our experiments. We report a consistently high level of accuracy (accuracy score = 0.970, f1-score = 0.947 in our best model) in racist and xenophobic speech detection.
In application/xml+jats format

Archived Files and Locations

application/pdf  2.8 MB
file_xsr5lpp5vrh6bfxivmng4dfbke
res.mdpi.com (web)
web.archive.org (webarchive)
application/pdf  2.8 MB
file_opwiry3vajc4rjx2nlpjzfrfua
mdpi-res.com (web)
web.archive.org (webarchive)
Read Archived PDF
Preserved and Accessible
Type  article-journal
Stage   published
Date   2021-06-29
Language   en ?
Container Metadata
Open Access Publication
In DOAJ
In ISSN ROAD
In Keepers Registry
ISSN-L:  2414-4088
Work Entity
access all versions, variants, and formats of this works (eg, pre-prints)
Catalog Record
Revision: d87d7201-b5f4-4751-9e99-ae5f172797ab
API URL: JSON