AI-Assisted and Explainable Hate Speech Detection for Social Media Moderators – A Design Science Approach

Bunde, Enrico

Files

0125.pdf (1.04 MB)

Date

2021-01-05

Authors

Bunde, Enrico

Starting Page

1264

Abstract

To date, hate speech detection is still carried out primarily by humans, yet there is great potential in combining human expertise with automated approaches. Identified challenges, however, include low levels of agreement between humans and machines because algorithms lack knowledge of, e.g., cultural and social structures. In this work, a design science approach is used to derive design knowledge and develop an artifact through which humans are integrated into the process of detecting and evaluating hate speech. For this purpose, explainable artificial intelligence (XAI) is utilized: the artifact provides explanatory information on why the deep learning model predicted that a text does or does not contain hate. Results show that the design knowledge, instantiated in the form of a dashboard, is perceived as valuable and that XAI features increase the perceived usefulness, ease of use, and trustworthiness of the artifact as well as the intention to use it.

Keywords

Explainable Artificial Intelligence (XAI), deep learning, design science, hate speech detection

URI

http://hdl.handle.net/10125/70766

Extent

10 pages

Related To

Proceedings of the 54th Hawaii International Conference on System Sciences

Rights

Attribution-NonCommercial-NoDerivatives 4.0 International

Collections

Explainable Artificial Intelligence (XAI)

