A BERT-based Hate Speech Classifier from Transcribed Online Short-Form Videos

Rommel Hernandez Urbano, Jeffrey Uy Ajero, Angelic Legaspi Angeles, Maria Nikki Hacar Quintos, Joseph Marvin Regalado Imperial, Ramon Llabanes Rodriguez

Research output: Chapter or section in a book/report/conference proceedingChapter in a published conference proceeding

4 Citations (SciVal)

Abstract

With the rise of human-centric technologies such as social media platforms, the amount of hate also continues to grow proportionally with the increasing number of users worldwide. TikTok is one of the most-used social media platforms due to its feature that allows users to express themselves via creating and sharing short-form videos based on any desired topic and content. In addition, it has also become a platform for political discourse and mudslinging as users can freely express an opinion and indirectly debate with random people online. In this study, we propose the use of BERT, a complex bidirectional transformer-based model, for the task of automatic hate speech detection from speech transcribed from Tagalog TikTok videos. Results of our experiments show that a BERT-based hate speech classifier scores 61% F1. We also extended the task beyond several algorithms such as LSTM, Naïve Bayes, and Decision Tree and found out that traditional methods such as a simple Bernoulli Naïve Bayes approach remain at par with the BERT model.

Original languageEnglish
Title of host publicationICSET 2021 - 2021 5th International Conference on E-Society, E-Education and E-Technology
Place of PublicationU. S. A.
PublisherAssociation for Computing Machinery
Pages186-192
Number of pages7
ISBN (Electronic)9781450390156
DOIs
Publication statusPublished - 21 Aug 2021
Event5th International Conference on E-Society, E-Education and E-Technology, ICSET 2021 - Virtual, Online, Taiwan
Duration: 21 Aug 202123 Aug 2021

Publication series

NameACM International Conference Proceeding Series

Conference

Conference5th International Conference on E-Society, E-Education and E-Technology, ICSET 2021
Country/TerritoryTaiwan
CityVirtual, Online
Period21/08/2123/08/21

Keywords

  • Bidirectional Encoder Representations from Transformers (BERT)
  • Filipino Language
  • Hate Speech
  • TikTok

ASJC Scopus subject areas

  • Software
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'A BERT-based Hate Speech Classifier from Transcribed Online Short-Form Videos'. Together they form a unique fingerprint.

Cite this