Skip to main navigation Skip to search Skip to main content

Inclusive ASR for Critical Public Services: Debiasing with Actor-Simulated Speech

Melissa Torgbi, Andrew Clayman, Jordan Speight, Joe Hirst, Harish Tayyar Madabushi

Research output: Chapter or section in a book/report/conference proceedingChapter in a published conference proceeding

Abstract

Recent advances in automatic speech recognition (ASR) have improved the overall performance of speech recognition, yet regional dialects continue to pose significant challenges. This is particularly critical in public service applications such as legal aid and housing support, where bias in ASR systems inadvertently disadvantages vulnerable groups. While fine-tuning existing models using data from the target application is a common approach to addressing bias, the sensitive nature of these services makes this approach infeasible. To overcome this and ensure inclusivity, we collected over 200 h of actor-generated simulated data, aimed at addressing regional dialects in the United Kingdom, where dialects and accents are interlinked with socioeconomic status. Through a set of rigorous experiments, including fine-tuning several models using simulated data, we demonstrate that simulated data not only improves the real-world performance of models but also provides insights into fine-tuning data configurations that are more effective in practice.

Original languageEnglish
Title of host publicationText, Speech, and Dialogue - 28th International Conference, TSD 2025, Proceedings
EditorsKamil Ekštein, Miloslav Konopík, Ondrej Pražák, František Pártl
Place of PublicationCham, Switzerland
PublisherSpringer
Pages331-342
Number of pages12
ISBN (Electronic)9783032025487
ISBN (Print)9783032025470
DOIs
Publication statusPublished - 22 Aug 2025
Event28th International Conference on Text, Speech, and Dialogue, TSD 2025 - Erlangen, Germany
Duration: 25 Aug 202528 Aug 2025

Publication series

NameLecture Notes in Computer Science
Volume16029 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference28th International Conference on Text, Speech, and Dialogue, TSD 2025
Country/TerritoryGermany
CityErlangen
Period25/08/2528/08/25

Keywords

  • debiasing automatic speech recognition
  • inclusive speech technologies
  • simulated data for debiasing

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Inclusive ASR for Critical Public Services: Debiasing with Actor-Simulated Speech'. Together they form a unique fingerprint.

Cite this