Raghuveer Peri, Krishna Somandepalli, and Shrikanth Narayanan. A study of bias mitigation strategies for speaker recognition. Computer Speech & Language, 79:101481, 2023.

Download

[PDF] 

Abstract

Speaker recognition is increasingly used in several everyday applications including smart speakers, customer care centers and other speech-driven analytics. It is crucial to accurately evaluate and mitigate biases present in machine learning (ML) based speech technologies, such as speaker recognition, to ensure their inclusive adoption. ML fairness studies with respect to various demographic factors in modern speaker recognition systems are lagging compared to other human-centered applications such as face recognition. Existing studies on fairness in speaker recognition systems are largely limited to evaluating biases at specific operating points of the systems, which can lead to false expectations of fairness. Moreover, there are only a handful of bias mitigation strategies developed for speaker recognition systems. In this paper, we systematically evaluate the biases present in speaker recognition systems with respect to gender across a range of system operating points. We also propose adversarial and multi-task learning techniques to improve the fairness of these systems. We show through quantitative and qualitative evaluations that the proposed methods improve the fairness of ASV systems over baseline methods trained using data balancing techniques. We also present a fairness-utility trade-off analysis to jointly examine fairness and the overall system performance. We show that although systems trained using adversarial techniques improve fairness, they are prone to reduced utility. On the other hand, multi-task methods can improve the fairness while retaining the utility. These findings can inform the choice of bias mitigation strategies in the field of speaker recognition.

BibTeX Entry

@article{PERI2023101481,
title = {A study of bias mitigation strategies for speaker recognition},
journal = {Computer Speech & Language},
volume = {79},
pages = {101481},
year = {2023},
issn = {0885-2308},
doi = {https://doi.org/10.1016/j.csl.2022.101481},
url = {https://www.sciencedirect.com/science/article/pii/S0885230822001048},
  bib2html_rescat = {mica},
     link = {http://sail.usc.edu/publications/files/Peri-CSL2023.pdf},
author = {Raghuveer Peri and Krishna Somandepalli and Shrikanth Narayanan},
keywords = {Fairness, Bias mitigation, Fairness-utility trade-off, Speaker verification, Speaker recognition, Adversarial training, Multi task learning},
abstract = {Speaker recognition is increasingly used in several everyday applications including smart speakers, customer care centers and other speech-driven analytics. It is crucial to accurately evaluate and mitigate biases present in machine learning (ML) based speech technologies, such as speaker recognition, to ensure their inclusive adoption. ML fairness studies with respect to various demographic factors in modern speaker recognition systems are lagging compared to other human-centered applications such as face recognition. Existing studies on fairness in speaker recognition systems are largely limited to evaluating biases at specific operating points of the systems, which can lead to false expectations of fairness. Moreover, there are only a handful of bias mitigation strategies developed for speaker recognition systems. In this paper, we systematically evaluate the biases present in speaker recognition systems with respect to gender across a range of system operating points. We also propose adversarial and multi-task learning techniques to improve the fairness of these systems. We show through quantitative and qualitative evaluations that the proposed methods improve the fairness of ASV systems over baseline methods trained using data balancing techniques. We also present a fairness-utility trade-off analysis to jointly examine fairness and the overall system performance. We show that although systems trained using adversarial techniques improve fairness, they are prone to reduced utility. On the other hand, multi-task methods can improve the fairness while retaining the utility. These findings can inform the choice of bias mitigation strategies in the field of speaker recognition.}
}

Generated by bib2html.pl (written by Patrick Riley ) on Wed Oct 02, 2024 21:13:44