Document Type

Article

Publication Date

2-15-2025

Abstract

Speech recognition has the potential to make technology more accessible to users. However, the accuracy of speech recognition remains limited for users with disabilities, including those with Down Syndrome, and the types and frequencies of recognition errors are poorly understood. This paper characterizes these problems, focusing on errors occurring when recognizing Down Syndrome speech. We analyze the transcripts from six speech recognition algorithms (Google, IBM, Otter.ai, Microsoft, AssemblyAI, OpenAI) using the audio content of 15 individuals with Down Syndrome (331 dialogues; 3428 words). Our analysis shows: (1) significant difference in speech recognition accuracy for people with Down Syndrome compared to neurotypical users; (2) the best algorithm for recognizing Down Syndrome speech is OpenAI (Word Accuracy = 67%; F1-score = 0.944); and (3) there is a prevalence of deletion errors followed by substitutions and insertions. These findings have implications for enhancing speech recognition for the next-generation voice assistants to meet the needs of users with Down Syndrome.

Comments

This article was originally published in Universal Access in the Information Society, volume 24, in 2025. https://doi.org/10.1007/s10209-025-01197-4

Copyright

The authors

Creative Commons License

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.

Share

COinS
 
 

To view the content in your browser, please download Adobe Reader or, alternately,
you may Download the file to your hard drive.

NOTE: The latest versions of Adobe Reader do not support viewing PDF files within Firefox on Mac OS and if you are using a modern (Intel) Mac, there is no official plugin for viewing PDF files within the browser window.