Machine listening: Making speech recognition systems more inclusive

WASHINGTON, April 30, 2024 – Interactions with voice technology, such as Amazon’s Alexa, Apple’s Siri, and Google Assistant, can make life easier by increasing efficiency and productivity. However, errors in generating and understanding speech during interactions are common. When using these devices, speakers often style-shift their speech from their normal patterns into a louder and slower register, called technology-directed speech.

Credit: Michelle Cohn, Zion Mengesha, Michal Lahav, and Courtney Heldreth

Research on technology-directed speech typically focuses on mainstream varieties of U.S. English without considering speaker groups that are more consistently misunderstood by technology. In JASA Express Letters, published on behalf of the Acoustical Society of America by AIP Publishing, researchers from Google Research, the University of California, Davis, and Stanford University wanted to address this gap.

One group commonly misunderstood by voice technology are individuals who speak African American English, or AAE. Since the rate of automatic speech recognition errors can be higher for AAE speakers, downstream effects of linguistic discrimination in technology may result.

“Across all automatic speech recognition systems, four out of every ten words spoken by Black men were being transcribed incorrectly,” said co-author Zion Mengesha. “This affects fairness for African American English speakers in every institution using voice technology, including health care and employment.”

“We saw an opportunity to better understand this problem by talking to Black users and understanding their emotional, behavioral, and linguistic responses when engaging with voice technology,” said co-author Courtney Heldreth.

The team designed an experiment to test how AAE speakers adapt their speech when imagining talking to a voice assistant, compared to talking to a friend, family member, or stranger. The study tested familiar human, unfamiliar human, and voice assistant-directed speech conditions by comparing speech rate and pitch variation. Study participants included 19 adults identifying as Black or African American who had experienced issues with voice technology. Each participant asked a series of questions to a voice assistant. The same questions were repeated as if speaking to a familiar person and, again, to a stranger. Each question was recorded for a total of 153 recordings.

Analysis of the recordings showed that the speakers exhibited two consistent adjustments when they were talking to voice technology compared to talking to another person: a slower rate of speech with less pitch variation (more monotone speech).

“These findings suggest that people have mental models of how to talk to technology,” said co-author Michelle Cohn. “A set ‘mode’ that they engage to be better understood, in light of disparities in speech recognition systems.”

There are other groups misunderstood by voice technology, such as second-language speakers. The researchers hope to expand the language varieties explored in human-computer interaction experiments and address barriers in technology so that it can support everyone who wants to use it.

###

The article “African American English speakers’ pitch variation and rate adjustments for imagined technological and human addressees” is authored by Michelle Cohn, Zion Mengesha, Michal Lahav, and Courtney Heldreth. The article will appear in JASA Express Letters on April 30, 2024 (DOI: 10.1121/10.0025484). After that date, it can be accessed at https://doi.org/10.1121/10.0025484.

ABOUT THE JOURNAL

JASA Express Letters is a gold open-access journal devoted to the rapid and open dissemination of important new research results and technical discussion in all fields of acoustics. It serves physical scientists, life scientists, engineers, psychologists, physiologists, architects, musicians, and speech communication specialists who wish to quickly report the results of their acoustical research in letter-sized contributions. See https://asa.scitation.org/journal/jel.

ABOUT ACOUSTICAL SOCIETY OF AMERICA

The Acoustical Society of America (ASA) is the premier international scientific society in acoustics devoted to the science and technology of sound. Its 7,000 members worldwide represent a broad spectrum of the study of acoustics. ASA publications include The Journal of the Acoustical Society of America (the world’s leading journal on acoustics), JASA Express Letters, Proceedings of Meetings on Acoustics, Acoustics Today magazine, books, and standards on acoustics. The society also holds two major scientific meetings each year. See https://acousticalsociety.org/.

###

Journal

JASA Express Letters

DOI

10.1121/10.0025484

Article Title

African American English speakers’ pitch variation and rate adjustments for imagined technological and human addressees

Article Publication Date

30-Apr-2024

Machine listening: Making speech recognition systems more inclusive

Listening to mindfulness audios during radiation improves physical, emotional side effects

Regulating cholesterol levels might be the key to improving cancer treatment

Related Posts

Bentham Science Launches International Journal of Wireless and Communication Engineering Innovation

What Long-Term Evidence Reveals About Hypoglycemia in Late-Preterm Infants

Scalable In-Process Inspection for Direct-Ink-Writing Additive Manufacturing

New study finds shampoo routines may increase household water waste

Johns Hopkins study finds deep disagreement on AI priorities for older adults

Cage-confined catalysis enables asymmetric intermolecular photocycloaddition of (benzo)furans and excited alkenes

Regulating cholesterol levels might be the key to improving cancer treatment

Mothers who receive childcare support from maternal grandparents show more parental warmth, finds NTU Singapore study

University of Seville Breaks 120-Year-Old Mystery, Revises a Key Einstein Concept

Bee body mass, pathogens and local climate influence heat tolerance

Researchers record first-ever images and data of a shark experiencing a boat strike

Groundbreaking Clinical Trial Reveals Lubiprostone Enhances Kidney Function

RECENT NEWS

Categories

Subscribe to Blog via Email

Welcome Back!

Retrieve your password