Monday, August 18, 2025
Science
No Result
View All Result
  • Login
  • HOME
  • SCIENCE NEWS
  • CONTACT US
  • HOME
  • SCIENCE NEWS
  • CONTACT US
No Result
View All Result
Scienmag
No Result
View All Result
Home Science News Technology and Engineering

Temporal shift for speech emotion recognition

July 15, 2024
in Technology and Engineering
Reading Time: 2 mins read
0
Temporal shift for speech emotion recognition
66
SHARES
596
VIEWS
Share on FacebookShare on Twitter
ADVERTISEMENT

Humans can guess how someone on the other end of a phone call is feeling based on how they speak as well as what they say. Speech emotion recognition is the artificial intelligence version of this ability. Seeking to address the issue of channel alignment in downstream speech emotion recognition applications, a research group at East China Normal University in Shanghai developed a temporal shift module that outperforms state-of-the-art methods in fine-tuning and feature-extraction scenarios. The group’s research was published Feb. 21 in Intelligent Computing, a Science Partner Journal.

According to the authors, “this architectural enrichment improves performance without imposing computational burdens.” They introduced three temporal shift models with different architectures: a convolutional neural network, a transformer and a long short-term memory recurrent neural network. Experiments pitted these temporal shift models against existing models on the large benchmark IEMOCAP dataset and found them to be generally more accurate, especially in the fine-tuning scenario. The temporal shift models also performed well in feature extraction when using a trainable weighted sum layer. In addition, the temporal shift models outperformed the baselines on three small datasets, RAVDESS, SAVEE and CASIA. Furthermore, temporal shift, serving as a network module, outperforms the kind of common shift operations that have been used for data augmentation.

The new temporal shift module achieves better performance by allowing the mingling of past, present and future features. Although such mingling benefits accuracy, it can also cause misalignment, which harms accuracy. The authors employed two strategies to address this trade-off: control of shift proportion and selection of shift placement. Models were tested with one half, one quarter, one eighth and one sixteenth of all channels shifted; a larger proportion allows more mingling but causes more misalignment. Two different placement models were tested: residual shift, in which the temporal shift module is located on a branch of the network and thus preserves unshifted data alongside shifted data, and in-place shift, which shifts all the data. After investigating shift proportion and shift placement, the authors chose the best-performing variants for each of the three architectures for conducting experiments against the state-of-the-art models in fine-tuning and feature extraction.

ADVERTISEMENT

Existing speech emotion recognition methods that rely on deep neural network architectures are effective, but they face the challenge of accuracy saturation. That is, their accuracy does not increase with incremental increases in the network size. A key part of the problem is that channel information and temporal information are not processed independently.

Future work can investigate the influence of the scale of the dataset and complexity of the downstream model on accuracy. Additional downstream tasks, such as audio classification, merit quantitative analysis. Moreover, it would be advantageous to make the parameters of future versions of the temporal shift model learnable to enable automatic optimization.

Share26Tweet17
Previous Post

Unveil new insights into soliton molecules: Pathways to chaos and frequency entrainment

Next Post

Protein droplets likely don’t cause Parkinson’s

Related Posts

blank
Technology and Engineering

Guaranteeing Optimal Resource Allocation: A Focus on Scientific Advancements

August 18, 2025
blank
Technology and Engineering

SwRI Innovates Spacecraft Orbital Debris Detection Technology

August 18, 2025
blank
Technology and Engineering

Can Sunlight Treat Moderate Neonatal Hyperbilirubinemia?

August 18, 2025
blank
Technology and Engineering

Researchers Unleash Wireless Innovation to Transmit Vast Amounts of Data

August 18, 2025
blank
Technology and Engineering

Settler Colonialism Undermines Food Systems in Crises

August 18, 2025
blank
Technology and Engineering

Exploring Eco-Friendly Alternatives to Formaldehyde and PFAS in Textile Finishing

August 18, 2025
Next Post
Protein droplets likely don’t cause Parkinson’s: Study deepens our understanding of neurodegenerative diseases linked to protein aggregation.

Protein droplets likely don’t cause Parkinson’s

  • Mothers who receive childcare support from maternal grandparents show more parental warmth, finds NTU Singapore study

    Mothers who receive childcare support from maternal grandparents show more parental warmth, finds NTU Singapore study

    27535 shares
    Share 11011 Tweet 6882
  • University of Seville Breaks 120-Year-Old Mystery, Revises a Key Einstein Concept

    949 shares
    Share 380 Tweet 237
  • Bee body mass, pathogens and local climate influence heat tolerance

    641 shares
    Share 256 Tweet 160
  • Researchers record first-ever images and data of a shark experiencing a boat strike

    507 shares
    Share 203 Tweet 127
  • Warm seawater speeding up melting of ‘Doomsday Glacier,’ scientists warn

    311 shares
    Share 124 Tweet 78
Science

Embark on a thrilling journey of discovery with Scienmag.com—your ultimate source for cutting-edge breakthroughs. Immerse yourself in a world where curiosity knows no limits and tomorrow’s possibilities become today’s reality!

RECENT NEWS

  • Community-Driven Strategies Enhance Family Involvement in ADHD Treatment
  • Cutting-Edge Accelerator Boosts Qubit Performance
  • Increased Depression and Anxiety Among California Jews Linked to 2023 Hamas-Related Violence
  • Study Reveals Sex Differences and Global Trends in Urolithiasis Disease Burden

Categories

  • Agriculture
  • Anthropology
  • Archaeology
  • Athmospheric
  • Biology
  • Bussines
  • Cancer
  • Chemistry
  • Climate
  • Earth Science
  • Marine
  • Mathematics
  • Medicine
  • Pediatry
  • Policy
  • Psychology & Psychiatry
  • Science Education
  • Social Science
  • Space
  • Technology and Engineering

Subscribe to Blog via Email

Enter your email address to subscribe to this blog and receive notifications of new posts by email.

Join 4,859 other subscribers

© 2025 Scienmag - Science Magazine

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • HOME
  • SCIENCE NEWS
  • CONTACT US

© 2025 Scienmag - Science Magazine

Discover more from Science

Subscribe now to keep reading and get access to the full archive.

Continue reading