Dawn of the transformer era in speech emotion recognition: closing the valence gap

14 March 2022
Johannes Wagner
Andreas Triantafyllopoulos
H. Wierstorf
Maximilian Schmitt
Felix Burkhardt
F. Eyben
Björn W. Schuller

Papers citing "Dawn of the transformer era in speech emotion recognition: closing the valence gap"

47 / 47 papers shown
Learning Annotation Consensus for Continuous Emotion Recognition
Ibrahim Shoer
E. Erzin
19
0
0
27 May 2025
Contrastive Distillation of Emotion Knowledge from LLMs for Zero-Shot Emotion Recognition
Minxue Niu
E. Provost
VLM
196
0
0
23 May 2025
Exploring Local Interpretable Model-Agnostic Explanations for Speech Emotion Recognition with Distribution-Shift
Maja J. Hjuler
Line H. Clemmensen
Sneha Das
FAtt
113
1
0
07 Apr 2025
autrainer: A Modular and Extensible Deep Learning Toolkit for Computer Audition Tasks
Simon Rampp
Andreas Triantafyllopoulos
M. Milling
Björn Schuller
262
0
0
16 Dec 2024
EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector
Deok-Hyeon Cho
Hyung-Seok Oh
Seung-Bin Kim
Seong-Whan Lee
113
8
0
04 Nov 2024
Emotional Dimension Control in Language Model-Based Text-to-Speech: Spanning a Broad Spectrum of Human Emotions
Kun Zhou
You Zhang
Shengkui Zhao
Hao Wang
Zexu Pan
...
Chongjia Ni
Yukun Ma
Trung Hieu Nguyen
J. Yip
Bin Ma
104
7
0
25 Sep 2024
Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques
Yuanchao Li
Peter Bell
Catherine Lai
76
10
0
12 Jun 2024
Fusion approaches for emotion recognition from speech using acoustic and text-based features
L. Pepino
Pablo Riera
Luciana Ferrer
Agustin Gravano
70
49
0
27 Mar 2024
Probing Speech Emotion Recognition Transformers for Linguistic Knowledge
Andreas Triantafyllopoulos
Johannes Wagner
H. Wierstorf
Maximilian Schmitt
U. Reichel
F. Eyben
Felix Burkhardt
Björn W. Schuller
58
27
0
01 Apr 2022
Unsupervised Personalization of an Emotion Recognition System: The Unique Properties of the Externalization of Valence in Speech
K. Sridhar
Carlos Busso
CVBM
34
22
0
19 Jan 2022
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale
Arun Babu
Changhan Wang
Andros Tjandra
Kushal Lakhotia
Qiantong Xu
...
Yatharth Saraf
J. Pino
Alexei Baevski
Alexis Conneau
Michael Auli
SSL
110
704
0
17 Nov 2021
Are Transformers More Robust Than CNNs?
Yutong Bai
Jieru Mei
Alan Yuille
Cihang Xie
ViT AAML
244
263
0
10 Nov 2021
A Fine-tuned Wav2vec 2.0/HuBERT Benchmark For Speech Emotion Recognition, Speaker Verification and Spoken Language Understanding
Yingzhi Wang
Abdelmoumene Boumadane
A. Heba
58
151
0
04 Nov 2021
AequeVox: Automated Fairness Testing of Speech Recognition Systems
Sai Sathiesh Rajan
Sakshi Udeshi
Sudipta Chattopadhyay
98
15
0
19 Oct 2021
Multistage linguistic conditioning of convolutional layers for speech emotion recognition
Andreas Triantafyllopoulos
U. Reichel
Shuo Liu
Simon Huber
F. Eyben
Björn W. Schuller
72
11
0
13 Oct 2021
Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition
Li-Wei Chen
Alexander I. Rudnicky
VLM
61
127
0
12 Oct 2021
Multimodal Emotion Recognition with High-level Speech and Text Features
M. R. Makiuchi
Kuniaki Uto
Koichi Shinoda
58
72
0
29 Sep 2021
Using Large Pre-Trained Models with Cross-Modal Attention for Multi-Modal Emotion Recognition
Krishna D N
56
12
0
22 Aug 2021
Improved Speech Emotion Recognition using Transfer Learning and Spectrogram Augmentation
Sarala Padi
S. O. Sadjadi
Tianyi Zhou
Ram D. Sriram
51
35
0
05 Aug 2021
The Role of Phonetic Units in Speech Emotion Recognition
Jiahong Yuan
Xingyu Cai
Renjie Zheng
Liang Huang
Kenneth Church
60
15
0
02 Aug 2021
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units
Wei-Ning Hsu
Benjamin Bolte
Yao-Hung Hubert Tsai
Kushal Lakhotia
Ruslan Salakhutdinov
Abdel-rahman Mohamed
SSL
180
2,966
0
14 Jun 2021
SUPERB: Speech processing Universal PERformance Benchmark
Shu-Wen Yang
Po-Han Chi
Yung-Sung Chuang
Cheng-I Jeff Lai
Kushal Lakhotia
...
Shuyan Dong
Shang-Wen Li
Shinji Watanabe
Abdel-rahman Mohamed
Hung-yi Lee
SSL
108
937
0
03 May 2021
On the Impact of Word Error Rate on Acoustic-Linguistic Speech Emotion Recognition: An Update for the Deep Learning Era
Shahin Amiriparian
Artem Sokolov
Ilhan Aslan
Lukas Christ
Maurice Gerczuk
...
M. Milling
Sandra Ottl
Ilya Poduremennykh
E. Shuranov
Björn W. Schuller
58
17
0
20 Apr 2021
The MuSe 2021 Multimodal Sentiment Analysis Challenge: Sentiment, Emotion, Physiological-Emotion, and Stress
Lukas Stappen
Alice Baird
Lukas Christ
Lea Schumann
Benjamin Sertolli
Eva-Maria Messner
Min Zhang
Guoying Zhao
Björn W. Schuller
46
88
0
14 Apr 2021
Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings
L. Pepino
Pablo Riera
Luciana Ferrer
67
364
0
08 Apr 2021
Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
Wei-Ning Hsu
Anuroop Sriram
Alexei Baevski
Tatiana Likhomanenko
Qiantong Xu
...
Jacob Kahn
Ann Lee
R. Collobert
Gabriel Synnaeve
Michael Auli
SSL
78
240
0
02 Apr 2021
Contrastive Unsupervised Learning for Speech Emotion Recognition
Mao Li
Bo Yang
Joshua Levy
A. Stolcke
Viktor Rozgic
Spyros Matsoukas
C. Papayiannis
Daniel Bone
Chao Wang
SSL
88
49
0
12 Feb 2021
Speech Emotion Recognition with Multiscale Area Attention and Data Augmentation
Mingke Xu
Fan Zhang
Xiaodong Cui
Wei Zhang
40
52
0
03 Feb 2021
VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
Changhan Wang
M. Rivière
Ann Lee
Anne Wu
Chaitanya Talnikar
Daniel Haziza
Mary Williamson
J. Pino
Emmanuel Dupoux
SSL
100
488
0
02 Jan 2021
A Survey on Visual Transformer
Kai Han
Yunhe Wang
Hanting Chen
Xinghao Chen
Jianyuan Guo
...
Chunjing Xu
Yixing Xu
Zhaohui Yang
Yiman Zhang
Dacheng Tao
ViT
200
2,232
0
23 Dec 2020
Underspecification Presents Challenges for Credibility in Modern Machine Learning
Alexander D'Amour
Katherine A. Heller
D. Moldovan
Ben Adlam
B. Alipanahi
...
Kellie Webster
Steve Yadlowsky
T. Yun
Xiaohua Zhai
D. Sculley
OffRL
117
687
0
06 Nov 2020
CopyPaste: An Augmentation Method for Speech Emotion Recognition
R. Pappagari
Jesús Villalba
Piotr Żelasko
Laureano Moro-Velazquez
Najim Dehak
50
40
0
27 Oct 2020
What is being transferred in transfer learning?
Behnam Neyshabur
Hanie Sedghi
Chiyuan Zhang
106
527
0
26 Aug 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
285
5,801
0
20 Jun 2020
Beyond Accuracy: Behavioral Testing of NLP models with CheckList
Marco Tulio Ribeiro
Tongshuang Wu
Carlos Guestrin
Sameer Singh
ELM
208
1,107
0
08 May 2020
A Simple Framework for Contrastive Learning of Visual Representations
Ting Chen
Simon Kornblith
Mohammad Norouzi
Geoffrey E. Hinton
SSL
372
18,778
0
13 Feb 2020
PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition
Qiuqiang Kong
Yin Cao
Turab Iqbal
Yuxuan Wang
Wenwu Wang
Mark D. Plumbley
VLM SSL
192
1,082
0
21 Dec 2019
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders
Andy T. Liu
Shu-Wen Yang
Po-Han Chi
Po-Chun Hsu
Hung-yi Lee
SSL
148
374
0
25 Oct 2019
Machine Learning Testing: Survey, Landscapes and Horizons
Jie M. Zhang
Mark Harman
Lei Ma
Yang Liu
VLM AILaw
80
752
0
19 Jun 2019
SEWA DB: A Rich Database for Audio-Visual Emotion and Sentiment Research in the Wild
Jean Kossaifi
R. Walecki
Yannis Panagakis
Jie Shen
Maximilian Schmitt
...
Antoine Toisoul
Bjorn Schuller
Kam Star
Elnar Hajiyev
Maja Pantic
74
198
0
09 Jan 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM SSL SSeg
1.8K
94,891
0
11 Oct 2018
Multimodal Speech Emotion Recognition Using Audio and Text
Seunghyun Yoon
Seokhyun Byun
Kyomin Jung
53
295
0
10 Oct 2018
A General Framework for Fair Regression
Jack K. Fitzsimons
AbdulRahman Al Ali
Michael A. Osborne
Stephen J. Roberts
FaML
117
37
0
10 Oct 2018
Polarity and Intensity: the Two Aspects of Sentiment Analysis
Leimin Tian
Catherine Lai
Johanna D. Moore
34
36
0
04 Jul 2018
Personalized Machine Learning for Robot Perception of Affect and Engagement in Autism Therapy
Ognjen Rudovic
Jaeryoung Lee
Miles Dai
Bjorn Schuller
Rosalind W. Picard
62
271
0
04 Feb 2018
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
713
131,652
0
12 Jun 2017
Improving the Robustness of Deep Neural Networks via Stability Training
Stephan Zheng
Yang Song
Thomas Leung
Ian Goodfellow
OOD
50
638
0
15 Apr 2016