v1v2v3v4 (latest)

Dawn of the transformer era in speech emotion recognition: closing the valence gap

14 March 2022

Johannes Wagner

Andreas Triantafyllopoulos

Björn W. Schuller

Papers citing "Dawn of the transformer era in speech emotion recognition: closing the valence gap"

47 / 47 papers shown

Title
Learning Annotation Consensus for Continuous Emotion Recognition Ibrahim Shoer E. Erzin 19 0 0 27 May 2025
Contrastive Distillation of Emotion Knowledge from LLMs for Zero-Shot Emotion Recognition Minxue Niu E. Provost VLM 196 0 0 23 May 2025
Exploring Local Interpretable Model-Agnostic Explanations for Speech Emotion Recognition with Distribution-Shift Maja J. Hjuler Line H. Clemmensen Sneha Das FAtt 113 1 0 07 Apr 2025
autrainer: A Modular and Extensible Deep Learning Toolkit for Computer Audition Tasks Simon Rampp Andreas Triantafyllopoulos M. Milling Björn Schuller 262 0 0 16 Dec 2024
EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector Deok-Hyeon Cho Hyung-Seok Oh Seung-Bin Kim Seong-Whan Lee 113 8 0 04 Nov 2024
Emotional Dimension Control in Language Model-Based Text-to-Speech: Spanning a Broad Spectrum of Human Emotions Kun Zhou You Zhang Shengkui Zhao Hao Wang Zexu Pan ... Chongjia Ni Yukun Ma Trung Hieu Nguyen J. Yip Bin Ma 104 7 0 25 Sep 2024
Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques Yuanchao Li Peter Bell Catherine Lai 76 10 0 12 Jun 2024
Fusion approaches for emotion recognition from speech using acoustic and text-based features L. Pepino Pablo Riera Luciana Ferrer Agustin Gravano 70 49 0 27 Mar 2024
Probing Speech Emotion Recognition Transformers for Linguistic Knowledge Andreas Triantafyllopoulos Johannes Wagner H. Wierstorf Maximilian Schmitt U. Reichel F. Eyben Felix Burkhardt Björn W. Schuller 58 27 0 01 Apr 2022
Unsupervised Personalization of an Emotion Recognition System: The Unique Properties of the Externalization of Valence in Speech K. Sridhar Carlos Busso CVBM 34 22 0 19 Jan 2022
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale Arun Babu Changhan Wang Andros Tjandra Kushal Lakhotia Qiantong Xu ... Yatharth Saraf J. Pino Alexei Baevski Alexis Conneau Michael Auli SSL 110 704 0 17 Nov 2021
Are Transformers More Robust Than CNNs? Yutong Bai Jieru Mei Alan Yuille Cihang Xie ViT AAML 244 263 0 10 Nov 2021
A Fine-tuned Wav2vec 2.0/HuBERT Benchmark For Speech Emotion Recognition, Speaker Verification and Spoken Language Understanding Yingzhi Wang Abdelmoumene Boumadane A. Heba 58 151 0 04 Nov 2021
AequeVox: Automated Fairness Testing of Speech Recognition Systems Sai Sathiesh Rajan Sakshi Udeshi Sudipta Chattopadhyay 98 15 0 19 Oct 2021
Multistage linguistic conditioning of convolutional layers for speech emotion recognition Andreas Triantafyllopoulos U. Reichel Shuo Liu Simon Huber F. Eyben Björn W. Schuller 72 11 0 13 Oct 2021
Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition Li-Wei Chen Alexander I. Rudnicky VLM 61 127 0 12 Oct 2021
Multimodal Emotion Recognition with High-level Speech and Text Features M. R. Makiuchi Kuniaki Uto Koichi Shinoda 58 72 0 29 Sep 2021
Using Large Pre-Trained Models with Cross-Modal Attention for Multi-Modal Emotion Recognition Krishna D N Freshworks 56 12 0 22 Aug 2021
Improved Speech Emotion Recognition using Transfer Learning and Spectrogram Augmentation Sarala Padi S. O. Sadjadi Tianyi Zhou Ram D. Sriram 51 35 0 05 Aug 2021
The Role of Phonetic Units in Speech Emotion Recognition Jiahong Yuan Xingyu Cai Renjie Zheng Liang Huang Kenneth Church 60 15 0 02 Aug 2021
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units Wei-Ning Hsu Benjamin Bolte Yao-Hung Hubert Tsai Kushal Lakhotia Ruslan Salakhutdinov Abdel-rahman Mohamed SSL 180 2,966 0 14 Jun 2021
SUPERB: Speech processing Universal PERformance Benchmark Shu-Wen Yang Po-Han Chi Yung-Sung Chuang Cheng-I Jeff Lai Kushal Lakhotia ... Shuyan Dong Shang-Wen Li Shinji Watanabe Abdel-rahman Mohamed Hung-yi Lee SSL 108 937 0 03 May 2021
On the Impact of Word Error Rate on Acoustic-Linguistic Speech Emotion Recognition: An Update for the Deep Learning Era Shahin Amiriparian Artem Sokolov Ilhan Aslan Lukas Christ Maurice Gerczuk ... M. Milling Sandra Ottl Ilya Poduremennykh E. Shuranov Björn W. Schuller 58 17 0 20 Apr 2021
The MuSe 2021 Multimodal Sentiment Analysis Challenge: Sentiment, Emotion, Physiological-Emotion, and Stress Lukas Stappen Alice Baird Lukas Christ Lea Schumann Benjamin Sertolli Eva-Maria Messner Min Zhang Guoying Zhao Björn W. Schuller 46 88 0 14 Apr 2021
Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings L. Pepino Pablo Riera Luciana Ferrer 67 364 0 08 Apr 2021
Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training Wei-Ning Hsu Anuroop Sriram Alexei Baevski Tatiana Likhomanenko Qiantong Xu ... Jacob Kahn Ann Lee R. Collobert Gabriel Synnaeve Michael Auli SSL 78 240 0 02 Apr 2021
Contrastive Unsupervised Learning for Speech Emotion Recognition Mao Li Bo Yang Joshua Levy A. Stolcke Viktor Rozgic Spyros Matsoukas C. Papayiannis Daniel Bone Chao Wang SSL 88 49 0 12 Feb 2021
Speech Emotion Recognition with Multiscale Area Attention and Data Augmentation Mingke Xu Fan Zhang Xiaodong Cui Wei Zhang 40 52 0 03 Feb 2021
VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation Changhan Wang M. Rivière Ann Lee Anne Wu Chaitanya Talnikar Daniel Haziza Mary Williamson J. Pino Emmanuel Dupoux SSL 100 488 0 02 Jan 2021
A Survey on Visual Transformer Kai Han Yunhe Wang Hanting Chen Xinghao Chen Jianyuan Guo ... Chunjing Xu Yixing Xu Zhaohui Yang Yiman Zhang Dacheng Tao ViT 200 2,232 0 23 Dec 2020
Underspecification Presents Challenges for Credibility in Modern Machine Learning Alexander DÁmour Katherine A. Heller D. Moldovan Ben Adlam B. Alipanahi ... Kellie Webster Steve Yadlowsky T. Yun Xiaohua Zhai D. Sculley OffRL 117 687 0 06 Nov 2020
CopyPaste: An Augmentation Method for Speech Emotion Recognition R. Pappagari Jesús Villalba Piotr Żelasko Laureano Moro-Velazquez Najim Dehak 50 40 0 27 Oct 2020
What is being transferred in transfer learning? Behnam Neyshabur Hanie Sedghi Chiyuan Zhang 106 527 0 26 Aug 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations Alexei Baevski Henry Zhou Abdel-rahman Mohamed Michael Auli SSL 285 5,801 0 20 Jun 2020
Beyond Accuracy: Behavioral Testing of NLP models with CheckList Marco Tulio Ribeiro Tongshuang Wu Carlos Guestrin Sameer Singh ELM 208 1,107 0 08 May 2020
A Simple Framework for Contrastive Learning of Visual Representations Ting-Li Chen Simon Kornblith Mohammad Norouzi Geoffrey E. Hinton SSL 372 18,778 0 13 Feb 2020
PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition Qiuqiang Kong Yin Cao Turab Iqbal Yuxuan Wang Wenwu Wang Mark D. Plumbley VLM SSL 192 1,082 0 21 Dec 2019
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders Andy T. Liu Shu-Wen Yang Po-Han Chi Po-Chun Hsu Hung-yi Lee SSL 148 374 0 25 Oct 2019
Machine Learning Testing: Survey, Landscapes and Horizons Jie M. Zhang Mark Harman Lei Ma Yang Liu VLM AILaw 80 752 0 19 Jun 2019
SEWA DB: A Rich Database for Audio-Visual Emotion and Sentiment Research in the Wild Jean Kossaifi R. Walecki Yannis Panagakis Jie Shen Maximilian Schmitt ... Antoine Toisoul Bjorn Schuller Kam Star Elnar Hajiyev Maja Pantic 74 198 0 09 Jan 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding Jacob Devlin Ming-Wei Chang Kenton Lee Kristina Toutanova VLM SSL SSeg 1.8K 94,891 0 11 Oct 2018
Multimodal Speech Emotion Recognition Using Audio and Text Seunghyun Yoon Seokhyun Byun Kyomin Jung 53 295 0 10 Oct 2018
A General Framework for Fair Regression Jack K. Fitzsimons AbdulRahman Al Ali Michael A. Osborne Stephen J. Roberts FaML 117 37 0 10 Oct 2018
Polarity and Intensity: the Two Aspects of Sentiment Analysis Leimin Tian Catherine Lai Johanna D. Moore 34 36 0 04 Jul 2018
Personalized Machine Learning for Robot Perception of Affect and Engagement in Autism Therapy Ognjen Rudovic Jaeryoung Lee Miles Dai Bjorn Schuller Rosalind W. Picard 62 271 0 04 Feb 2018
Attention Is All You Need Ashish Vaswani Noam M. Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan Gomez Lukasz Kaiser Illia Polosukhin 3DV 713 131,652 0 12 Jun 2017
Improving the Robustness of Deep Neural Networks via Stability Training Stephan Zheng Yang Song Thomas Leung Ian Goodfellow OOD 50 638 0 15 Apr 2016