ResearchTrend.AI
Personalized Speech Emotion Recognition in Human-Robot Interaction using Vision Transformers

16 September 2024
Ruchik Mishra
Andrew Frye
M. M. Rayguru
Dan O. Popa

Papers citing "Personalized Speech Emotion Recognition in Human-Robot Interaction using Vision Transformers"

20 / 20 papers shown
Shake-VLA: Vision-Language-Action Model-Based System for Bimanual Robotic Manipulations and Liquid Mixing
Muhammad Haris Khan
Selamawit Asfaw
Dmitrii Iarchuk
Miguel Altamirano Cabrera
Luis Moreno
Issatay Tokmurziyev
Dzmitry Tsetserukou
93
2
0
12 Jan 2025
What Does it Take to Generalize SER Model Across Datasets? A Comprehensive Benchmark
Adham Ibrahim
Shady Shehata
Ajinkya Kulkarni
Mukhtar Mohamed
Muhammad Abdul-Mageed
56
2
0
14 Jun 2024
Controlling Emotion in Text-to-Speech with Natural Language Prompts
Thomas Bott
Florian Lux
Ngoc Thang Vu
60
9
0
10 Jun 2024
Layer-Wise Analysis of Self-Supervised Acoustic Word Embeddings: A Study on Speech Emotion Recognition
Alexandra Saliba
Yuanchao Li
Ramon Sanabria
Catherine Lai
84
10
0
04 Feb 2024
Vesper: A Compact and Effective Pretrained Model for Speech Emotion Recognition
Weidong Chen
Xiaofen Xing
Peihao Chen
Xiangmin Xu
VLM
69
39
0
20 Jul 2023
Social Impressions of the NAO Robot and its Impact on Physiology
Ruchik Mishra
K. Welch
61
2
0
03 Jul 2023
SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks
Kai-Wei Chang
Wei-Cheng Tseng
Shang-Wen Li
Hung-yi Lee
78
23
0
31 Mar 2022
SepTr: Separable Transformer for Audio Spectrogram Processing
Nicolae-Cătălin Ristea
Radu Tudor Ionescu
Fahad Shahbaz Khan
ViT
68
31
0
17 Mar 2022
Self-attention fusion for audiovisual emotion recognition with incomplete data
K. Chumachenko
Alexandros Iosifidis
Moncef Gabbouj
118
40
0
26 Jan 2022
BEiT: BERT Pre-Training of Image Transformers
Hangbo Bao
Li Dong
Songhao Piao
Furu Wei
ViT
292
2,845
0
15 Jun 2021
Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings
L. Pepino
Pablo Riera
Luciana Ferrer
76
365
0
08 Apr 2021
Self-paced ensemble learning for speech and audio classification
Nicolae-Cătălin Ristea
Radu Tudor Ionescu
157
23
0
22 Mar 2021
Seen and Unseen emotional style transfer for voice conversion with a new emotional speech dataset
Kun Zhou
Berrak Sisman
Rui Liu
Haizhou Li
79
192
0
28 Oct 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
682
41,483
0
22 Oct 2020
Non-linear Neurons with Human-like Apical Dendrite Activations
Mariana-Iuliana Georgescu
Radu Tudor Ionescu
Nicolae-Cătălin Ristea
N. Sebe
81
21
0
02 Feb 2020
MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversations
Soujanya Poria
Devamanyu Hazarika
Navonil Majumder
Gautam Naik
Min Zhang
Rada Mihalcea
123
1,077
0
05 Oct 2018
On the Robustness of Speech Emotion Recognition for Human-Robot Interaction with Deep Neural Networks
Egor Lakomkin
M. Zamani
C. Weber
S. Magg
S. Wermter
57
54
0
06 Apr 2018
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
805
132,725
0
12 Jun 2017
Prototypical Networks for Few-shot Learning
Jake C. Snell
Kevin Swersky
R. Zemel
305
8,154
0
15 Mar 2017
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
833
11,952
0
09 Mar 2017