Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2207.13965
Cited By
Extending RNN-T-based speech recognition systems with emotion and language classification
28 July 2022
Zvi Kons
Hagai Aronowitz
E. Morais
Matheus Damasceno
H. Kuo
Samuel Thomas
G. Saon
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Extending RNN-T-based speech recognition systems with emotion and language classification"
11 / 11 papers shown
Title
Towards a Common Speech Analysis Engine
Hagai Aronowitz
Itai Gat
E. Morais
Weizhong Zhu
R. Hoory
42
3
0
01 Mar 2022
Speech Emotion Recognition using Self-Supervised Features
E. Morais
R. Hoory
Weizhong Zhu
Itai Gat
Matheus Damasceno
Hagai Aronowitz
SSL
MDE
35
114
0
07 Feb 2022
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units
Wei-Ning Hsu
Benjamin Bolte
Yao-Hung Hubert Tsai
Kushal Lakhotia
Ruslan Salakhutdinov
Abdel-rahman Mohamed
SSL
140
2,879
0
14 Jun 2021
SUPERB: Speech processing Universal PERformance Benchmark
Shu-Wen Yang
Po-Han Chi
Yung-Sung Chuang
Cheng-I Jeff Lai
Kushal Lakhotia
...
Shuyan Dong
Shang-Wen Li
Shinji Watanabe
Abdel-rahman Mohamed
Hung-yi Lee
SSL
82
910
0
03 May 2021
RNN Transducer Models For Spoken Language Understanding
Samuel Thomas
H. Kuo
G. Saon
Zoltán Tüske
Brian Kingsbury
Gakuto Kurata
Zvi Kons
R. Hoory
34
14
0
08 Apr 2021
Advancing RNN Transducer Technology for Speech Recognition
G. Saon
Zoltan Tueske
Daniel Bolaños
Brian Kingsbury
63
87
0
17 Mar 2021
Streaming End-to-End Bilingual ASR Systems with Joint Language Identification
Surabhi Punjabi
Harish Arsikere
Zeynab Raeesy
Chander Chandak
Nikhil Bhave
...
Sri Garimella
Roland Maas
Mat Hans
Athanasios Mouchtaris
Siegfried Kunzmann
33
25
0
08 Jul 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
204
5,734
0
20 Jun 2020
Joint Speech Recognition and Speaker Diarization via Sequence Transduction
Laurent El Shafey
H. Soltau
Izhak Shafran
55
99
0
09 Jul 2019
MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversations
Soujanya Poria
Devamanyu Hazarika
Navonil Majumder
Gautam Naik
Min Zhang
Rada Mihalcea
98
1,055
0
05 Oct 2018
Sequence Transduction with Recurrent Neural Networks
Alex Graves
157
1,858
0
14 Nov 2012
1