Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2307.01323
Cited By
Semantic enrichment towards efficient speech representations
3 July 2023
G. Laperriere
H. Nguyen
Sahar Ghannay
Bassam Jabaian
Yannick Esteve
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Semantic enrichment towards efficient speech representations"
14 / 14 papers shown
Title
SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation
Sameer Khurana
Antoine Laurent
James R. Glass
40
36
0
17 May 2022
Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition?
Sanyuan Chen
Yu Wu
Chengyi Wang
Shujie Liu
Zhuo Chen
...
Gang Liu
Jinyu Li
Jian Wu
Xiangzhan Yu
Furu Wei
SSL
74
42
0
27 Apr 2022
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale
Arun Babu
Changhan Wang
Andros Tjandra
Kushal Lakhotia
Qiantong Xu
...
Yatharth Saraf
J. Pino
Alexei Baevski
Alexis Conneau
Michael Auli
SSL
84
699
0
17 Nov 2021
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Sanyuan Chen
Chengyi Wang
Zhengyang Chen
Yu-Huan Wu
Shujie Liu
...
Yao Qian
Jian Wu
Micheal Zeng
Xiangzhan Yu
Furu Wei
SSL
206
1,846
0
26 Oct 2021
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units
Wei-Ning Hsu
Benjamin Bolte
Yao-Hung Hubert Tsai
Kushal Lakhotia
Ruslan Salakhutdinov
Abdel-rahman Mohamed
SSL
145
2,939
0
14 Jun 2021
Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings
L. Pepino
Pablo Riera
Luciana Ferrer
46
360
0
08 Apr 2021
On the use of Self-supervised Pre-trained Acoustic and Linguistic Features for Continuous Speech Emotion Recognition
Manon Macary
Marie Tahon
Yannick Esteve
Anthony Rousseau
SSL
49
55
0
18 Nov 2020
Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems
Yinghui Huang
H. Kuo
Samuel Thomas
Zvi Kons
Kartik Audhkhasi
Brian Kingsbury
R. Hoory
M. Picheny
VLM
39
63
0
08 Oct 2020
TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech
Andy T. Liu
Shang-Wen Li
Hung-yi Lee
SSL
117
358
0
12 Jul 2020
Language-agnostic BERT Sentence Embedding
Fangxiaoyu Feng
Yinfei Yang
Daniel Cer
N. Arivazhagan
Wei Wang
133
904
0
03 Jul 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
219
5,767
0
20 Jun 2020
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders
Andy T. Liu
Shu-Wen Yang
Po-Han Chi
Po-Chun Hsu
Hung-yi Lee
SSL
130
373
0
25 Oct 2019
Curriculum-based transfer learning for an effective end-to-end spoken language understanding and domain portability
Antoine Caubrière
N. Tomashenko
Antoine Laurent
Emmanuel Morin
Nathalie Camelin
Yannick Esteve
47
54
0
18 Jun 2019
From Audio to Semantics: Approaches to end-to-end spoken language understanding
Parisa Haghani
A. Narayanan
M. Bacchiani
Galen Chuang
Neeraj Gaur
Pedro J. Moreno
Rohit Prabhavalkar
Zhongdi Qu
Austin Waters
50
150
0
24 Sep 2018
1