Semantic enrichment towards efficient speech representations

3 July 2023

Papers citing "Semantic enrichment towards efficient speech representations"

SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation Sameer Khurana Antoine Laurent James R. Glass 40 36 0 17 May 2022
Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition? Sanyuan Chen Yu Wu Chengyi Wang Shujie Liu Zhuo Chen ... Gang Liu Jinyu Li Jian Wu Xiangzhan Yu Furu Wei SSL 74 42 0 27 Apr 2022
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale Arun Babu Changhan Wang Andros Tjandra Kushal Lakhotia Qiantong Xu ... Yatharth Saraf J. Pino Alexei Baevski Alexis Conneau Michael Auli SSL 84 699 0 17 Nov 2021
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing Sanyuan Chen Chengyi Wang Zhengyang Chen Yu-Huan Wu Shujie Liu ... Yao Qian Jian Wu Michael Zeng Xiangzhan Yu Furu Wei SSL 206 1,846 0 26 Oct 2021
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units Wei-Ning Hsu Benjamin Bolte Yao-Hung Hubert Tsai Kushal Lakhotia Ruslan Salakhutdinov Abdel-rahman Mohamed SSL 145 2,939 0 14 Jun 2021
Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings L. Pepino Pablo Riera Luciana Ferrer 46 360 0 08 Apr 2021
On the use of Self-supervised Pre-trained Acoustic and Linguistic Features for Continuous Speech Emotion Recognition Manon Macary Marie Tahon Yannick Esteve Anthony Rousseau SSL 49 55 0 18 Nov 2020
Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems Yinghui Huang H. Kuo Samuel Thomas Zvi Kons Kartik Audhkhasi Brian Kingsbury R. Hoory M. Picheny VLM 39 63 0 08 Oct 2020
TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech Andy T. Liu Shang-Wen Li Hung-yi Lee SSL 117 358 0 12 Jul 2020
Language-agnostic BERT Sentence Embedding Fangxiaoyu Feng Yinfei Yang Daniel Cer N. Arivazhagan Wei Wang 133 904 0 03 Jul 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations Alexei Baevski Henry Zhou Abdel-rahman Mohamed Michael Auli SSL 219 5,767 0 20 Jun 2020
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders Andy T. Liu Shu-Wen Yang Po-Han Chi Po-Chun Hsu Hung-yi Lee SSL 130 373 0 25 Oct 2019
Curriculum-based transfer learning for an effective end-to-end spoken language understanding and domain portability Antoine Caubrière N. Tomashenko Antoine Laurent Emmanuel Morin Nathalie Camelin Yannick Esteve 47 54 0 18 Jun 2019
From Audio to Semantics: Approaches to end-to-end spoken language understanding Parisa Haghani A. Narayanan M. Bacchiani Galen Chuang Neeraj Gaur Pedro J. Moreno Rohit Prabhavalkar Zhongdi Qu Austin Waters 50 150 0 24 Sep 2018