Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems

8 October 2020

Kartik Audhkhasi

Papers citing "Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems"

41 / 41 papers shown

Title
A dual task learning approach to fine-tune a multilingual semantic speech encoder for Spoken Language Understanding G. Laperriere Sahar Ghannay Bassam Jabaian Yannick Esteve 28 0 0 17 Jun 2024
SDIF-DA: A Shallow-to-Deep Interaction Framework with Data Augmentation for Multi-modal Intent Detection Shijue Huang Libo Qin Bingbing Wang Geng Tu Ruifeng Xu 20 4 0 31 Dec 2023
Improving End-to-End Speech Processing by Efficient Text Data Utilization with Latent Synthesis Jianqiao Lu Wenyong Huang Nianzu Zheng Xingshan Zeng Y. Yeung Xiao Chen SyDa 24 1 0 09 Oct 2023
GRASS: Unified Generation Model for Speech-to-Semantic Tasks Aobo Xia Shuyu Lei Yushu Yang Xiang Guo Hua Chai 17 0 0 06 Sep 2023
Improving Joint Speech-Text Representations Without Alignment Cal Peyser Zhong Meng Ke Hu Rohit Prabhavalkar Andrew Rosenberg Tara N. Sainath M. Picheny Kyunghyun Cho VLM 31 4 0 11 Aug 2023
Integrating Pretrained ASR and LM to Perform Sequence Generation for Spoken Language Understanding Siddhant Arora Hayato Futami Yosuke Kashiwagi E. Tsunoo Brian Yan Shinji Watanabe 21 4 0 20 Jul 2023
Semantic enrichment towards efficient speech representations G. Laperriere H. Nguyen Sahar Ghannay Bassam Jabaian Yannick Esteve 45 2 0 03 Jul 2023
Multimodal Audio-textual Architecture for Robust Spoken Language Understanding Anderson R. Avila Mehdi Rezagholizadeh Chao Xing 16 1 0 12 Jun 2023
Zero-Shot End-to-End Spoken Language Understanding via Cross-Modal Selective Self-Training Jianfeng He Julian Salazar Kaisheng Yao Haoqi Li Jason (Jinglun) Cai VLM 13 7 0 22 May 2023
Improving End-to-End SLU performance with Prosodic Attention and Distillation Shangeth Rajaa 29 2 0 14 May 2023
Bridging Speech and Textual Pre-trained Models with Unsupervised ASR Jiatong Shi Chan-Jan Hsu Ho-Lam Chung Dongji Gao Leibny Paola García-Perera Shinji Watanabe Ann Lee Hung-yi Lee 32 12 0 06 Nov 2022
Speech-text based multi-modal training with bidirectional attention for improved speech recognition Yuhang Yang Haihua Xu Hao-Ming Huang E. Chng Sheng Li 44 7 0 01 Nov 2022
End-to-end Spoken Language Understanding with Tree-constrained Pointer Generator Guangzhi Sun C. Zhang P. Woodland 27 8 0 29 Oct 2022
On the Use of Semantically-Aligned Speech Representations for Spoken Language Understanding G. Laperriere Valentin Pelloin Mickael Rouvier Themos Stafylakis Yannick Esteve 29 9 0 11 Oct 2022
Two-Pass Low Latency End-to-End Spoken Language Understanding Siddhant Arora Siddharth Dalmia Xuankai Chang Brian Yan A. Black Shinji Watanabe VLM 30 19 0 14 Jul 2022
Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in End-to-End Speech-to-Intent Systems Vishal Sunder Eric Fosler-Lussier Samuel Thomas H. Kuo Brian Kingsbury 23 7 0 11 Apr 2022
Towards End-to-End Integration of Dialog History for Improved Spoken Language Understanding Vishal Sunder Samuel Thomas H. Kuo Jatin Ganhotra Brian Kingsbury Eric Fosler-Lussier VLM 41 10 0 11 Apr 2022
Adding Connectionist Temporal Summarization into Conformer to Improve Its Decoder Efficiency For Speech Recognition N. J. Wang Zongfeng Quan Shaojun Wang Jing Xiao 11 1 0 08 Apr 2022
End-to-end model for named entity recognition from speech without paired training data Salima Mdhaffar J. Duret Titouan Parcollet Yannick Esteve 14 13 0 02 Apr 2022
Towards Reducing the Need for Speech Training Data To Build Spoken Language Understanding Systems Samuel Thomas H. Kuo Brian Kingsbury G. Saon 14 24 0 26 Feb 2022
A new data augmentation method for intent classification enhancement and its application on spoken conversation datasets Zvi Kons Aharon Satt H. Kuo Samuel Thomas Boaz Carmeli R. Hoory Brian Kingsbury 9 0 0 21 Feb 2022
Improving End-to-End Models for Set Prediction in Spoken Language Understanding H. Kuo Zoltán Tüske Samuel Thomas Brian Kingsbury G. Saon 21 0 0 28 Jan 2022
Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model Keqi Deng Songjun Cao Yike Zhang Long Ma VLM 17 31 0 14 Dec 2021
FANS: Fusing ASR and NLU for on-device SLU Martin H. Radfar Athanasios Mouchtaris Siegfried Kunzmann Ariya Rastrow 17 12 0 31 Oct 2021
Optimizing Alignment of Speech and Language Latent Spaces for End-to-End Speech Recognition and Understanding Wei Wang Shuo Ren Yao Qian Shujie Liu Yu Shi Y. Qian Michael Zeng 37 16 0 23 Oct 2021
Integrating Dialog History into End-to-End Spoken Language Understanding Systems Jatin Ganhotra Samuel Thomas H. Kuo Sachindra Joshi G. Saon Zoltán Tüske Brian Kingsbury 30 10 0 18 Aug 2021
Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification Yiding Jiang Bidisha Sharma Maulik C. Madhavi Haizhou Li 30 25 0 05 Aug 2021
Representation based meta-learning for few-shot spoken intent recognition Ashish R. Mittal Samarth Bharadwaj Shreya Khare Saneem A. Chemmengath Karthik Sankaranarayanan Brian Kingsbury 20 12 0 29 Jun 2021
Rethinking End-to-End Evaluation of Decomposable Tasks: A Case Study on Spoken Language Understanding Siddhant Arora Alissa Ostapenko Vijay Viswanathan Siddharth Dalmia Florian Metze Shinji Watanabe A. Black ELM 25 13 0 29 Jun 2021
Speech2Slot: An End-to-End Knowledge-based Slot Filling from Speech Pengwei Wang X. Ye Xiaohuan Zhou Jinghui Xie Hao Wang 13 6 0 10 May 2021
Integration of Pre-trained Networks with Continuous Token Interface for End-to-End Spoken Language Understanding S. Seo Donghyun Kwak Bowon Lee 32 33 0 15 Apr 2021
RNN Transducer Models For Spoken Language Understanding Samuel Thomas H. Kuo G. Saon Zoltán Tüske Brian Kingsbury Gakuto Kurata Zvi Kons R. Hoory 16 14 0 08 Apr 2021
Speak or Chat with Me: End-to-End Spoken Language Understanding System with Flexible Inputs Sujeong Cha Wang Hou Hyun Jung M. Phung M. Picheny H. Kuo Samuel Thomas E. Morais VLM 22 15 0 07 Apr 2021
Timers and Such: A Practical Benchmark for Spoken Language Understanding with Numbers Loren Lugosch Piyush Papreja Mirco Ravanelli A. Heba Titouan Parcollet 24 12 0 04 Apr 2021
An Approach to Improve Robustness of NLP Systems against ASR Errors Tong Cui Jinghui Xiao Liangyou Li Xin Jiang Qun Liu 19 11 0 25 Mar 2021
Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and language Models for Intent Classification Bidisha Sharma Maulik C. Madhavi Haizhou Li 18 19 0 15 Feb 2021
Speech-language Pre-training for End-to-end Spoken Language Understanding Yao Qian Ximo Bian Yu Shi Naoyuki Kanda Leo Shen Zhen Xiao Michael Zeng AuLLM 21 45 0 11 Feb 2021
Multi-task Language Modeling for Improving Speech Recognition of Rare Words Chao-Han Huck Yang Linda Liu Ankur Gandhe Yile Gu A. Raju Denis Filimonov I. Bulyko 24 30 0 23 Nov 2020
Tie Your Embeddings Down: Cross-Modal Latent Spaces for End-to-end Spoken Language Understanding B. Agrawal Markus Müller Martin H. Radfar Samridhi Choudhary Athanasios Mouchtaris Siegfried Kunzmann 11 40 0 18 Nov 2020
End-to-End Spoken Language Understanding Without Full Transcripts H. Kuo Zoltán Tüske Samuel Thomas Yinghui Huang Kartik Audhkhasi Brian Kingsbury Gakuto Kurata Zvi Kons R. Hoory Luis Lastras AuLLM 23 26 0 30 Sep 2020
Large-scale Transfer Learning for Low-resource Spoken Language Understanding X. Jia Jianzong Wang Zhiyong Zhang Ning Cheng Jing Xiao 19 17 0 13 Aug 2020