ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.04284
  4. Cited By
Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent
  Systems

Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems

8 October 2020
Yinghui Huang
H. Kuo
Samuel Thomas
Zvi Kons
Kartik Audhkhasi
Brian Kingsbury
R. Hoory
M. Picheny
    VLM
ArXivPDFHTML

Papers citing "Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems"

41 / 41 papers shown
Title
A dual task learning approach to fine-tune a multilingual semantic
  speech encoder for Spoken Language Understanding
A dual task learning approach to fine-tune a multilingual semantic speech encoder for Spoken Language Understanding
G. Laperriere
Sahar Ghannay
Bassam Jabaian
Yannick Esteve
28
0
0
17 Jun 2024
SDIF-DA: A Shallow-to-Deep Interaction Framework with Data Augmentation
  for Multi-modal Intent Detection
SDIF-DA: A Shallow-to-Deep Interaction Framework with Data Augmentation for Multi-modal Intent Detection
Shijue Huang
Libo Qin
Bingbing Wang
Geng Tu
Ruifeng Xu
20
4
0
31 Dec 2023
Improving End-to-End Speech Processing by Efficient Text Data
  Utilization with Latent Synthesis
Improving End-to-End Speech Processing by Efficient Text Data Utilization with Latent Synthesis
Jianqiao Lu
Wenyong Huang
Nianzu Zheng
Xingshan Zeng
Y. Yeung
Xiao Chen
SyDa
24
1
0
09 Oct 2023
GRASS: Unified Generation Model for Speech-to-Semantic Tasks
GRASS: Unified Generation Model for Speech-to-Semantic Tasks
Aobo Xia
Shuyu Lei
Yushu Yang
Xiang Guo
Hua Chai
17
0
0
06 Sep 2023
Improving Joint Speech-Text Representations Without Alignment
Improving Joint Speech-Text Representations Without Alignment
Cal Peyser
Zhong Meng
Ke Hu
Rohit Prabhavalkar
Andrew Rosenberg
Tara N. Sainath
M. Picheny
Kyunghyun Cho
VLM
31
4
0
11 Aug 2023
Integrating Pretrained ASR and LM to Perform Sequence Generation for
  Spoken Language Understanding
Integrating Pretrained ASR and LM to Perform Sequence Generation for Spoken Language Understanding
Siddhant Arora
Hayato Futami
Yosuke Kashiwagi
E. Tsunoo
Brian Yan
Shinji Watanabe
21
4
0
20 Jul 2023
Semantic enrichment towards efficient speech representations
Semantic enrichment towards efficient speech representations
G. Laperriere
H. Nguyen
Sahar Ghannay
Bassam Jabaian
Yannick Esteve
45
2
0
03 Jul 2023
Multimodal Audio-textual Architecture for Robust Spoken Language
  Understanding
Multimodal Audio-textual Architecture for Robust Spoken Language Understanding
Anderson R. Avila
Mehdi Rezagholizadeh
Chao Xing
16
1
0
12 Jun 2023
Zero-Shot End-to-End Spoken Language Understanding via Cross-Modal
  Selective Self-Training
Zero-Shot End-to-End Spoken Language Understanding via Cross-Modal Selective Self-Training
Jianfeng He
Julian Salazar
Kaisheng Yao
Haoqi Li
Jason (Jinglun) Cai
VLM
13
7
0
22 May 2023
Improving End-to-End SLU performance with Prosodic Attention and
  Distillation
Improving End-to-End SLU performance with Prosodic Attention and Distillation
Shangeth Rajaa
29
2
0
14 May 2023
Bridging Speech and Textual Pre-trained Models with Unsupervised ASR
Bridging Speech and Textual Pre-trained Models with Unsupervised ASR
Jiatong Shi
Chan-Jan Hsu
Ho-Lam Chung
Dongji Gao
Leibny Paola García-Perera
Shinji Watanabe
Ann Lee
Hung-yi Lee
32
12
0
06 Nov 2022
Speech-text based multi-modal training with bidirectional attention for
  improved speech recognition
Speech-text based multi-modal training with bidirectional attention for improved speech recognition
Yuhang Yang
Haihua Xu
Hao-Ming Huang
E. Chng
Sheng Li
44
7
0
01 Nov 2022
End-to-end Spoken Language Understanding with Tree-constrained Pointer
  Generator
End-to-end Spoken Language Understanding with Tree-constrained Pointer Generator
Guangzhi Sun
C. Zhang
P. Woodland
27
8
0
29 Oct 2022
On the Use of Semantically-Aligned Speech Representations for Spoken
  Language Understanding
On the Use of Semantically-Aligned Speech Representations for Spoken Language Understanding
G. Laperriere
Valentin Pelloin
Mickael Rouvier
Themos Stafylakis
Yannick Esteve
29
9
0
11 Oct 2022
Two-Pass Low Latency End-to-End Spoken Language Understanding
Two-Pass Low Latency End-to-End Spoken Language Understanding
Siddhant Arora
Siddharth Dalmia
Xuankai Chang
Brian Yan
A. Black
Shinji Watanabe
VLM
30
19
0
14 Jul 2022
Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in
  End-to-End Speech-to-Intent Systems
Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in End-to-End Speech-to-Intent Systems
Vishal Sunder
Eric Fosler-Lussier
Samuel Thomas
H. Kuo
Brian Kingsbury
23
7
0
11 Apr 2022
Towards End-to-End Integration of Dialog History for Improved Spoken
  Language Understanding
Towards End-to-End Integration of Dialog History for Improved Spoken Language Understanding
Vishal Sunder
Samuel Thomas
H. Kuo
Jatin Ganhotra
Brian Kingsbury
Eric Fosler-Lussier
VLM
41
10
0
11 Apr 2022
Adding Connectionist Temporal Summarization into Conformer to Improve
  Its Decoder Efficiency For Speech Recognition
Adding Connectionist Temporal Summarization into Conformer to Improve Its Decoder Efficiency For Speech Recognition
N. J. Wang
Zongfeng Quan
Shaojun Wang
Jing Xiao
11
1
0
08 Apr 2022
End-to-end model for named entity recognition from speech without paired
  training data
End-to-end model for named entity recognition from speech without paired training data
Salima Mdhaffar
J. Duret
Titouan Parcollet
Yannick Esteve
14
13
0
02 Apr 2022
Towards Reducing the Need for Speech Training Data To Build Spoken
  Language Understanding Systems
Towards Reducing the Need for Speech Training Data To Build Spoken Language Understanding Systems
Samuel Thomas
H. Kuo
Brian Kingsbury
G. Saon
14
24
0
26 Feb 2022
A new data augmentation method for intent classification enhancement and
  its application on spoken conversation datasets
A new data augmentation method for intent classification enhancement and its application on spoken conversation datasets
Zvi Kons
Aharon Satt
H. Kuo
Samuel Thomas
Boaz Carmeli
R. Hoory
Brian Kingsbury
9
0
0
21 Feb 2022
Improving End-to-End Models for Set Prediction in Spoken Language
  Understanding
Improving End-to-End Models for Set Prediction in Spoken Language Understanding
H. Kuo
Zoltán Tüske
Samuel Thomas
Brian Kingsbury
G. Saon
21
0
0
28 Jan 2022
Improving Hybrid CTC/Attention End-to-end Speech Recognition with
  Pretrained Acoustic and Language Model
Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model
Keqi Deng
Songjun Cao
Yike Zhang
Long Ma
VLM
17
31
0
14 Dec 2021
FANS: Fusing ASR and NLU for on-device SLU
FANS: Fusing ASR and NLU for on-device SLU
Martin H. Radfar
Athanasios Mouchtaris
Siegfried Kunzmann
Ariya Rastrow
17
12
0
31 Oct 2021
Optimizing Alignment of Speech and Language Latent Spaces for End-to-End
  Speech Recognition and Understanding
Optimizing Alignment of Speech and Language Latent Spaces for End-to-End Speech Recognition and Understanding
Wei Wang
Shuo Ren
Yao Qian
Shujie Liu
Yu Shi
Y. Qian
Michael Zeng
37
16
0
23 Oct 2021
Integrating Dialog History into End-to-End Spoken Language Understanding
  Systems
Integrating Dialog History into End-to-End Spoken Language Understanding Systems
Jatin Ganhotra
Samuel Thomas
H. Kuo
Sachindra Joshi
G. Saon
Zoltán Tüske
Brian Kingsbury
30
10
0
18 Aug 2021
Knowledge Distillation from BERT Transformer to Speech Transformer for
  Intent Classification
Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification
Yiding Jiang
Bidisha Sharma
Maulik C. Madhavi
Haizhou Li
30
25
0
05 Aug 2021
Representation based meta-learning for few-shot spoken intent
  recognition
Representation based meta-learning for few-shot spoken intent recognition
Ashish R. Mittal
Samarth Bharadwaj
Shreya Khare
Saneem A. Chemmengath
Karthik Sankaranarayanan
Brian Kingsbury
20
12
0
29 Jun 2021
Rethinking End-to-End Evaluation of Decomposable Tasks: A Case Study on
  Spoken Language Understanding
Rethinking End-to-End Evaluation of Decomposable Tasks: A Case Study on Spoken Language Understanding
Siddhant Arora
Alissa Ostapenko
Vijay Viswanathan
Siddharth Dalmia
Florian Metze
Shinji Watanabe
A. Black
ELM
25
13
0
29 Jun 2021
Speech2Slot: An End-to-End Knowledge-based Slot Filling from Speech
Speech2Slot: An End-to-End Knowledge-based Slot Filling from Speech
Pengwei Wang
X. Ye
Xiaohuan Zhou
Jinghui Xie
Hao Wang
13
6
0
10 May 2021
Integration of Pre-trained Networks with Continuous Token Interface for
  End-to-End Spoken Language Understanding
Integration of Pre-trained Networks with Continuous Token Interface for End-to-End Spoken Language Understanding
S. Seo
Donghyun Kwak
Bowon Lee
32
33
0
15 Apr 2021
RNN Transducer Models For Spoken Language Understanding
RNN Transducer Models For Spoken Language Understanding
Samuel Thomas
H. Kuo
G. Saon
Zoltán Tüske
Brian Kingsbury
Gakuto Kurata
Zvi Kons
R. Hoory
16
14
0
08 Apr 2021
Speak or Chat with Me: End-to-End Spoken Language Understanding System
  with Flexible Inputs
Speak or Chat with Me: End-to-End Spoken Language Understanding System with Flexible Inputs
Sujeong Cha
Wang Hou
Hyun Jung
M. Phung
M. Picheny
H. Kuo
Samuel Thomas
E. Morais
VLM
22
15
0
07 Apr 2021
Timers and Such: A Practical Benchmark for Spoken Language Understanding
  with Numbers
Timers and Such: A Practical Benchmark for Spoken Language Understanding with Numbers
Loren Lugosch
Piyush Papreja
Mirco Ravanelli
A. Heba
Titouan Parcollet
24
12
0
04 Apr 2021
An Approach to Improve Robustness of NLP Systems against ASR Errors
An Approach to Improve Robustness of NLP Systems against ASR Errors
Tong Cui
Jinghui Xiao
Liangyou Li
Xin Jiang
Qun Liu
19
11
0
25 Mar 2021
Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and
  language Models for Intent Classification
Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and language Models for Intent Classification
Bidisha Sharma
Maulik C. Madhavi
Haizhou Li
18
19
0
15 Feb 2021
Speech-language Pre-training for End-to-end Spoken Language
  Understanding
Speech-language Pre-training for End-to-end Spoken Language Understanding
Yao Qian
Ximo Bian
Yu Shi
Naoyuki Kanda
Leo Shen
Zhen Xiao
Michael Zeng
AuLLM
21
45
0
11 Feb 2021
Multi-task Language Modeling for Improving Speech Recognition of Rare
  Words
Multi-task Language Modeling for Improving Speech Recognition of Rare Words
Chao-Han Huck Yang
Linda Liu
Ankur Gandhe
Yile Gu
A. Raju
Denis Filimonov
I. Bulyko
24
30
0
23 Nov 2020
Tie Your Embeddings Down: Cross-Modal Latent Spaces for End-to-end
  Spoken Language Understanding
Tie Your Embeddings Down: Cross-Modal Latent Spaces for End-to-end Spoken Language Understanding
B. Agrawal
Markus Müller
Martin H. Radfar
Samridhi Choudhary
Athanasios Mouchtaris
Siegfried Kunzmann
11
40
0
18 Nov 2020
End-to-End Spoken Language Understanding Without Full Transcripts
End-to-End Spoken Language Understanding Without Full Transcripts
H. Kuo
Zoltán Tüske
Samuel Thomas
Yinghui Huang
Kartik Audhkhasi
Brian Kingsbury
Gakuto Kurata
Zvi Kons
R. Hoory
Luis Lastras
AuLLM
23
26
0
30 Sep 2020
Large-scale Transfer Learning for Low-resource Spoken Language
  Understanding
Large-scale Transfer Learning for Low-resource Spoken Language Understanding
X. Jia
Jianzong Wang
Zhiyong Zhang
Ning Cheng
Jing Xiao
19
17
0
13 Aug 2020
1