ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.12283
  4. Cited By
ST-BERT: Cross-modal Language Model Pre-training For End-to-end Spoken
  Language Understanding

ST-BERT: Cross-modal Language Model Pre-training For End-to-end Spoken Language Understanding

23 October 2020
Minjeong Kim
Gyuwan Kim
Sang-Woo Lee
Jung-Woo Ha
    VLM
ArXivPDFHTML

Papers citing "ST-BERT: Cross-modal Language Model Pre-training For End-to-end Spoken Language Understanding"

9 / 9 papers shown
Title
A Data-Efficient Visual-Audio Representation with Intuitive Fine-tuning
  for Voice-Controlled Robots
A Data-Efficient Visual-Audio Representation with Intuitive Fine-tuning for Voice-Controlled Robots
Peixin Chang
Shuijing Liu
Tianchen Ji
Neeloy Chakraborty
Kaiwen Hong
Katherine Driggs-Campbell
51
3
0
23 Jan 2023
TESSP: Text-Enhanced Self-Supervised Speech Pre-training
TESSP: Text-Enhanced Self-Supervised Speech Pre-training
Zhuoyuan Yao
Shuo Ren
Sanyuan Chen
Ziyang Ma
Pengcheng Guo
Linfu Xie
24
5
0
24 Nov 2022
Introducing Semantics into Speech Encoders
Introducing Semantics into Speech Encoders
Derek Xu
Shuyan Dong
Changhan Wang
Suyoun Kim
Zhaojiang Lin
...
Alexei Baevski
Guan-Ting Lin
Hung-yi Lee
Yizhou Sun
Wei Wang
SSL
36
3
0
15 Nov 2022
SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder
  Based Speech-Text Pre-training
SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training
Zi-Hua Zhang
Long Zhou
Junyi Ao
Shujie Liu
Lirong Dai
Jinyu Li
Furu Wei
61
57
0
07 Oct 2022
Calibrate and Refine! A Novel and Agile Framework for ASR-error Robust
  Intent Detection
Calibrate and Refine! A Novel and Agile Framework for ASR-error Robust Intent Detection
Peilin Zhou
Dading Chong
Helin Wang
Qingcheng Zeng
24
5
0
23 May 2022
Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in
  End-to-End Speech-to-Intent Systems
Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in End-to-End Speech-to-Intent Systems
Vishal Sunder
Eric Fosler-Lussier
Samuel Thomas
H. Kuo
Brian Kingsbury
23
7
0
11 Apr 2022
Building Robust Spoken Language Understanding by Cross Attention between
  Phoneme Sequence and ASR Hypothesis
Building Robust Spoken Language Understanding by Cross Attention between Phoneme Sequence and ASR Hypothesis
Zexun Wang
Yuquan Le
Yi Zhu
Yuming Zhao
M.-W. Feng
Meng Chen
Xiaodong He
25
5
0
22 Mar 2022
Integration of Pre-trained Networks with Continuous Token Interface for
  End-to-End Spoken Language Understanding
Integration of Pre-trained Networks with Continuous Token Interface for End-to-End Spoken Language Understanding
S. Seo
Donghyun Kwak
Bowon Lee
32
33
0
15 Apr 2021
Google's Neural Machine Translation System: Bridging the Gap between
  Human and Machine Translation
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Zhiwen Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
718
6,748
0
26 Sep 2016
1