2410.14971
BrainECHO: Semantic Brain Signal Decoding through Vector-Quantized Spectrogram Reconstruction for Whisper-Enhanced Text Generation
19 October 2024
Jilong Li
Zhenxi Song
Jiaqi Wang
Meishan Zhang
Honghai Liu
Min Zhang
Zhiguo Zhang
Papers citing "BrainECHO: Semantic Brain Signal Decoding through Vector-Quantized Spectrogram Reconstruction for Whisper-Enhanced Text Generation"
20 / 20 papers shown
Title
Bridging Brain with Foundation Models through Self-Supervised Learning
Hamdi Altaheri
Fakhri Karray
Md. Milon Islam
S M Taslim Uddin Raju
Amir-Hossein Karimi
19
0
0
19 Jun 2025
NeuGPT: Unified multi-modal Neural GPT
Yiqian Yang
Yiqun Duan
Hyejeong Jo
Qiang Zhang
Renjing Xu
Oiwi Parker Jones
Xuming Hu
Chin-Teng Lin
Hui Xiong
90
6
0
28 Oct 2024
MAD: Multi-Alignment MEG-to-Text Decoding
Yiqian Yang
Hyejeong Jo
Yiqun Duan
Qiang Zhang
Jinni Zhou
Won Hee Lee
Renjing Xu
Hui Xiong
88
11
0
03 Jun 2024
Open-vocabulary Auditory Neural Decoding Using fMRI-prompted LLM
Xiaoyu Chen
Changde Du
Che Liu
Yizhe Wang
Huiguang He
54
3
0
13 May 2024
Are EEG-to-Text Models Working?
Hyejeong Jo
Yiqian Yang
Juhyeok Han
Yiqun Duan
Hui Xiong
Won Hee Lee
98
19
0
10 May 2024
Deep Representation Learning for Open Vocabulary Electroencephalography-to-Text Decoding
H. Amrani
D. Micucci
Paolo Napoletano
83
6
0
15 Nov 2023
UniCoRN: Unified Cognitive Signal ReconstructioN bridging cognitive signals and human language
Nuwa Xi
Sendong Zhao
Hao Wang
Chi-Liang Liu
Bing Qin
Ting Liu
88
21
0
06 Jul 2023
DUB: Discrete Unit Back-translation for Speech Translation
Dong Zhang
Rong Ye
Tom Ko
Mingxuan Wang
Yaqian Zhou
90
27
0
19 May 2023
AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning
Qingru Zhang
Minshuo Chen
Alexander Bukharin
Nikos Karampatziakis
Pengcheng He
Yu Cheng
Weizhu Chen
Tuo Zhao
68
126
0
18 Mar 2023
WhisperX: Time-Accurate Speech Transcription of Long-Form Audio
Max Bain
Jaesung Huh
Tengda Han
Andrew Zisserman
143
242
0
01 Mar 2023
Robust Speech Recognition via Large-Scale Weak Supervision
Alec Radford
Jong Wook Kim
Tao Xu
Greg Brockman
C. McLeavey
Ilya Sutskever
OffRL
230
3,770
0
06 Dec 2022
Textless Direct Speech-to-Speech Translation with Discrete Speech Representation
Xinjian Li
Ye Jia
Chung-Cheng Chiu
100
30
0
31 Oct 2022
Decoding speech perception from non-invasive brain recordings
Alexandre Défossez
Charlotte Caucheteux
Jérémy Rapin
Ori Kabeli
J. King
107
139
0
25 Aug 2022
Diffsound: Discrete Diffusion Model for Text-to-sound Generation
Dongchao Yang
Jianwei Yu
Helin Wang
Wen Wang
Chao Weng
Yuexian Zou
Dong Yu
DiffM
111
306
0
20 Jul 2022
Open Vocabulary Electroencephalography-To-Text Decoding and Zero-shot Sentiment Classification
Zhenhailong Wang
Heng Ji
189
82
0
05 Dec 2021
Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation
Qiuqiang Kong
Yin Cao
Haohe Liu
Keunwoo Choi
Yuxuan Wang
190
100
0
12 Sep 2021
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
323
5,868
0
20 Jun 2020
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
M. Lewis
Yinhan Liu
Naman Goyal
Marjan Ghazvininejad
Abdel-rahman Mohamed
Omer Levy
Veselin Stoyanov
Luke Zettlemoyer
AIMat
VLM
268
10,913
0
29 Oct 2019
Generating Diverse High-Fidelity Images with VQ-VAE-2
Ali Razavi
Aaron van den Oord
Oriol Vinyals
DRL
BDL
200
1,832
0
02 Jun 2019
Neural Discrete Representation Learning
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
257
5,092
0
02 Nov 2017