ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.14971
  4. Cited By
BrainECHO: Semantic Brain Signal Decoding through Vector-Quantized Spectrogram Reconstruction for Whisper-Enhanced Text Generation
v1v2 (latest)

BrainECHO: Semantic Brain Signal Decoding through Vector-Quantized Spectrogram Reconstruction for Whisper-Enhanced Text Generation

19 October 2024
Jilong Li
Zhenxi Song
Jiaqi Wang
Meishan Zhang
Honghai Liu
Min Zhang
Zhiguo Zhang
ArXiv (abs)PDFHTML

Papers citing "BrainECHO: Semantic Brain Signal Decoding through Vector-Quantized Spectrogram Reconstruction for Whisper-Enhanced Text Generation"

20 / 20 papers shown
Title
Bridging Brain with Foundation Models through Self-Supervised Learning
Hamdi Altaheri
Fakhri Karray
Md. Milon Islam
S M Taslim Uddin Raju
Amir-Hossein Karimi
19
0
0
19 Jun 2025
NeuGPT: Unified multi-modal Neural GPT
NeuGPT: Unified multi-modal Neural GPT
Yiqian Yang
Yiqun Duan
Hyejeong Jo
Qiang Zhang
Renjing Xu
Oiwi Parker Jones
Xuming Hu
Chin-Teng Lin
Hui Xiong
90
6
0
28 Oct 2024
MAD: Multi-Alignment MEG-to-Text Decoding
MAD: Multi-Alignment MEG-to-Text Decoding
Yiqian Yang
Hyejeong Jo
Yiqun Duan
Qiang Zhang
Jinni Zhou
Won Hee Lee
Renjing Xu
Hui Xiong
88
11
0
03 Jun 2024
Open-vocabulary Auditory Neural Decoding Using fMRI-prompted LLM
Open-vocabulary Auditory Neural Decoding Using fMRI-prompted LLM
Xiaoyu Chen
Changde Du
Che Liu
Yizhe Wang
Huiguang He
54
3
0
13 May 2024
Are EEG-to-Text Models Working?
Are EEG-to-Text Models Working?
Hyejeong Jo
Yiqian Yang
Juhyeok Han
Yiqun Duan
Hui Xiong
Won Hee Lee
98
19
0
10 May 2024
Deep Representation Learning for Open Vocabulary
  Electroencephalography-to-Text Decoding
Deep Representation Learning for Open Vocabulary Electroencephalography-to-Text Decoding
H. Amrani
D. Micucci
Paolo Napoletano
83
6
0
15 Nov 2023
UniCoRN: Unified Cognitive Signal ReconstructioN bridging cognitive
  signals and human language
UniCoRN: Unified Cognitive Signal ReconstructioN bridging cognitive signals and human language
Nuwa Xi
Sendong Zhao
Hao Wang
Chi-Liang Liu
Bing Qin
Ting Liu
88
21
0
06 Jul 2023
DUB: Discrete Unit Back-translation for Speech Translation
DUB: Discrete Unit Back-translation for Speech Translation
Dong Zhang
Rong Ye
Tom Ko
Mingxuan Wang
Yaqian Zhou
90
27
0
19 May 2023
AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning
AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning
Qingru Zhang
Minshuo Chen
Alexander Bukharin
Nikos Karampatziakis
Pengcheng He
Yu Cheng
Weizhu Chen
Tuo Zhao
68
126
0
18 Mar 2023
WhisperX: Time-Accurate Speech Transcription of Long-Form Audio
WhisperX: Time-Accurate Speech Transcription of Long-Form Audio
Max Bain
Jaesung Huh
Tengda Han
Andrew Zisserman
143
242
0
01 Mar 2023
Robust Speech Recognition via Large-Scale Weak Supervision
Robust Speech Recognition via Large-Scale Weak Supervision
Alec Radford
Jong Wook Kim
Tao Xu
Greg Brockman
C. McLeavey
Ilya Sutskever
OffRL
230
3,770
0
06 Dec 2022
Textless Direct Speech-to-Speech Translation with Discrete Speech
  Representation
Textless Direct Speech-to-Speech Translation with Discrete Speech Representation
Xinjian Li
Ye Jia
Chung-Cheng Chiu
100
30
0
31 Oct 2022
Decoding speech perception from non-invasive brain recordings
Decoding speech perception from non-invasive brain recordings
Alexandre Défossez
Charlotte Caucheteux
Jérémy Rapin
Ori Kabeli
J. King
107
139
0
25 Aug 2022
Diffsound: Discrete Diffusion Model for Text-to-sound Generation
Diffsound: Discrete Diffusion Model for Text-to-sound Generation
Dongchao Yang
Jianwei Yu
Helin Wang
Wen Wang
Chao Weng
Yuexian Zou
Dong Yu
DiffM
111
306
0
20 Jul 2022
Open Vocabulary Electroencephalography-To-Text Decoding and Zero-shot
  Sentiment Classification
Open Vocabulary Electroencephalography-To-Text Decoding and Zero-shot Sentiment Classification
Zhenhailong Wang
Heng Ji
189
82
0
05 Dec 2021
Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music
  Source Separation
Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation
Qiuqiang Kong
Yin Cao
Haohe Liu
Keunwoo Choi
Yuxuan Wang
190
100
0
12 Sep 2021
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech
  Representations
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
323
5,868
0
20 Jun 2020
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language
  Generation, Translation, and Comprehension
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
M. Lewis
Yinhan Liu
Naman Goyal
Marjan Ghazvininejad
Abdel-rahman Mohamed
Omer Levy
Veselin Stoyanov
Luke Zettlemoyer
AIMatVLM
268
10,913
0
29 Oct 2019
Generating Diverse High-Fidelity Images with VQ-VAE-2
Generating Diverse High-Fidelity Images with VQ-VAE-2
Ali Razavi
Aaron van den Oord
Oriol Vinyals
DRLBDL
200
1,832
0
02 Jun 2019
Neural Discrete Representation Learning
Neural Discrete Representation Learning
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDLSSLOCL
257
5,092
0
02 Nov 2017
1