ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2406.02250
  4. Cited By
Multi-Stage Speech Bandwidth Extension with Flexible Sampling Rate
  Control

Multi-Stage Speech Bandwidth Extension with Flexible Sampling Rate Control

4 June 2024
Ye-Xin Lu
Yang Ai
Zheng-Yan Sheng
Zhen-Hua Ling
ArXiv (abs)PDFHTML

Papers citing "Multi-Stage Speech Bandwidth Extension with Flexible Sampling Rate Control"

14 / 14 papers shown
Title
Towards High-Quality and Efficient Speech Bandwidth Extension with
  Parallel Amplitude and Phase Prediction
Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction
Ye-Xin Lu
Yang Ai
Hui-Peng Du
Zhenhua Ling
60
9
0
12 Jan 2024
mdctGAN: Taming transformer-based GAN for speech super-resolution with
  Modified DCT spectra
mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra
Chenhao Shuai
Chaohua Shi
Lu Gan
Hongqing Liu
61
8
0
18 May 2023
ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders
ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders
Sanghyun Woo
Shoubhik Debnath
Ronghang Hu
Xinlei Chen
Zhuang Liu
In So Kweon
Saining Xie
SyDa
152
806
0
02 Jan 2023
AERO: Audio Super Resolution in the Spectral Domain
AERO: Audio Super Resolution in the Spectral Domain
Moshe Mandel
Or Tal
Yossi Adi
71
26
0
22 Nov 2022
Conditioning and Sampling in Variational Diffusion Models for Speech
  Super-Resolution
Conditioning and Sampling in Variational Diffusion Models for Speech Super-Resolution
Chin-Yun Yu
Sung-Lin Yeh
Gyorgy Fazekas
Hao Tang
DiffM
75
21
0
27 Oct 2022
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling
  Rates
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates
Seungu Han
Junhyeok Lee
DiffM
122
42
0
17 Jun 2022
A ConvNet for the 2020s
A ConvNet for the 2020s
Zhuang Liu
Hanzi Mao
Chaozheng Wu
Christoph Feichtenhofer
Trevor Darrell
Saining Xie
ViT
186
5,213
0
10 Jan 2022
Self-Attention for Audio Super-Resolution
Self-Attention for Audio Super-Resolution
Nathanaël Carraz Rakotonirina
SupR
57
23
0
26 Aug 2021
HiFi-GAN: Generative Adversarial Networks for Efficient and High
  Fidelity Speech Synthesis
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Jungil Kong
Jaehyeon Kim
Jaekyoung Bae
179
1,947
0
12 Oct 2020
ViSQOL v3: An Open Source Production Ready Objective Speech and Audio
  Metric
ViSQOL v3: An Open Source Production Ready Objective Speech and Audio Metric
Michael Chinen
Felicia S. C. Lim
Jan Skoglund
Nikita Gureev
F. O'Gorman
Andrew Hines
63
142
0
20 Apr 2020
Waveform Modeling and Generation Using Hierarchical Recurrent Neural
  Networks for Speech Bandwidth Extension
Waveform Modeling and Generation Using Hierarchical Recurrent Neural Networks for Speech Bandwidth Extension
Zhenhua Ling
Yang Ai
Yu Gu
Lirong Dai
56
61
0
24 Jan 2018
Layer Normalization
Layer Normalization
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
432
10,531
0
21 Jul 2016
Gaussian Error Linear Units (GELUs)
Gaussian Error Linear Units (GELUs)
Dan Hendrycks
Kevin Gimpel
174
5,042
0
27 Jun 2016
Scheduled Sampling for Sequence Prediction with Recurrent Neural
  Networks
Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks
Samy Bengio
Oriol Vinyals
Navdeep Jaitly
Noam M. Shazeer
152
2,038
0
09 Jun 2015
1