2004.09584
Cited By
ViSQOL v3: An Open Source Production Ready Objective Speech and Audio Metric
20 April 2020
Michael Chinen
Felicia S. C. Lim
Jan Skoglund
Nikita Gureev
F. O'Gorman
Andrew Hines
Papers citing
"ViSQOL v3: An Open Source Production Ready Objective Speech and Audio Metric"
21 / 71 papers shown
Title
LMCodec: A Low Bitrate Speech Codec With Causal Transformer Models
Teerapat Jenrungrot
Michael Chinen
W. Kleijn
Jan Skoglund
Zalan Borsos
Neil Zeghidour
Marco Tagliasacchi
60
19
0
23 Mar 2023
LA-VocE: Low-SNR Audio-visual Speech Enhancement using Neural Vocoders
Rodrigo Mira
Buye Xu
Jacob Donley
Anurag Kumar
Stavros Petridis
V. Ithapu
Maja Pantic
28
13
0
20 Nov 2022
High Fidelity Neural Audio Compression
Alexandre Défossez
Jade Copet
Gabriel Synnaeve
Yossi Adi
35
607
0
24 Oct 2022
AudioGen: Textually Guided Audio Generation
Felix Kreuk
Gabriel Synnaeve
Adam Polyak
Uriel Singer
Alexandre Défossez
Jade Copet
Devi Parikh
Yaniv Taigman
Yossi Adi
DiffM
27
290
0
30 Sep 2022
AudioLM: a Language Modeling Approach to Audio Generation
Zalan Borsos
Raphaël Marinier
Damien Vincent
Eugene Kharitonov
Olivier Pietquin
...
Dominik Roblek
O. Teboul
David Grangier
Marco Tagliasacchi
Neil Zeghidour
AuLLM
73
573
0
07 Sep 2022
NESC: Robust Neural End-2-End Speech Coding with GANs
N. Pia
Kishan Gupta
Srikanth Korse
M. Multrus
Guillaume Fuchs
33
15
0
07 Jul 2022
Ultra-Low-Bitrate Speech Coding with Pretrained Transformers
Ali Siahkoohi
Michael Chinen
Tom Denton
W. Kleijn
Jan Skoglund
22
8
0
05 Jul 2022
Wideband Audio Waveform Evaluation Networks: Efficient, Accurate Estimation of Speech Qualities
Andrew A. Catellier
S. Voran
22
3
0
27 Jun 2022
Bandwidth-Scalable Fully Mask-Based Deep FCRN Acoustic Echo Cancellation and Postfiltering
Ernst Seidel
R. Olsson
K. Haddad
Zhengyang Li
Pejman Mowlaee
Tim Fingscheidt
21
1
0
09 May 2022
Disentangling speech from surroundings with neural embeddings
Ahmed Omran
Neil Zeghidour
Zalan Borsos
Félix de Chaumont Quitry
M. Slaney
Marco Tagliasacchi
14
8
0
29 Mar 2022
A Text-to-Speech Pipeline, Evaluation Methodology, and Initial Fine-Tuning Results for Child Speech Synthesis
Rishabh Jain
Mariam Yiwere
Dan Bigioi
Peter Corcoran
H. Cucu
27
14
0
22 Mar 2022
Joint Noise Reduction and Listening Enhancement for Full-End Speech Enhancement
Haoyu Li
Yun Liu
Junichi Yamagishi
15
2
0
22 Mar 2022
AIDA: An Active Inference-based Design Agent for Audio Processing Algorithms
Albert Podusenko
Bart Van Erp
Magnus T. Koudahl
Bert De Vries
14
5
0
26 Dec 2021
Perceptual Evaluation of 360 Audiovisual Quality and Machine Learning Predictions
R. F. Fela
N. Zacharov
Søren Forchhammer
22
3
0
22 Dec 2021
AQP: An Open Modular Python Platform for Objective Speech and Audio Quality Metrics
Jack Geraghty
Jiazheng Li
Alessandro Ragano
Andrew Hines
25
0
0
26 Oct 2021
Objective Measures of Perceptual Audio Quality Reviewed: An Evaluation of Their Application Domain Dependence
Matteo Torcoli
T. Kastner
Jürgen Herre
21
58
0
21 Oct 2021
EasyCom: An Augmented Reality Dataset to Support Algorithms for Easy Communication in Noisy Environments
Jacob Donley
V. Tourbabin
Jung-Suk Lee
Mark Broyles
Hao Jiang
Jie Shen
Maja Pantic
V. Ithapu
Ravish Mehra
21
62
0
09 Jul 2021
SoundStream: An End-to-End Neural Audio Codec
Neil Zeghidour
Alejandro Luebs
Ahmed Omran
Jan Skoglund
Marco Tagliasacchi
AI4TS
43
739
0
07 Jul 2021
Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement
Haoyu Li
Junichi Yamagishi
27
9
0
17 Apr 2021
SESQA: semi-supervised learning for speech quality assessment
Joan Serrà
Jordi Pons
Santiago Pascual
21
42
0
01 Oct 2020
Deep Learning-Based Single-Ended Objective Quality Measures for Time-Scale Modified Audio
Timothy Roberts
Aaron Nicolson
K. Paliwal
8
1
0
07 Sep 2020