ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.00877
  4. Cited By
High-Fidelity Audio Generation and Representation Learning with Guided
  Adversarial Autoencoder

High-Fidelity Audio Generation and Representation Learning with Guided Adversarial Autoencoder

1 June 2020
Kazi Nazmul Haque
R. Rana
Björn W Schuller
    DRL
ArXivPDFHTML

Papers citing "High-Fidelity Audio Generation and Representation Learning with Guided Adversarial Autoencoder"

50 / 73 papers shown
Title
Generative Adversarial Networks
Generative Adversarial Networks
Gilad Cohen
Raja Giryes
GAN
223
30,089
0
01 Mar 2022
Guided Generative Adversarial Neural Network for Representation Learning
  and High Fidelity Audio Generation using Fewer Labelled Audio Data
Guided Generative Adversarial Neural Network for Representation Learning and High Fidelity Audio Generation using Fewer Labelled Audio Data
Kazi Nazmul Haque
R. Rana
John H. L. Hansen
Björn Schuller
GAN
38
3
0
05 Mar 2020
Disentangled Speech Embeddings using Cross-modal Self-supervision
Disentangled Speech Embeddings using Cross-modal Self-supervision
Arsha Nagrani
Joon Son Chung
Samuel Albanie
Andrew Zisserman
SSL
57
88
0
20 Feb 2020
Unsupervised pretraining transfers well across languages
Unsupervised pretraining transfers well across languages
M. Rivière
Armand Joulin
Pierre-Emmanuel Mazaré
Emmanuel Dupoux
SSL
VLM
42
208
0
07 Feb 2020
Learning Robust and Multilingual Speech Representations
Learning Robust and Multilingual Speech Representations
Kazuya Kawakami
Luyu Wang
Chris Dyer
Phil Blunsom
Aaron van den Oord
SSL
68
100
0
29 Jan 2020
Deep Representation Learning in Speech Processing: Challenges, Recent
  Advances, and Future Trends
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
Junaid Qadir
Björn W. Schuller
AI4TS
84
82
0
02 Jan 2020
PyTorch: An Imperative Style, High-Performance Deep Learning Library
PyTorch: An Imperative Style, High-Performance Deep Learning Library
Adam Paszke
Sam Gross
Francisco Massa
Adam Lerer
James Bradbury
...
Sasank Chilamkurthy
Benoit Steiner
Lu Fang
Junjie Bai
Soumith Chintala
ODL
354
42,299
0
03 Dec 2019
Analyzing and Improving the Image Quality of StyleGAN
Analyzing and Improving the Image Quality of StyleGAN
Tero Karras
S. Laine
M. Aittala
Janne Hellsten
J. Lehtinen
Timo Aila
GAN
260
5,797
0
03 Dec 2019
Effectiveness of self-supervised pre-training for speech recognition
Effectiveness of self-supervised pre-training for speech recognition
Alexei Baevski
Michael Auli
Abdel-rahman Mohamed
SSL
69
147
0
10 Nov 2019
Learning audio representations via phase prediction
Learning audio representations via phase prediction
Félix de Chaumont Quitry
Marco Tagliasacchi
Dominik Roblek
SSL
AI4TS
33
10
0
25 Oct 2019
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations
Alexei Baevski
Steffen Schneider
Michael Auli
SSL
128
666
0
12 Oct 2019
VL-BERT: Pre-training of Generic Visual-Linguistic Representations
VL-BERT: Pre-training of Generic Visual-Linguistic Representations
Weijie Su
Xizhou Zhu
Yue Cao
Bin Li
Lewei Lu
Furu Wei
Jifeng Dai
VLM
MLLM
SSL
142
1,661
0
22 Aug 2019
Multi-Task Semi-Supervised Adversarial Autoencoding for Speech Emotion
  Recognition
Multi-Task Semi-Supervised Adversarial Autoencoding for Speech Emotion Recognition
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
J. Epps
Björn W. Schuller
75
99
0
13 Jul 2019
Large Scale Adversarial Representation Learning
Large Scale Adversarial Representation Learning
Jeff Donahue
Karen Simonyan
SSL
122
543
0
04 Jul 2019
Self-Supervised Dialogue Learning
Self-Supervised Dialogue Learning
Jiawei Wu
Xin Eric Wang
William Yang Wang
SSL
33
58
0
30 Jun 2019
Self-Supervised Learning for Contextualized Extractive Summarization
Self-Supervised Learning for Contextualized Extractive Summarization
Hong Wang
Xin Eric Wang
Wenhan Xiong
Mo Yu
Xiaoxiao Guo
Shiyu Chang
William Yang Wang
SSL
88
56
0
11 Jun 2019
SpecAugment: A Simple Data Augmentation Method for Automatic Speech
  Recognition
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
Daniel S. Park
William Chan
Yu Zhang
Chung-Cheng Chiu
Barret Zoph
E. D. Cubuk
Quoc V. Le
VLM
159
3,451
0
18 Apr 2019
Direct Modelling of Speech Emotion from Raw Speech
Direct Modelling of Speech Emotion from Raw Speech
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
J. Epps
65
103
0
08 Apr 2019
Self-Supervised Learning via Conditional Motion Propagation
Self-Supervised Learning via Conditional Motion Propagation
Xiaohang Zhan
Xingang Pan
Ziwei Liu
Dahua Lin
Chen Change Loy
SSL
61
47
0
27 Mar 2019
High-Fidelity Image Generation With Fewer Labels
High-Fidelity Image Generation With Fewer Labels
Mario Lucic
Michael Tschannen
Marvin Ritter
Xiaohua Zhai
Olivier Bachem
Sylvain Gelly
GAN
OOD
77
158
0
06 Mar 2019
GANSynth: Adversarial Neural Audio Synthesis
GANSynth: Adversarial Neural Audio Synthesis
Jesse Engel
Kumar Krishna Agrawal
Shuo Chen
Ishaan Gulrajani
Chris Donahue
Adam Roberts
79
392
0
23 Feb 2019
Adversarial Generation of Time-Frequency Features with application in
  audio synthesis
Adversarial Generation of Time-Frequency Features with application in audio synthesis
Andrés Marafioti
Nicki Holighaus
Nathanael Perraudin
P. Majdak
33
68
0
11 Feb 2019
Transfer Learning From Sound Representations For Anger Detection in
  Speech
Transfer Learning From Sound Representations For Anger Detection in Speech
M. Elshaer
Scott Wisdom
Taniya Mishra
75
17
0
06 Feb 2019
Self-Supervised Generalisation with Meta Auxiliary Learning
Self-Supervised Generalisation with Meta Auxiliary Learning
Shikun Liu
Andrew J. Davison
Edward Johns
SSL
OOD
50
163
0
25 Jan 2019
Unsupervised speech representation learning using WaveNet autoencoders
Unsupervised speech representation learning using WaveNet autoencoders
J. Chorowski
Ron J. Weiss
Samy Bengio
Aaron van den Oord
SSL
72
318
0
25 Jan 2019
A Style-Based Generator Architecture for Generative Adversarial Networks
A Style-Based Generator Architecture for Generative Adversarial Networks
Tero Karras
S. Laine
Timo Aila
529
10,527
0
12 Dec 2018
Challenging Common Assumptions in the Unsupervised Learning of
  Disentangled Representations
Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations
Francesco Locatello
Stefan Bauer
Mario Lucic
Gunnar Rätsch
Sylvain Gelly
Bernhard Schölkopf
Olivier Bachem
OOD
111
1,466
0
29 Nov 2018
Training neural audio classifiers with few data
Training neural audio classifiers with few data
Jordi Pons
Joan Serrà
Xavier Serra
62
57
0
24 Oct 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.4K
94,511
0
11 Oct 2018
Large Scale GAN Training for High Fidelity Natural Image Synthesis
Large Scale GAN Training for High Fidelity Natural Image Synthesis
Andrew Brock
Jeff Donahue
Karen Simonyan
237
5,381
0
28 Sep 2018
How good is my GAN?
How good is my GAN?
K. Shmelkov
Cordelia Schmid
Alahari Karteek
GAN
EGVM
51
348
0
25 Jul 2018
Representation Learning with Contrastive Predictive Coding
Representation Learning with Contrastive Predictive Coding
Aaron van den Oord
Yazhe Li
Oriol Vinyals
DRL
SSL
280
10,253
0
10 Jul 2018
Adversarial Auto-encoders for Speech Based Emotion Recognition
Adversarial Auto-encoders for Speech Based Emotion Recognition
Saurabh Sahu
Rahul Gupta
Ganesh Sivaraman
Wael AbdAlmageed
C. Espy-Wilson
GAN
50
66
0
06 Jun 2018
Speech Commands: A Dataset for Limited-Vocabulary Speech Recognition
Speech Commands: A Dataset for Limited-Vocabulary Speech Recognition
Pete Warden
74
1,615
0
09 Apr 2018
Speech2Vec: A Sequence-to-Sequence Framework for Learning Word
  Embeddings from Speech
Speech2Vec: A Sequence-to-Sequence Framework for Learning Word Embeddings from Speech
Yu-An Chung
James R. Glass
3DV
64
184
0
23 Mar 2018
Unsupervised Representation Learning by Predicting Image Rotations
Unsupervised Representation Learning by Predicting Image Rotations
Spyros Gidaris
Praveer Singh
N. Komodakis
OOD
SSL
DRL
227
3,283
0
21 Mar 2018
Adversarial Audio Synthesis
Adversarial Audio Synthesis
Chris Donahue
Julian McAuley
M. Puckette
GAN
133
610
0
12 Feb 2018
Image denoising and restoration with CNN-LSTM Encoder Decoder with
  Direct Attention
Image denoising and restoration with CNN-LSTM Encoder Decoder with Direct Attention
Kazi Nazmul Haque
M. Yousuf
R. Rana
3DV
36
21
0
16 Jan 2018
A Note on the Inception Score
A Note on the Inception Score
Shane T. Barratt
Rishi Sharma
EGVM
86
691
0
06 Jan 2018
Knowledge Transfer from Weakly Labeled Audio using Convolutional Neural
  Network for Sound Events and Scenes
Knowledge Transfer from Weakly Labeled Audio using Convolutional Neural Network for Sound Events and Scenes
Anurag Kumar
Maksim Khadkevich
C. Fügen
46
140
0
04 Nov 2017
Progressive Growing of GANs for Improved Quality, Stability, and
  Variation
Progressive Growing of GANs for Improved Quality, Stability, and Variation
Tero Karras
Timo Aila
S. Laine
J. Lehtinen
GAN
118
7,339
0
27 Oct 2017
Mixed Precision Training
Mixed Precision Training
Paulius Micikevicius
Sharan Narang
Jonah Alben
G. Diamos
Erich Elsen
...
Boris Ginsburg
Michael Houston
Oleksii Kuchaiev
Ganesh Venkatesh
Hao Wu
149
1,792
0
10 Oct 2017
Semi-supervised Conditional GANs
Semi-supervised Conditional GANs
K. Sricharan
R. Bala
Matthew Shreve
Hui Ding
K. Saketh
J. Sun
GAN
41
54
0
19 Aug 2017
Unsupervised Domain Adaptation for Robust Speech Recognition via
  Variational Autoencoder-Based Data Augmentation
Unsupervised Domain Adaptation for Robust Speech Recognition via Variational Autoencoder-Based Data Augmentation
Wei-Ning Hsu
Yu Zhang
James R. Glass
44
127
0
19 Jul 2017
Guiding InfoGAN with Semi-Supervision
Guiding InfoGAN with Semi-Supervision
Adrian Spurr
Emre Aksan
Otmar Hilliges
GAN
57
47
0
14 Jul 2017
Adversarial Network Bottleneck Features for Noise Robust Speaker
  Verification
Adversarial Network Bottleneck Features for Noise Robust Speaker Verification
Hong Yu
Zheng-Hua Tan
Zhanyu Ma
Jun Guo
AAML
42
33
0
11 Jun 2017
InfoVAE: Information Maximizing Variational Autoencoders
InfoVAE: Information Maximizing Variational Autoencoders
Shengjia Zhao
Jiaming Song
Stefano Ermon
DRL
79
445
0
07 Jun 2017
Learning Representations of Emotional Speech with Deep Convolutional
  Generative Adversarial Networks
Learning Representations of Emotional Speech with Deep Convolutional Generative Adversarial Networks
Jonathan D. Chang
Stefan Scherer
SSL
GAN
43
104
0
22 Apr 2017
Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders
Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders
Jesse Engel
Cinjon Resnick
Adam Roberts
Sander Dieleman
Douglas Eck
Karen Simonyan
Mohammad Norouzi
106
623
0
05 Apr 2017
Tacotron: Towards End-to-End Speech Synthesis
Tacotron: Towards End-to-End Speech Synthesis
Yuxuan Wang
RJ Skerry-Ryan
Daisy Stanton
Yonghui Wu
Ron J. Weiss
...
Samy Bengio
Quoc V. Le
Yannis Agiomyrgiannakis
R. Clark
Rif A. Saurous
153
1,819
0
29 Mar 2017
12
Next