Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1709.07902
Cited By
Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data
22 September 2017
Wei-Ning Hsu
Yu Zhang
James R. Glass
BDL
SSL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data"
50 / 60 papers shown
Title
Bird Vocalization Embedding Extraction Using Self-Supervised Disentangled Representation Learning
Runwu Shi
Katsutoshi Itoyama
K. Nakadai
SSL
DRL
44
1
0
31 Dec 2024
Unsupervised Representation Learning from Sparse Transformation Analysis
Yue Song
Thomas Anderson Keller
Yisong Yue
Pietro Perona
Max Welling
DRL
33
0
0
07 Oct 2024
Cross-Utterance Conditioned VAE for Speech Generation
Yong Li
Cheng Yu
Guangzhi Sun
Weiqin Zu
Zheng Tian
...
Wei Pan
Chao Zhang
Jun Wang
Yang Yang
Fanglei Sun
21
2
0
08 Sep 2023
Multifactor Sequential Disentanglement via Structured Koopman Autoencoders
Nimrod Berman
Ilana D Naiman
Omri Azencot
CoGe
27
22
0
30 Mar 2023
Learning Interpretable Low-dimensional Representation via Physical Symmetry
Xuanjie Liu
Daniel Y. Chin
Yichen Huang
Gus Xia
26
3
0
05 Feb 2023
A Two-Stage Deep Representation Learning-Based Speech Enhancement Method Using Variational Autoencoder and Adversarial Training
Yang Xiang
Jesper Lisby Højvang
M. Rasmussen
M. G. Christensen
DRL
23
5
0
16 Nov 2022
Learning Invariant Representation and Risk Minimized for Unsupervised Accent Domain Adaptation
Chendong Zhao
Jianzong Wang
Xiaoyang Qu
Haoqian Wang
Jing Xiao
SSL
38
1
0
15 Oct 2022
AudioGen: Textually Guided Audio Generation
Felix Kreuk
Gabriel Synnaeve
Adam Polyak
Uriel Singer
Alexandre Défossez
Jade Copet
Devi Parikh
Yaniv Taigman
Yossi Adi
DiffM
27
289
0
30 Sep 2022
Non-Contrastive Self-supervised Learning for Utterance-Level Information Extraction from Speech
Jaejin Cho
Jesús Villalba
Laureano Moro Velázquez
Najim Dehak
SSL
39
18
0
10 Aug 2022
Learning Subject-Invariant Representations from Speech-Evoked EEG Using Variational Autoencoders
Lies Bollens
T. Francart
Hugo Van hamme
DRL
BDL
17
14
0
01 Jul 2022
Self-Supervised Speech Representation Learning: A Review
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSL
AI4TS
137
350
0
21 May 2022
Unsupervised Mismatch Localization in Cross-Modal Sequential Data with Application to Mispronunciations Localization
Wei Wei
Hengguan Huang
Xiangming Gu
Hao Wang
Ye Wang
BDL
22
0
0
05 May 2022
Improved far-field speech recognition using Joint Variational Autoencoder
Shashi Kumar
S. Rath
Abhishek Pandey
DRL
13
0
0
24 Apr 2022
A Brief Overview of Unsupervised Neural Speech Representation Learning
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
Lars Maaløe
Christian Igel
BDL
AI4TS
SSL
19
11
0
01 Mar 2022
Benchmarking Generative Latent Variable Models for Speech
Jakob Drachmann Havtorn
Lasse Borgholt
Søren Hauberg
J. Frellsen
Lars Maaløe
26
3
0
22 Feb 2022
Noise-robust voice conversion with domain adversarial training
Hongqiang Du
Lei Xie
Haizhou Li
19
11
0
26 Jan 2022
Hamiltonian latent operators for content and motion disentanglement in image sequences
Asif Khan
Amos Storkey
29
2
0
02 Dec 2021
Textless Speech Emotion Conversion using Discrete and Decomposed Representations
Felix Kreuk
Adam Polyak
Jade Copet
Eugene Kharitonov
Tu Nguyen
M. Rivière
Wei-Ning Hsu
Abdel-rahman Mohamed
Emmanuel Dupoux
Yossi Adi
25
29
0
14 Nov 2021
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Sanyuan Chen
Chengyi Wang
Zhengyang Chen
Yu-Huan Wu
Shujie Liu
...
Yao Qian
Jian Wu
Micheal Zeng
Xiangzhan Yu
Furu Wei
SSL
118
1,715
0
26 Oct 2021
Contrastively Disentangled Sequential Variational Autoencoder
M. Kiener
Weiran Wang
Michael Gerndt
CoGe
DRL
27
40
0
22 Oct 2021
A Benchmark of Dynamical Variational Autoencoders applied to Speech Spectrogram Modeling
Xiaoyu Bie
Laurent Girin
Simon Leglaive
Thomas Hueber
Xavier Alameda-Pineda
21
12
0
11 Jun 2021
Learning Robust Latent Representations for Controllable Speech Synthesis
Shakti Kumar
Jithin Pradeep
Hussain Zaidi
DRL
28
6
0
10 May 2021
Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
Wei-Ning Hsu
Anuroop Sriram
Alexei Baevski
Tatiana Likhomanenko
Qiantong Xu
...
Jacob Kahn
Ann Lee
R. Collobert
Gabriel Synnaeve
Michael Auli
SSL
22
236
0
02 Apr 2021
Interpretability and Explainability: A Machine Learning Zoo Mini-tour
Ricards Marcinkevics
Julia E. Vogt
XAI
28
119
0
03 Dec 2020
Mutual Information Based Method for Unsupervised Disentanglement of Video Representation
Aditya Sreekar
Ujjwal Tiwari
A. Namboodiri
DRL
26
4
0
17 Nov 2020
Unsupervised Learning of Disentangled Speech Content and Style Representation
Andros Tjandra
Ruoming Pang
Yu Zhang
Shigeki Karita
BDL
DRL
23
15
0
24 Oct 2020
THIN: THrowable Information Networks and Application for Facial Expression Recognition In The Wild
Estèphe Arnaud
Arnaud Dapogny
Kévin Bailly
CVBM
29
23
0
15 Oct 2020
Dynamic Future Net: Diversified Human Motion Generation
Wenheng Chen
He Wang
Yi Yuan
Tianjia Shao
Kun Zhou
3DH
32
22
0
25 Aug 2020
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning
Berrak Sisman
Junichi Yamagishi
Simon King
Haizhou Li
BDL
38
317
0
09 Aug 2020
Privacy-preserving Voice Analysis via Disentangled Representations
Ranya Aloufi
Hamed Haddadi
David E. Boyle
DRL
19
58
0
29 Jul 2020
PCAAE: Principal Component Analysis Autoencoder for organising the latent space of generative networks
Chi-Hieu Pham
Saïd Ladjal
A. Newson
DRL
19
31
0
14 Jun 2020
Contrastive Predictive Coding Supported Factorized Variational Autoencoder for Unsupervised Learning of Disentangled Speech Representations
Janek Ebbers
Michael Kuhlmann
Tobias Cord-Landwehr
Reinhold Haeb-Umbach
DRL
CoGe
SSL
25
4
0
26 May 2020
S3VAE: Self-Supervised Sequential VAE for Representation Disentanglement and Data Generation
Yizhe Zhu
Martin Renqiang Min
Asim Kadav
H. Graf
CoGe
DRL
27
95
0
23 May 2020
CausalVAE: Structured Causal Disentanglement in Variational Autoencoder
Girish A. Koushik
Furui Liu
Zhitang Chen
Xinwei Shen
Jianye Hao
Jun Wang
OOD
CoGe
CML
41
44
0
18 Apr 2020
Variational inference formulation for a model-free simulation of a dynamical system with unknown parameters by a recurrent neural network
K. Yeo
D. E. C. Grullon
Fan-Keng Sun
Duane S. Boning
Jayant Kalagnanam
BDL
21
3
0
02 Mar 2020
Weakly-Supervised Disentanglement Without Compromises
Francesco Locatello
Ben Poole
Gunnar Rätsch
Bernhard Schölkopf
Olivier Bachem
Michael Tschannen
CoGe
OOD
DRL
184
313
0
07 Feb 2020
Fully-hierarchical fine-grained prosody modeling for interpretable speech synthesis
Guangzhi Sun
Yu Zhang
Ron J. Weiss
Yuanbin Cao
Heiga Zen
Yonghui Wu
11
130
0
06 Feb 2020
Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion
Wen-Chin Huang
Hao Luo
Hsin-Te Hwang
Chen-Chou Lo
Yu-Huai Peng
Yu Tsao
Hsin-Min Wang
DRL
17
42
0
22 Jan 2020
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
Junaid Qadir
Björn W. Schuller
AI4TS
32
81
0
02 Jan 2020
Enhancing VAEs for Collaborative Filtering: Flexible Priors & Gating Mechanisms
Daeryong Kim
B. Suh
24
50
0
03 Nov 2019
Generative Pre-Training for Speech with Autoregressive Predictive Coding
Yu-An Chung
James R. Glass
SSL
20
173
0
23 Oct 2019
Independent Subspace Analysis for Unsupervised Learning of Disentangled Representations
Jan Stühmer
Richard Turner
Sebastian Nowozin
DRL
BDL
CoGe
117
25
0
05 Sep 2019
Learning Disentangled Representations of Timbre and Pitch for Musical Instrument Sounds Using Gaussian Mixture Variational Autoencoders
Yin-Jyun Luo
Kat R. Agres
Dorien Herremans
16
46
0
19 Jun 2019
Disentangling Factors of Variation Using Few Labels
Francesco Locatello
Michael Tschannen
Stefan Bauer
Gunnar Rätsch
Bernhard Schölkopf
Olivier Bachem
DRL
CML
CoGe
29
123
0
03 May 2019
A Multi-Task Approach for Disentangling Syntax and Semantics in Sentence Representations
Mingda Chen
Qingming Tang
Sam Wiseman
Kevin Gimpel
DRL
23
76
0
02 Apr 2019
Multi-modal Probabilistic Prediction of Interactive Behavior via an Interpretable Model
Yeping Hu
Wei Zhan
Liting Sun
Masayoshi Tomizuka
22
45
0
22 Mar 2019
Unsupervised Discovery of Parts, Structure, and Dynamics
Zhenjia Xu
Zhijian Liu
Chen Sun
Kevin Patrick Murphy
William T. Freeman
J. Tenenbaum
Jiajun Wu
OCL
30
61
0
12 Mar 2019
Domain Mismatch Robust Acoustic Scene Classification using Channel Information Conversion
Seongkyu Mun
Suwon Shon
16
21
0
04 Dec 2018
Unsupervised learning with contrastive latent variable models
Kristen A. Severson
S. Ghosh
Kenney Ng
SSL
DRL
19
38
0
14 Nov 2018
Interpreting Models by Allowing to Ask
Sungmin Kang
D. Park
Jaehyuk Chang
Jaegul Choo
13
0
0
13 Nov 2018
1
2
Next