Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1812.00271
Cited By
Learning Speaker Representations with Mutual Information
1 December 2018
Mirco Ravanelli
Yoshua Bengio
SSL
DRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Learning Speaker Representations with Mutual Information"
50 / 55 papers shown
Title
Universal Pooling Method of Multi-layer Features from Pretrained Models for Speaker Verification
Jin Sob Kim
Hyun Joon Park
Wooseok Shin
Sung Won Han
SLR
50
0
0
12 Sep 2024
Toward Improving Synthetic Audio Spoofing Detection Robustness via Meta-Learning and Disentangled Training With Adversarial Examples
Zhenyu Wang
John H. L. Hansen
AAML
38
1
0
23 Aug 2024
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning
Shuai Wang
Zheng-Shou Chen
Kong Aik Lee
Yan-min Qian
Haizhou Li
42
4
0
21 Jul 2024
Contrastive Learning from Synthetic Audio Doppelgängers
Manuel Cherep
Nikhil Singh
42
1
0
09 Jun 2024
Towards Supervised Performance on Speaker Verification with Self-Supervised Learning by Leveraging Large-Scale ASR Models
Victor Miara
Theo Lepage
Reda Dehak
37
1
0
04 Jun 2024
SKILL: Similarity-aware Knowledge distILLation for Speech Self-Supervised Learning
Luca Zampierin
G. B. Hacene
Bac Nguyen
Mirco Ravanelli
46
2
0
26 Feb 2024
Multi-Domain Adaptation by Self-Supervised Learning for Speaker Verification
Wan Lin
Lantian Li
D. Wang
23
2
0
25 Sep 2023
BEATs: Audio Pre-Training with Acoustic Tokenizers
Sanyuan Chen
Yu-Huan Wu
Chengyi Wang
Shujie Liu
Daniel C. Tompkins
Zhuo Chen
Furu Wei
41
258
0
18 Dec 2022
NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling
Chi-Chang Lee
Cheng-Hung Hu
Yu-Chen Lin
Chu-Song Chen
Hsin-Min Wang
Yu Tsao
41
2
0
18 Jun 2022
Self-Supervised Speech Representation Learning: A Review
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSL
AI4TS
137
352
0
21 May 2022
Cross-modal Contrastive Learning for Speech Translation
Rong Ye
Mingxuan Wang
Lei Li
SSL
27
84
0
05 May 2022
Audio Self-supervised Learning: A Survey
Shuo Liu
Adria Mallol-Ragolta
Emilia Parada-Cabeleiro
Kun Qian
Xingshuo Jing
Alexander Kathan
Bin Hu
Bjoern W. Schuller
SSL
40
106
0
02 Mar 2022
Bootstrap Equilibrium and Probabilistic Speaker Representation Learning for Self-supervised Speaker Verification
Sung Hwan Mun
Min Hyun Han
Dongjune Lee
Jihwan Kim
N. Kim
SSL
43
3
0
16 Dec 2021
SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech
Suwon Shon
Ankita Pasad
Felix Wu
Pablo Brusco
Yoav Artzi
Karen Livescu
Kyu Jeong Han
AuLLM
ELM
45
74
0
19 Nov 2021
Multi network InfoMax: A pre-training method involving graph convolutional networks
Usman Mahmood
Z. Fu
Vince D. Calhoun
Sergey Plis
AI4CE
14
1
0
01 Nov 2021
Learning Speaker Representation with Semi-supervised Learning approach for Speaker Profiling
Shangeth Rajaa
Pham Van Tung
Chng Eng Siong
36
5
0
24 Oct 2021
ProtoInfoMax: Prototypical Networks with Mutual Information Maximization for Out-of-Domain Detection
Iftitahu Ni'mah
Meng Fang
Vlado Menkovski
Mykola Pechenizkiy
35
5
0
27 Aug 2021
Towards quantifying information flows: relative entropy in deep neural networks and the renormalization group
J. Erdmenger
Kevin T. Grosvenor
R. Jefferson
54
17
0
14 Jul 2021
AID-Purifier: A Light Auxiliary Network for Boosting Adversarial Defense
Duhun Hwang
Eunjung Lee
Wonjong Rhee
AAML
167
15
0
14 Jul 2021
OPT: Omni-Perception Pre-Trainer for Cross-Modal Understanding and Generation
Jing Liu
Xinxin Zhu
Fei Liu
Longteng Guo
Zijia Zhao
...
Weining Wang
Hanqing Lu
Shiyu Zhou
Jiajun Zhang
Jinqiao Wang
39
37
0
01 Jul 2021
Coherent, super resolved radar beamforming using self-supervised learning
Itai Orr
Moshik Cohen
Harel Damari
M. Halachmi
Z. Zalevsky
33
14
0
21 Jun 2021
VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-shot Voice Conversion
Disong Wang
Liqun Deng
Y. Yeung
Xiao Chen
Xunying Liu
Helen Meng
DRL
22
136
0
18 Jun 2021
LiRA: Learning Visual Speech Representations from Audio through Self-supervision
Pingchuan Ma
Rodrigo Mira
Stavros Petridis
Björn W. Schuller
M. Pantic
SSL
24
53
0
16 Jun 2021
Speaker embeddings by modeling channel-wise correlations
Themos Stafylakis
Johan Rohdin
L. Burget
11
9
0
06 Apr 2021
Tune-In: Training Under Negative Environments with Interference for Attention Networks Simulating Cocktail Party Effect
Jun Wang
Max W. Y. Lam
Dan Su
Dong Yu
22
6
0
02 Mar 2021
Improving speech recognition models with small samples for air traffic control systems
Yi Lin
Qin Li
Bo Yang
Zhen Yan
Huachun Tan
Zhengmao Chen
34
32
0
16 Feb 2021
HDMI: High-order Deep Multiplex Infomax
Baoyu Jing
Chanyoung Park
Hanghang Tong
98
164
0
15 Feb 2021
An iterative framework for self-supervised deep speaker representation learning
Danwei Cai
Weiqing Wang
Ming Li
SSL
16
37
0
25 Oct 2020
Unsupervised Representation Learning for Speaker Recognition via Contrastive Equilibrium Learning
Sung Hwan Mun
Woohyun Kang
Min Hyun Han
N. Kim
SSL
49
21
0
22 Oct 2020
Learning Speaker Embedding from Text-to-Speech
Jaejin Cho
Piotr Żelasko
Jesus Villalba
Shinji Watanabe
Najim Dehak
29
10
0
21 Oct 2020
Pretraining Techniques for Sequence-to-Sequence Voice Conversion
Wen-Chin Huang
Tomoki Hayashi
Yi-Chiao Wu
Hirokazu Kameoka
T. Toda
27
38
0
07 Aug 2020
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning
Jing-Xuan Zhang
Zhenhua Ling
Lirong Dai
15
6
0
05 Aug 2020
Intra-class variation reduction of speaker representation in disentanglement framework
Yoohwan Kwon
Soo-Whan Chung
Hong-Goo Kang
DRL
14
21
0
04 Aug 2020
Whole MILC: generalizing learned dynamics across tasks, datasets, and populations
Usman Mahmood
Md. Mahfuzur Rahman
A. Fedorov
N. Lewis
Z. Fu
Vince D. Calhoun
Sergey Plis
25
22
0
29 Jul 2020
Augmentation adversarial training for self-supervised speaker recognition
Jaesung Huh
Hee-Soo Heo
Jingu Kang
Shinji Watanabe
Joon Son Chung
SSL
48
76
0
23 Jul 2020
Whitening for Self-Supervised Representation Learning
Aleksandr Ermolov
Aliaksandr Siarohin
E. Sangineto
N. Sebe
SSL
33
309
0
13 Jul 2020
Self-supervised Learning for Speech Enhancement
Yuchun Wang
Shrikant Venkataramani
Paris Smaragdis
SSL
19
31
0
18 Jun 2020
A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition
Dongwei Jiang
Wubo Li
Ruixiong Zhang
Miao Cao
Ne Luo
Yang Han
Wei Zou
Xiangang Li
SSL
25
29
0
20 May 2020
Segment Aggregation for short utterances speaker verification using raw waveforms
Seung-bin Kim
Jee-weon Jung
Hye-jin Shim
Ju-ho Kim
Ha-Jin Yu
6
5
0
07 May 2020
Does Visual Self-Supervision Improve Learning of Speech Representations for Emotion Recognition?
Abhinav Shukla
Stavros Petridis
M. Pantic
SSL
32
28
0
04 May 2020
Improved RawNet with Feature Map Scaling for Text-independent Speaker Verification using Raw Waveforms
Jee-weon Jung
Seung-bin Kim
Hye-jin Shim
Ju-ho Kim
Ha-Jin Yu
18
60
0
01 Apr 2020
An end-to-end approach for the verification problem: learning the right distance
João Monteiro
Isabela Albuquerque
Md. Jahangir Alam
R. Devon Hjelm
T. Falk
24
6
0
21 Feb 2020
An initial investigation on optimizing tandem speaker verification and countermeasure systems using reinforcement learning
Anssi Kanervisto
Ville Hautamaki
Tomi Kinnunen
Junichi Yamagishi
16
2
0
06 Feb 2020
Multi-task self-supervised learning for Robust Speech Recognition
Mirco Ravanelli
Jianyuan Zhong
Santiago Pascual
P. Swietojanski
João Monteiro
J. Trmal
Yoshua Bengio
SSL
189
288
0
25 Jan 2020
Visually Guided Self Supervised Learning of Speech Representations
Abhinav Shukla
Konstantinos Vougioukas
Pingchuan Ma
Stavros Petridis
M. Pantic
SSL
27
24
0
13 Jan 2020
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
Junaid Qadir
Björn W. Schuller
AI4TS
32
81
0
02 Jan 2020
Learnt dynamics generalizes across tasks, datasets, and populations
Usman Mahmood
M. M. Rahman
A. Fedorov
Z. Fu
Vince D. Calhoun
Sergey Plis
22
4
0
04 Dec 2019
Biometrics Recognition Using Deep Learning: A Survey
Shervin Minaee
AmirAli Abdolrashidi
Hang Su
Bennamoun
David C. Zhang
23
84
0
30 Nov 2019
Unsupervised Attributed Multiplex Network Embedding
Chanyoung Park
Donghyun Kim
Jiawei Han
Hwanjo Yu
35
249
0
15 Nov 2019
Deep learning methods in speaker recognition: a review
Dávid Sztahó
György Szaszák
A. Beke
VLM
25
46
0
14 Nov 2019
1
2
Next