ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1812.00271
  4. Cited By
Learning Speaker Representations with Mutual Information

Learning Speaker Representations with Mutual Information

1 December 2018
Mirco Ravanelli
Yoshua Bengio
    SSL
    DRL
ArXivPDFHTML

Papers citing "Learning Speaker Representations with Mutual Information"

50 / 55 papers shown
Title
Universal Pooling Method of Multi-layer Features from Pretrained Models
  for Speaker Verification
Universal Pooling Method of Multi-layer Features from Pretrained Models for Speaker Verification
Jin Sob Kim
Hyun Joon Park
Wooseok Shin
Sung Won Han
SLR
50
0
0
12 Sep 2024
Toward Improving Synthetic Audio Spoofing Detection Robustness via
  Meta-Learning and Disentangled Training With Adversarial Examples
Toward Improving Synthetic Audio Spoofing Detection Robustness via Meta-Learning and Disentangled Training With Adversarial Examples
Zhenyu Wang
John H. L. Hansen
AAML
40
1
0
23 Aug 2024
Overview of Speaker Modeling and Its Applications: From the Lens of Deep
  Speaker Representation Learning
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning
Shuai Wang
Zheng-Shou Chen
Kong Aik Lee
Yan-min Qian
Haizhou Li
44
4
0
21 Jul 2024
Contrastive Learning from Synthetic Audio Doppelgängers
Contrastive Learning from Synthetic Audio Doppelgängers
Manuel Cherep
Nikhil Singh
45
1
0
09 Jun 2024
Towards Supervised Performance on Speaker Verification with
  Self-Supervised Learning by Leveraging Large-Scale ASR Models
Towards Supervised Performance on Speaker Verification with Self-Supervised Learning by Leveraging Large-Scale ASR Models
Victor Miara
Theo Lepage
Reda Dehak
37
1
0
04 Jun 2024
SKILL: Similarity-aware Knowledge distILLation for Speech
  Self-Supervised Learning
SKILL: Similarity-aware Knowledge distILLation for Speech Self-Supervised Learning
Luca Zampierin
G. B. Hacene
Bac Nguyen
Mirco Ravanelli
46
2
0
26 Feb 2024
Multi-Domain Adaptation by Self-Supervised Learning for Speaker
  Verification
Multi-Domain Adaptation by Self-Supervised Learning for Speaker Verification
Wan Lin
Lantian Li
D. Wang
23
2
0
25 Sep 2023
BEATs: Audio Pre-Training with Acoustic Tokenizers
BEATs: Audio Pre-Training with Acoustic Tokenizers
Sanyuan Chen
Yu-Huan Wu
Chengyi Wang
Shujie Liu
Daniel C. Tompkins
Zhuo Chen
Furu Wei
41
258
0
18 Dec 2022
NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional
  Resampling
NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling
Chi-Chang Lee
Cheng-Hung Hu
Yu-Chen Lin
Chu-Song Chen
Hsin-Min Wang
Yu Tsao
41
2
0
18 Jun 2022
Self-Supervised Speech Representation Learning: A Review
Self-Supervised Speech Representation Learning: A Review
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSL
AI4TS
137
352
0
21 May 2022
Cross-modal Contrastive Learning for Speech Translation
Cross-modal Contrastive Learning for Speech Translation
Rong Ye
Mingxuan Wang
Lei Li
SSL
27
84
0
05 May 2022
Audio Self-supervised Learning: A Survey
Audio Self-supervised Learning: A Survey
Shuo Liu
Adria Mallol-Ragolta
Emilia Parada-Cabeleiro
Kun Qian
Xingshuo Jing
Alexander Kathan
Bin Hu
Bjoern W. Schuller
SSL
42
106
0
02 Mar 2022
Bootstrap Equilibrium and Probabilistic Speaker Representation Learning
  for Self-supervised Speaker Verification
Bootstrap Equilibrium and Probabilistic Speaker Representation Learning for Self-supervised Speaker Verification
Sung Hwan Mun
Min Hyun Han
Dongjune Lee
Jihwan Kim
N. Kim
SSL
43
3
0
16 Dec 2021
SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation
  on Natural Speech
SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech
Suwon Shon
Ankita Pasad
Felix Wu
Pablo Brusco
Yoav Artzi
Karen Livescu
Kyu Jeong Han
AuLLM
ELM
45
74
0
19 Nov 2021
Multi network InfoMax: A pre-training method involving graph
  convolutional networks
Multi network InfoMax: A pre-training method involving graph convolutional networks
Usman Mahmood
Z. Fu
Vince D. Calhoun
Sergey Plis
AI4CE
14
1
0
01 Nov 2021
Learning Speaker Representation with Semi-supervised Learning approach
  for Speaker Profiling
Learning Speaker Representation with Semi-supervised Learning approach for Speaker Profiling
Shangeth Rajaa
Pham Van Tung
Chng Eng Siong
38
5
0
24 Oct 2021
ProtoInfoMax: Prototypical Networks with Mutual Information Maximization
  for Out-of-Domain Detection
ProtoInfoMax: Prototypical Networks with Mutual Information Maximization for Out-of-Domain Detection
Iftitahu Ni'mah
Meng Fang
Vlado Menkovski
Mykola Pechenizkiy
35
5
0
27 Aug 2021
Towards quantifying information flows: relative entropy in deep neural
  networks and the renormalization group
Towards quantifying information flows: relative entropy in deep neural networks and the renormalization group
J. Erdmenger
Kevin T. Grosvenor
R. Jefferson
57
17
0
14 Jul 2021
AID-Purifier: A Light Auxiliary Network for Boosting Adversarial Defense
AID-Purifier: A Light Auxiliary Network for Boosting Adversarial Defense
Duhun Hwang
Eunjung Lee
Wonjong Rhee
AAML
167
15
0
14 Jul 2021
OPT: Omni-Perception Pre-Trainer for Cross-Modal Understanding and
  Generation
OPT: Omni-Perception Pre-Trainer for Cross-Modal Understanding and Generation
Jing Liu
Xinxin Zhu
Fei Liu
Longteng Guo
Zijia Zhao
...
Weining Wang
Hanqing Lu
Shiyu Zhou
Jiajun Zhang
Jinqiao Wang
39
37
0
01 Jul 2021
Coherent, super resolved radar beamforming using self-supervised
  learning
Coherent, super resolved radar beamforming using self-supervised learning
Itai Orr
Moshik Cohen
Harel Damari
M. Halachmi
Z. Zalevsky
33
14
0
21 Jun 2021
VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised
  Speech Representation Disentanglement for One-shot Voice Conversion
VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-shot Voice Conversion
Disong Wang
Liqun Deng
Y. Yeung
Xiao Chen
Xunying Liu
Helen Meng
DRL
22
136
0
18 Jun 2021
LiRA: Learning Visual Speech Representations from Audio through
  Self-supervision
LiRA: Learning Visual Speech Representations from Audio through Self-supervision
Pingchuan Ma
Rodrigo Mira
Stavros Petridis
Björn W. Schuller
Maja Pantic
SSL
24
53
0
16 Jun 2021
Speaker embeddings by modeling channel-wise correlations
Speaker embeddings by modeling channel-wise correlations
Themos Stafylakis
Johan Rohdin
L. Burget
16
9
0
06 Apr 2021
Tune-In: Training Under Negative Environments with Interference for
  Attention Networks Simulating Cocktail Party Effect
Tune-In: Training Under Negative Environments with Interference for Attention Networks Simulating Cocktail Party Effect
Jun Wang
Max W. Y. Lam
Dan Su
Dong Yu
22
6
0
02 Mar 2021
Improving speech recognition models with small samples for air traffic
  control systems
Improving speech recognition models with small samples for air traffic control systems
Yi Lin
Qin Li
Bo Yang
Zhen Yan
Huachun Tan
Zhengmao Chen
34
32
0
16 Feb 2021
HDMI: High-order Deep Multiplex Infomax
HDMI: High-order Deep Multiplex Infomax
Baoyu Jing
Chanyoung Park
Hanghang Tong
98
164
0
15 Feb 2021
An iterative framework for self-supervised deep speaker representation
  learning
An iterative framework for self-supervised deep speaker representation learning
Danwei Cai
Weiqing Wang
Ming Li
SSL
19
37
0
25 Oct 2020
Unsupervised Representation Learning for Speaker Recognition via
  Contrastive Equilibrium Learning
Unsupervised Representation Learning for Speaker Recognition via Contrastive Equilibrium Learning
Sung Hwan Mun
Woohyun Kang
Min Hyun Han
N. Kim
SSL
49
21
0
22 Oct 2020
Learning Speaker Embedding from Text-to-Speech
Learning Speaker Embedding from Text-to-Speech
Jaejin Cho
Piotr Żelasko
Jesus Villalba
Shinji Watanabe
Najim Dehak
31
10
0
21 Oct 2020
Pretraining Techniques for Sequence-to-Sequence Voice Conversion
Pretraining Techniques for Sequence-to-Sequence Voice Conversion
Wen-Chin Huang
Tomoki Hayashi
Yi-Chiao Wu
Hirokazu Kameoka
T. Toda
27
38
0
07 Aug 2020
Recognition-Synthesis Based Non-Parallel Voice Conversion with
  Adversarial Learning
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning
Jing-Xuan Zhang
Zhenhua Ling
Lirong Dai
15
6
0
05 Aug 2020
Intra-class variation reduction of speaker representation in
  disentanglement framework
Intra-class variation reduction of speaker representation in disentanglement framework
Yoohwan Kwon
Soo-Whan Chung
Hong-Goo Kang
DRL
14
21
0
04 Aug 2020
Whole MILC: generalizing learned dynamics across tasks, datasets, and
  populations
Whole MILC: generalizing learned dynamics across tasks, datasets, and populations
Usman Mahmood
Md. Mahfuzur Rahman
A. Fedorov
N. Lewis
Z. Fu
Vince D. Calhoun
Sergey Plis
25
22
0
29 Jul 2020
Augmentation adversarial training for self-supervised speaker
  recognition
Augmentation adversarial training for self-supervised speaker recognition
Jaesung Huh
Hee-Soo Heo
Jingu Kang
Shinji Watanabe
Joon Son Chung
SSL
48
76
0
23 Jul 2020
Whitening for Self-Supervised Representation Learning
Whitening for Self-Supervised Representation Learning
Aleksandr Ermolov
Aliaksandr Siarohin
E. Sangineto
N. Sebe
SSL
33
309
0
13 Jul 2020
Self-supervised Learning for Speech Enhancement
Self-supervised Learning for Speech Enhancement
Yuchun Wang
Shrikant Venkataramani
Paris Smaragdis
SSL
19
31
0
18 Jun 2020
A Further Study of Unsupervised Pre-training for Transformer Based
  Speech Recognition
A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition
Dongwei Jiang
Wubo Li
Ruixiong Zhang
Miao Cao
Ne Luo
Yang Han
Wei Zou
Xiangang Li
SSL
25
29
0
20 May 2020
Segment Aggregation for short utterances speaker verification using raw
  waveforms
Segment Aggregation for short utterances speaker verification using raw waveforms
Seung-bin Kim
Jee-weon Jung
Hye-jin Shim
Ju-ho Kim
Ha-Jin Yu
11
5
0
07 May 2020
Does Visual Self-Supervision Improve Learning of Speech Representations
  for Emotion Recognition?
Does Visual Self-Supervision Improve Learning of Speech Representations for Emotion Recognition?
Abhinav Shukla
Stavros Petridis
Maja Pantic
SSL
32
28
0
04 May 2020
Improved RawNet with Feature Map Scaling for Text-independent Speaker
  Verification using Raw Waveforms
Improved RawNet with Feature Map Scaling for Text-independent Speaker Verification using Raw Waveforms
Jee-weon Jung
Seung-bin Kim
Hye-jin Shim
Ju-ho Kim
Ha-Jin Yu
23
60
0
01 Apr 2020
An end-to-end approach for the verification problem: learning the right
  distance
An end-to-end approach for the verification problem: learning the right distance
João Monteiro
Isabela Albuquerque
Md. Jahangir Alam
R. Devon Hjelm
T. Falk
26
6
0
21 Feb 2020
An initial investigation on optimizing tandem speaker verification and
  countermeasure systems using reinforcement learning
An initial investigation on optimizing tandem speaker verification and countermeasure systems using reinforcement learning
Anssi Kanervisto
Ville Hautamaki
Tomi Kinnunen
Junichi Yamagishi
19
2
0
06 Feb 2020
Multi-task self-supervised learning for Robust Speech Recognition
Multi-task self-supervised learning for Robust Speech Recognition
Mirco Ravanelli
Jianyuan Zhong
Santiago Pascual
P. Swietojanski
João Monteiro
J. Trmal
Yoshua Bengio
SSL
189
288
0
25 Jan 2020
Visually Guided Self Supervised Learning of Speech Representations
Visually Guided Self Supervised Learning of Speech Representations
Abhinav Shukla
Konstantinos Vougioukas
Pingchuan Ma
Stavros Petridis
Maja Pantic
SSL
27
24
0
13 Jan 2020
Deep Representation Learning in Speech Processing: Challenges, Recent
  Advances, and Future Trends
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
Junaid Qadir
Björn W. Schuller
AI4TS
34
81
0
02 Jan 2020
Learnt dynamics generalizes across tasks, datasets, and populations
Learnt dynamics generalizes across tasks, datasets, and populations
Usman Mahmood
M. M. Rahman
A. Fedorov
Z. Fu
Vince D. Calhoun
Sergey Plis
22
4
0
04 Dec 2019
Biometrics Recognition Using Deep Learning: A Survey
Biometrics Recognition Using Deep Learning: A Survey
Shervin Minaee
AmirAli Abdolrashidi
Hang Su
Bennamoun
David C. Zhang
26
84
0
30 Nov 2019
Unsupervised Attributed Multiplex Network Embedding
Unsupervised Attributed Multiplex Network Embedding
Chanyoung Park
Donghyun Kim
Jiawei Han
Hwanjo Yu
35
249
0
15 Nov 2019
Deep learning methods in speaker recognition: a review
Deep learning methods in speaker recognition: a review
Dávid Sztahó
György Szaszák
A. Beke
VLM
28
46
0
14 Nov 2019
12
Next