ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2108.00917
  4. Cited By
Analyzing Speaker Information in Self-Supervised Models to Improve
  Zero-Resource Speech Processing

Analyzing Speaker Information in Self-Supervised Models to Improve Zero-Resource Speech Processing

2 August 2021
Benjamin van Niekerk
Leanne Nortje
Matthew Baas
Herman Kamper
    SSL
ArXivPDFHTML

Papers citing "Analyzing Speaker Information in Self-Supervised Models to Improve Zero-Resource Speech Processing"

25 / 25 papers shown
Title
Enhancing Polyglot Voices by Leveraging Cross-Lingual Fine-Tuning in
  Any-to-One Voice Conversion
Enhancing Polyglot Voices by Leveraging Cross-Lingual Fine-Tuning in Any-to-One Voice Conversion
Giuseppe Ruggiero
Matteo Testa
Jurgen Van de Walle
Luigi Di Caro
26
0
0
25 Sep 2024
Textless NLP -- Zero Resource Challenge with Low Resource Compute
Textless NLP -- Zero Resource Challenge with Low Resource Compute
Krithiga Ramadass
Abrit Pal Singh
Srihari J
Sheetal Kalyani
VLM
26
0
0
24 Sep 2024
Learning Semantic Information from Raw Audio Signal Using Both
  Contextual and Phonetic Representations
Learning Semantic Information from Raw Audio Signal Using Both Contextual and Phonetic Representations
Jaeyeon Kim
Injune Hwang
Kyogu Lee
19
0
0
02 Feb 2024
Representation Learning With Hidden Unit Clustering For Low Resource
  Speech Applications
Representation Learning With Hidden Unit Clustering For Low Resource Speech Applications
Varun Krishna
T. Sai
Sriram Ganapathy
SSL
32
2
0
14 Jul 2023
What Do Self-Supervised Speech Models Know About Words?
What Do Self-Supervised Speech Models Know About Words?
Ankita Pasad
C. Chien
Shane Settle
Karen Livescu
SSL
35
26
0
30 Jun 2023
Zero-Shot Automatic Pronunciation Assessment
Zero-Shot Automatic Pronunciation Assessment
Hongfu Liu
Mingqiang Shi
Ye Wang
19
4
0
31 May 2023
Textually Pretrained Speech Language Models
Textually Pretrained Speech Language Models
Michael Hassid
Tal Remez
Tu Nguyen
Itai Gat
Alexis Conneau
...
Alexandre Défossez
Gabriel Synnaeve
Emmanuel Dupoux
Roy Schwartz
Yossi Adi
VLM
SyDa
31
53
0
22 May 2023
Self-supervised Predictive Coding Models Encode Speaker and Phonetic
  Information in Orthogonal Subspaces
Self-supervised Predictive Coding Models Encode Speaker and Phonetic Information in Orthogonal Subspaces
Oli Danyi Liu
Hao Tang
Sharon Goldwater
SSL
25
12
0
21 May 2023
Adversarial Speaker Disentanglement Using Unannotated External Data for
  Self-supervised Representation Based Voice Conversion
Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion
Xintao Zhao
Shuai Wang
Yang Chao
Zhiyong Wu
Helen Meng
32
3
0
16 May 2023
Self-supervised language learning from raw audio: Lessons from the Zero
  Resource Speech Challenge
Self-supervised language learning from raw audio: Lessons from the Zero Resource Speech Challenge
Ewan Dunbar
Nicolas Hamilakis
Emmanuel Dupoux
SSL
32
30
0
27 Oct 2022
Opening the Black Box of wav2vec Feature Encoder
Opening the Black Box of wav2vec Feature Encoder
Kwanghee Choi
E. Yeo
SSL
40
15
0
27 Oct 2022
AudioLM: a Language Modeling Approach to Audio Generation
AudioLM: a Language Modeling Approach to Audio Generation
Zalan Borsos
Raphaël Marinier
Damien Vincent
Eugene Kharitonov
Olivier Pietquin
...
Dominik Roblek
O. Teboul
David Grangier
Marco Tagliasacchi
Neil Zeghidour
AuLLM
49
567
0
07 Sep 2022
Analysis of Self-Supervised Learning and Dimensionality Reduction
  Methods in Clustering-Based Active Learning for Speech Emotion Recognition
Analysis of Self-Supervised Learning and Dimensionality Reduction Methods in Clustering-Based Active Learning for Speech Emotion Recognition
Einari Vaaras
Manu Airaksinen
Okko Rasanen
19
5
0
21 Jun 2022
Self-Supervised Speech Representation Learning: A Review
Self-Supervised Speech Representation Learning: A Review
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSL
AI4TS
131
349
0
21 May 2022
Probing phoneme, language and speaker information in unsupervised speech
  representations
Probing phoneme, language and speaker information in unsupervised speech representations
Maureen de Seyssel
Marvin Lavechin
Yossi Adi
Emmanuel Dupoux
Guillaume Wisniewski
SSL
21
20
0
30 Mar 2022
Are discrete units necessary for Spoken Language Modeling?
Are discrete units necessary for Spoken Language Modeling?
Tu Nguyen
Benoît Sagot
Emmanuel Dupoux
16
24
0
11 Mar 2022
Word Segmentation on Discovered Phone Units with Dynamic Programming and
  Self-Supervised Scoring
Word Segmentation on Discovered Phone Units with Dynamic Programming and Self-Supervised Scoring
Herman Kamper
31
25
0
24 Feb 2022
textless-lib: a Library for Textless Spoken Language Processing
textless-lib: a Library for Textless Spoken Language Processing
Eugene Kharitonov
Jade Copet
Kushal Lakhotia
Tu Nguyen
Paden Tomasello
...
A. Elkahky
Wei-Ning Hsu
Abdel-rahman Mohamed
Emmanuel Dupoux
Yossi Adi
27
32
0
15 Feb 2022
Self-Supervised Representation Learning for Speech Using Visual
  Grounding and Masked Language Modeling
Self-Supervised Representation Learning for Speech Using Visual Grounding and Masked Language Modeling
Puyuan Peng
David Harwath
SSL
40
26
0
07 Feb 2022
Voice Conversion Can Improve ASR in Very Low-Resource Settings
Voice Conversion Can Improve ASR in Very Low-Resource Settings
Matthew Baas
Herman Kamper
22
14
0
04 Nov 2021
A Comparison of Discrete and Soft Speech Units for Improved Voice
  Conversion
A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
Benjamin van Niekerk
M. Carbonneau
Julian Zaïdi
Matthew Baas
Hugo Seuté
Herman Kamper
DRL
22
111
0
03 Nov 2021
Fast-Slow Transformer for Visually Grounding Speech
Fast-Slow Transformer for Visually Grounding Speech
Puyuan Peng
David Harwath
28
30
0
16 Sep 2021
Speech Representation Learning Combining Conformer CPC with Deep Cluster
  for the ZeroSpeech Challenge 2021
Speech Representation Learning Combining Conformer CPC with Deep Cluster for the ZeroSpeech Challenge 2021
Takashi Maekaku
Xuankai Chang
Yuya Fujita
Li-Wei Chen
Shinji Watanabe
Alexander I. Rudnicky
112
13
0
13 Jul 2021
The Zero Resource Speech Challenge 2021: Spoken language modelling
The Zero Resource Speech Challenge 2021: Spoken language modelling
Ewan Dunbar
Mathieu Bernard
Nicolas Hamilakis
Tu Nguyen
Maureen de Seyssel
Patricia Roze
M. Rivière
Eugene Kharitonov
Emmanuel Dupoux
50
47
0
29 Apr 2021
Generative Spoken Language Modeling from Raw Audio
Generative Spoken Language Modeling from Raw Audio
Kushal Lakhotia
Evgeny Kharitonov
Wei-Ning Hsu
Yossi Adi
Adam Polyak
...
Tu Nguyen
Jade Copet
Alexei Baevski
A. Mohamed
Emmanuel Dupoux
AuLLM
191
337
0
01 Feb 2021
1