Generative Pre-Training for Speech with Autoregressive Predictive Coding

23 October 2019
Yu-An Chung
James R. Glass
    SSL

Papers citing "Generative Pre-Training for Speech with Autoregressive Predictive Coding"

50 / 115 papers shown
Title
DeLoRes: Decorrelating Latent Spaces for Low-Resource Audio Representation Learning
Sreyan Ghosh
Ashish Seth
Deepak Mittal
Maneesh Singh
S. Umesh
SSL
27
6
0
25 Mar 2022
Federated Self-Supervised Learning for Acoustic Event Classification
Meng Feng
Chieh-Chi Kao
Qingming Tang
Ming Sun
Viktor Rozgic
Spyros Matsoukas
Chao Wang
41
11
0
22 Mar 2022
Audio Self-supervised Learning: A Survey
Shuo Liu
Adria Mallol-Ragolta
Emilia Parada-Cabaleiro
Kun Qian
Xingshuo Jing
Alexander Kathan
Bin Hu
Bjoern W. Schuller
SSL
35
106
0
02 Mar 2022
A Brief Overview of Unsupervised Neural Speech Representation Learning
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
Lars Maaløe
Christian Igel
BDL
AI4TS
SSL
19
11
0
01 Mar 2022
Assessing the State of Self-Supervised Human Activity Recognition using Wearables
H. Haresamudram
Irfan Essa
Thomas Plötz
SSL
42
86
0
22 Feb 2022
Self-supervised Speaker Recognition Training Using Human-Machine Dialogues
Metehan Cekic
Ruirui Li
Zeya Chen
Yuguang Yang
A. Stolcke
Upamanyu Madhow
SSL
27
2
0
07 Feb 2022
Sentiment-Aware Automatic Speech Recognition pre-training for enhanced Speech Emotion Recognition
Ayoub Ghriss
Bo Yang
Viktor Rozgic
Elizabeth Shriberg
Chao Wang
27
21
0
27 Jan 2022
SPIRAL: Self-supervised Perturbation-Invariant Representation Learning for Speech Pre-Training
Wenyong Huang
Zhenhe Zhang
Y. Yeung
Xin Jiang
Qun Liu
35
23
0
25 Jan 2022
On Training Targets and Activation Functions for Deep Representation Learning in Text-Dependent Speaker Verification
A. Sarkar
Zheng-Hua Tan
16
2
0
17 Jan 2022
Self-Supervised Learning for speech recognition with Intermediate layer supervision
Chengyi Wang
Yu-Huan Wu
Sanyuan Chen
Shujie Liu
Jinyu Li
Yao Qian
Zhenglu Yang
SSL
24
28
0
16 Dec 2021
Recent Advances in End-to-End Automatic Speech Recognition
Jinyu Li
VLM
35
363
0
02 Nov 2021
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Sanyuan Chen
Chengyi Wang
Zhengyang Chen
Yu-Huan Wu
Shujie Liu
...
Yao Qian
Jian Wu
Michael Zeng
Xiangzhan Yu
Furu Wei
SSL
118
1,715
0
26 Oct 2021
Contrastively Disentangled Sequential Variational Autoencoder
M. Kiener
Weiran Wang
Michael Gerndt
CoGe
DRL
27
40
0
22 Oct 2021
DECAR: Deep Clustering for learning general-purpose Audio Representations
Sreyan Ghosh
Sandesh V Katta
Ashish Seth
S. Umesh
SSL
36
12
0
17 Oct 2021
Don't speak too fast: The impact of data bias on self-supervised speech models
Yen Meng
Yi-Hui Chou
Andy T. Liu
Hung-yi Lee
34
26
0
15 Oct 2021
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing
Junyi Ao
Rui Wang
Long Zhou
Chengyi Wang
Shuo Ren
...
Yu Zhang
Zhihua Wei
Yao Qian
Jinyu Li
Furu Wei
118
193
0
14 Oct 2021
UniSpeech-SAT: Universal Speech Representation Learning with Speaker Aware Pre-Training
Sanyuan Chen
Yu Wu
Chengyi Wang
Zhengyang Chen
Zhuo Chen
...
Jian Wu
Yao Qian
Furu Wei
Jinyu Li
Xiangzhan Yu
SSL
30
85
0
12 Oct 2021
Wav2vec-Switch: Contrastive Learning from Original-noisy Speech Pairs for Robust Speech Recognition
Yiming Wang
Jinyu Li
Heming Wang
Yao Qian
Chengyi Wang
Yu Wu
38
48
0
11 Oct 2021
Large-scale ASR Domain Adaptation using Self- and Semi-supervised Learning
DongSeon Hwang
Ananya Misra
Zhouyuan Huo
Nikhil Siddhartha
Shefali Garg
David Qiu
K. Sim
Trevor Strohman
F. Beaufays
Yanzhang He
65
34
0
01 Oct 2021
Incremental Layer-wise Self-Supervised Learning for Efficient Speech Domain Adaptation On Device
Zhouyuan Huo
Dong-Gyo Hwang
K. Sim
Shefali Garg
Ananya Misra
Nikhil Siddhartha
Trevor Strohman
Françoise Beaufays
48
7
0
01 Oct 2021
Comparison of Self-Supervised Speech Pre-Training Methods on Flemish Dutch
Jakob Poncelet
Hugo Van hamme
SSL
28
1
0
29 Sep 2021
Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation
Yuanxun Lu
Jinxiang Chai
Xun Cao
29
82
0
22 Sep 2021
Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition
Guolin Zheng
Yubei Xiao
Ke Gong
Pan Zhou
Xiaodan Liang
Liang Lin
32
26
0
19 Sep 2021
Improving Accent Identification and Accented Speech Recognition Under a Framework of Self-supervised Learning
Keqi Deng
Songjun Cao
Long Ma
14
29
0
15 Sep 2021
Injecting Text in Self-Supervised Speech Pretraining
Zhehuai Chen
Yu Zhang
Andrew Rosenberg
Bhuvana Ramabhadran
Gary Wang
Pedro J. Moreno
SSL
25
36
0
27 Aug 2021
W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training
Yu-An Chung
Yu Zhang
Wei Han
Chung-Cheng Chiu
James Qin
Ruoming Pang
Yonghui Wu
SSL
VLM
12
412
0
07 Aug 2021
Speech Representation Learning Combining Conformer CPC with Deep Cluster for the ZeroSpeech Challenge 2021
Takashi Maekaku
Xuankai Chang
Yuya Fujita
Li-Wei Chen
Shinji Watanabe
Alexander I. Rudnicky
115
13
0
13 Jul 2021
Layer-wise Analysis of a Self-supervised Speech Representation Model
Ankita Pasad
Ju-Chieh Chou
Karen Livescu
SSL
26
288
0
10 Jul 2021
As easy as APC: overcoming missing data and class imbalance in time series with self-supervised learning
Fiorella Wever
Thomas Anderson Keller
L. Symul
Victor Garcia
SSL
AI4TS
28
1
0
29 Jun 2021
Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition
Yosuke Higuchi
Niko Moritz
Jonathan Le Roux
Takaaki Hori
VLM
35
51
0
16 Jun 2021
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units
Wei-Ning Hsu
Benjamin Bolte
Yao-Hung Hubert Tsai
Kushal Lakhotia
Ruslan Salakhutdinov
Abdel-rahman Mohamed
SSL
55
2,770
0
14 Jun 2021
Scaling Laws for Acoustic Models
J. Droppo
Oguz H. Elibol
15
22
0
11 Jun 2021
Improving the Adversarial Robustness for Speaker Verification by Self-Supervised Learning
Haibin Wu
Xu Li
Andy T. Liu
Zhiyong Wu
Helen Meng
Hung-yi Lee
AAML
SSL
44
29
0
01 Jun 2021
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
Solène Evain
H. Nguyen
Hang Le
Marcely Zanon Boito
Salima Mdhaffar
...
François Portet
Solange Rossato
F. Ringeval
D. Schwab
Laurent Besacier
SSL
33
70
0
23 Apr 2021
S2VC: A Framework for Any-to-Any Voice Conversion with Self-Supervised Pretrained Representations
Jheng-hao Lin
Yist Y. Lin
C. Chien
Hung-yi Lee
30
56
0
07 Apr 2021
General Robot Dynamics Learning and Gen2Real
Dengpeng Xing
Jiale Li
Yiming Yang
Bo Xu
DRL
AI4CE
21
3
0
06 Apr 2021
Unsupervised Speech Representation Learning for Behavior Modeling using Triplet Enhanced Contextualized Networks
Haoqi Li
Brian R. Baucom
Shrikanth Narayanan
P. Georgiou
30
1
0
01 Apr 2021
Fast Development of ASR in African Languages using Self Supervised Speech Representation Learning
Jama Hussein Mohamud
Lloyd Thompson
A. Ndoye
Laurent Besacier
29
4
0
16 Mar 2021
Wav2vec-C: A Self-supervised Model for Speech Representation Learning
Samik Sadhu
Di He
Che-Wei Huang
Sri Harish Reddy Mallidi
Minhua Wu
Ariya Rastrow
A. Stolcke
J. Droppo
Roland Maas
SSL
20
48
0
09 Mar 2021
Contrastive Semi-supervised Learning for ASR
Alex Xiao
Christian Fuegen
Abdel-rahman Mohamed
26
20
0
09 Mar 2021
Data Generation Using Pass-phrase-dependent Deep Auto-encoders for Text-Dependent Speaker Verification
A. K. Sarkar
Md. Sahidullah
Zheng-Hua Tan
7
0
0
03 Feb 2021
On Scaling Contrastive Representations for Low-Resource Speech Recognition
Lasse Borgholt
T. M. S. Tax
Jakob Drachmann Havtorn
Lars Maaløe
Christian Igel
SSL
13
5
0
01 Feb 2021
Adversarial Meta Sampling for Multilingual Low-Resource Speech Recognition
Yubei Xiao
Ke Gong
Pan Zhou
Guolin Zheng
Xiaodan Liang
Liang Lin
30
34
0
22 Dec 2020
Contrastive Predictive Coding for Human Activity Recognition
H. Haresamudram
Irfan Essa
Thomas Ploetz
32
118
0
09 Dec 2020
Vocal Tract Length Perturbation for Text-Dependent Speaker Verification with Autoregressive Prediction Coding
A. Sarkar
Zheng-Hua Tan
9
13
0
25 Nov 2020
The Zero Resource Speech Benchmark 2021: Metrics and baselines for unsupervised spoken language modeling
Tu Nguyen
Maureen de Seyssel
Patricia Roze
M. Rivière
Evgeny Kharitonov
Alexei Baevski
Ewan Dunbar
Emmanuel Dupoux
SSL
16
101
0
23 Nov 2020
Towards Semi-Supervised Semantics Understanding from Speech
Cheng-I Jeff Lai
Jin Cao
S. Bodapati
Shang-Wen Li
SSL
22
7
0
11 Nov 2020
Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning
Dongwei Jiang
Wubo Li
Miao Cao
Wei Zou
Xiangang Li
SSL
21
65
0
27 Oct 2020
Any-to-One Sequence-to-Sequence Voice Conversion using Self-Supervised Discrete Speech Representations
Wen-Chin Huang
Yi-Chiao Wu
Tomoki Hayashi
T. Toda
BDL
49
37
0
23 Oct 2020
Similarity Analysis of Self-Supervised Speech Representations
Yu-An Chung
Yonatan Belinkov
James R. Glass
SSL
36
36
0
22 Oct 2020