Neural Discrete Representation Learning

2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL, SSL, OCL

Papers citing "Neural Discrete Representation Learning"

50 / 3,267 papers shown
Title
BEiT: BERT Pre-Training of Image Transformers
Hangbo Bao
Li Dong
Songhao Piao
Furu Wei
ViT
423
2,858
0
15 Jun 2021
Improved Transformer for High-Resolution GANs
Long Zhao
Zizhao Zhang
Ting Chen
Dimitris N. Metaxas
Han Zhang
ViT
133
96
0
14 Jun 2021
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units
Wei-Ning Hsu
Benjamin Bolte
Yao-Hung Hubert Tsai
Kushal Lakhotia
Ruslan Salakhutdinov
Abdel-rahman Mohamed
SSL
214
3,017
0
14 Jun 2021
Learning the Imaging Landmarks: Unsupervised Key point Detection in Lung Ultrasound Videos
Arpan Tripathi
Mahesh Raveendranatha Panicker
A. Hareendranathan
Yale Tung Chen
Jacob L. Jaremko
K. Narayan
C. Kesavadas
24
0
0
13 Jun 2021
Inverting Adversarially Robust Networks for Image Synthesis
Renan A. Rojas-Gomez
Raymond A. Yeh
Minh Do
A. Nguyen
68
5
0
13 Jun 2021
D2C: Diffusion-Denoising Models for Few-shot Conditional Generation
Abhishek Sinha
Jiaming Song
Chenlin Meng
Stefano Ermon
VLM, DiffM
140
121
0
12 Jun 2021
Robust Representation Learning via Perceptual Similarity Metrics
Saeid Asgari Taghanaki
Kristy Choi
Amir Khasahmadi
Anirudh Goyal
SSL
64
29
0
11 Jun 2021
Decoupled Greedy Learning of CNNs for Synchronous and Asynchronous Distributed Learning
Eugene Belilovsky
Louis Leconte
Lucas Caccia
Michael Eickenberg
Edouard Oyallon
45
7
0
11 Jun 2021
Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Jaehyeon Kim
Jungil Kong
Juhee Son
DRL
170
903
0
11 Jun 2021
Score-based Generative Modeling in Latent Space
Arash Vahdat
Karsten Kreis
Jan Kautz
DiffM
142
688
0
10 Jun 2021
Cross-Modal Discrete Representation Learning
Alexander H. Liu
SouYoung Jin
Cheng-I Jeff Lai
Andrew Rouditchenko
A. Oliva
James R. Glass
SSL
84
41
0
10 Jun 2021
Domain Specific Transporter Framework to Detect Fractures in Ultrasound
Arpan Tripathi
A. Hareendranathan
Mahesh Raveendranatha Panicker
Jack Zhang
Naveenjyote Boora
Jacob L. Jaremko
12
0
0
09 Jun 2021
Vector Quantized Models for Planning
Sherjil Ozair
Yazhe Li
Ali Razavi
Ioannis Antonoglou
Aaron van den Oord
Oriol Vinyals
OffRL
96
51
0
08 Jun 2021
Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings
Marcely Zanon Boito
Bolaji Yusuf
Lucas Ondel
Aline Villavicencio
Laurent Besacier
69
3
0
08 Jun 2021
NWT: Towards natural audio-to-video generation with representation learning
Rayhane Mama
Marc S. Tyndel
Hashiam Kadhim
Cole Clifford
Ragavan Thurairatnam
VGen
112
12
0
08 Jun 2021
Interpretable agent communication from scratch (with a generic visual processor emerging on the side)
Roberto Dessì
Eugene Kharitonov
Marco Baroni
95
28
0
08 Jun 2021
Weakly-supervised word-level pronunciation error detection in non-native English speech
Daniel Korzekwa
Jaime Lorenzo-Trueba
Thomas Drugman
Shira Calamaro
B. Kostek
37
13
0
07 Jun 2021
Neural Distributed Source Coding
Jay Whang
Alliot Nagle
Anish Acharya
Hyeji Kim
A. Dimakis
93
21
0
05 Jun 2021
On Perceptual Lossy Compression: The Cost of Perceptual Reconstruction and An Optimal Training Framework
Zeyu Yan
Fei Wen
R. Ying
Chao Ma
Peilin Liu
86
38
0
05 Jun 2021
The Image Local Autoregressive Transformer
Chenjie Cao
Yue Hong
Xiang Li
Chengrong Wang
C. Xu
Xiangyang Xue
Yanwei Fu
82
13
0
04 Jun 2021
Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation
Saurabhchand Bhati
Jesús Villalba
Piotr Żelasko
Laureano Moro-Velazquez
Najim Dehak
SSL
86
37
0
03 Jun 2021
A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion
Wen-Chin Huang
Kazuhiro Kobayashi
Yu-Huai Peng
Ching-Feng Liu
Yu Tsao
Hsin-Min Wang
Tomoki Toda
86
11
0
02 Jun 2021
Container: Context Aggregation Network
Peng Gao
Jiasen Lu
Hongsheng Li
Roozbeh Mottaghi
Aniruddha Kembhavi
ViT
108
72
0
02 Jun 2021
What Can I Do Here? Learning New Skills by Imagining Visual Affordances
Alexander Khazatsky
Ashvin Nair
Dan Jing
Sergey Levine
LM&Ro
84
33
0
01 Jun 2021
DISSECT: Disentangled Simultaneous Explanations via Concept Traversals
Asma Ghandeharioun
Been Kim
Chun-Liang Li
Brendan Jou
B. Eoff
Rosalind W. Picard
AAML
102
54
0
31 May 2021
Factorising Meaning and Form for Intent-Preserving Paraphrasing
Tom Hosking
Mirella Lapata
OOD
86
41
0
31 May 2021
Cascaded Diffusion Models for High Fidelity Image Generation
Jonathan Ho
Chitwan Saharia
William Chan
David J. Fleet
Mohammad Norouzi
Tim Salimans
306
1,246
0
30 May 2021
Diffusion-Based Representation Learning
K. Abstreiter
Sarthak Mittal
Stefan Bauer
Bernhard Schölkopf
Arash Mehrjou
DiffM
96
58
0
29 May 2021
M6-UFC: Unifying Multi-Modal Controls for Conditional Image Synthesis via Non-Autoregressive Generative Transformers
Zhu Zhang
Jianxin Ma
Chang Zhou
Rui Men
Zhikang Li
Ming Ding
Jie Tang
Jingren Zhou
Hongxia Yang
110
47
0
29 May 2021
CogView: Mastering Text-to-Image Generation via Transformers
Ming Ding
Zhuoyi Yang
Wenyi Hong
Wendi Zheng
Chang Zhou
...
Junyang Lin
Xu Zou
Zhou Shao
Hongxia Yang
Jie Tang
ViT, VLM
164
784
0
26 May 2021
Deep Neural Networks and End-to-End Learning for Audio Compression
Daniela N. Rim
I. Jang
Heeyoul Choi
64
9
0
25 May 2021
EXoN: EXplainable encoder Network
SeungHwan An
Hosik Choi
Jong-June Jeon
BDL, DRL
76
5
0
23 May 2021
Compositional Fine-Grained Low-Shot Learning
Dat T. Huynh
Ehsan Elhamifar
83
4
0
21 May 2021
Combining Transformer Generators with Convolutional Discriminators
Ricard Durall
Stanislav Frolov
Jörn Hees
Federico Raue
Franz-Josef Pfreundt
Andreas Dengel
J. Keuper
ViT
76
16
0
21 May 2021
Parallel and Flexible Sampling from Autoregressive Models via Langevin Dynamics
V. Jayaram
John Thickstun
DiffM
107
25
0
17 May 2021
Priors in Bayesian Deep Learning: A Review
Vincent Fortuin
UQCV, BDL
141
134
0
14 May 2021
High-Resolution Complex Scene Synthesis with Transformers
Manuel Jahn
Robin Rombach
Bjorn Ommer
ViT
85
37
0
13 May 2021
Autoencoding Under Normalization Constraints
Sangwoong Yoon
Yung-Kyun Noh
Frank C. Park
OODD, UQCV
86
39
0
12 May 2021
Discrete representations in neural models of spoken language
Bertrand Higy
Lieke Gelderloos
Afra Alishahi
Grzegorz Chrupała
144
6
0
12 May 2021
Diffusion Models Beat GANs on Image Synthesis
Prafulla Dhariwal
Alex Nichol
562
8,017
0
11 May 2021
MuseMorphose: Full-Song and Fine-Grained Piano Music Style Transfer with One Transformer VAE
Shih-Lun Wu
Yi-Hsuan Yang
ViT
109
55
0
10 May 2021
Voice Conversion Based Speaker Normalization for Acoustic Unit Discovery
Thomas Glarner
Janek Ebbers
Reinhold Häb-Umbach
DRL
30
1
0
04 May 2021
VQCPC-GAN: Variable-Length Adversarial Audio Synthesis Using Vector-Quantized Contrastive Predictive Coding
J. Nistal
Cyran Aouameur
Stefan Lattner
G. Richard
105
7
0
04 May 2021
OCTOPUS: Overcoming Performance and Privatization Bottlenecks in Distributed Learning
Shuo Wang
Surya Nepal
Kristen Moore
M. Grobler
Carsten Rudolph
A. Abuadbba
FedML
76
8
0
03 May 2021
GODIVA: Generating Open-DomaIn Videos from nAtural Descriptions
Chenfei Wu
Lun Huang
Qianxi Zhang
Binyang Li
Lei Ji
Fan Yang
Guillermo Sapiro
Nan Duan
DiffM, VGen
120
245
0
30 Apr 2021
Eccentric Regularization: Minimizing Hyperspherical Energy without explicit projection
Xuefeng Li
Alan Blair
62
0
0
23 Apr 2021
Protecting gender and identity with disentangled speech representations
Dimitrios Stoidis
Andrea Cavallaro
74
10
0
22 Apr 2021
IB-DRR: Incremental Learning with Information-Back Discrete Representation Replay
Jian Jiang
Edoardo Cetin
Oya Celiktutan
61
9
0
21 Apr 2021
VideoGPT: Video Generation using VQ-VAE and Transformers
Wilson Yan
Yunzhi Zhang
Pieter Abbeel
A. Srinivas
ViT, VGen
345
514
0
20 Apr 2021
Conditional Variational Capsule Network for Open Set Recognition
Yunrui Guo
Guglielmo Camporese
Wenjing Yang
A. Sperduti
Lamberto Ballan
BDL
90
45
0
19 Apr 2021