ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.00937
  4. Cited By
Neural Discrete Representation Learning
v1v2 (latest)

Neural Discrete Representation Learning

2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
    BDLSSLOCL
ArXiv (abs)PDFHTML

Papers citing "Neural Discrete Representation Learning"

50 / 3,267 papers shown
Title
ShapeCrafter: A Recursive Text-Conditioned 3D Shape Generation Model
ShapeCrafter: A Recursive Text-Conditioned 3D Shape Generation Model
Rao Fu
Xiaoyu Zhan
Yiwen Chen
Daniel E. Ritchie
Srinath Sridhar
128
79
0
19 Jul 2022
FewGAN: Generating from the Joint Distribution of a Few Images
FewGAN: Generating from the Joint Distribution of a Few Images
Lior Ben-Moshe
Sagie Benaim
Lior Wolf
GAN
112
2
0
18 Jul 2022
Latent-Domain Predictive Neural Speech Coding
Latent-Domain Predictive Neural Speech Coding
Xue Jiang
Xiulian Peng
Huaying Xue
Yuan Zhang
Yan Lu
86
18
0
18 Jul 2022
Guaranteed Discovery of Control-Endogenous Latent States with Multi-Step
  Inverse Models
Guaranteed Discovery of Control-Endogenous Latent States with Multi-Step Inverse Models
Alex Lamb
Riashat Islam
Yonathan Efroni
Aniket Didolkar
Dipendra Kumar Misra
Dylan J. Foster
Lekan Molu
Rajan Chari
A. Krishnamurthy
John Langford
99
24
0
17 Jul 2022
FashionViL: Fashion-Focused Vision-and-Language Representation Learning
FashionViL: Fashion-Focused Vision-and-Language Representation Learning
Xiaoping Han
Licheng Yu
Xiatian Zhu
Li Zhang
Yi-Zhe Song
Tao Xiang
AI4TS
54
49
0
17 Jul 2022
Position Prediction as an Effective Pretraining Strategy
Position Prediction as an Effective Pretraining Strategy
Shuangfei Zhai
Navdeep Jaitly
Jason Ramapuram
Dan Busbridge
Tatiana Likhomanenko
Joseph Y. Cheng
Walter A. Talbott
Chen Huang
Hanlin Goh
J. Susskind
ViT
91
25
0
15 Jul 2022
Comparing the latent space of generative models
Comparing the latent space of generative models
Andrea Asperti
Valerio Tonelli
DRL
84
13
0
14 Jul 2022
N-Grammer: Augmenting Transformers with latent n-grams
N-Grammer: Augmenting Transformers with latent n-grams
Aurko Roy
Rohan Anil
Guangda Lai
Benjamin Lee
Jeffrey Zhao
...
Yu
Phuong Dao
Christopher Fifty
Zhiwen Chen
Yonghui Wu
77
8
0
13 Jul 2022
Collaborative Quantization Embeddings for Intra-Subject Prostate MR
  Image Registration
Collaborative Quantization Embeddings for Intra-Subject Prostate MR Image Registration
Ziyi Shen
Qianye Yang
Yuming Shen
F. Giganti
V. Stavrinides
...
M. Rusu
G. Sonn
Philip Torr
D. Barratt
Yipeng Hu
69
3
0
13 Jul 2022
Fuse It More Deeply! A Variational Transformer with Layer-Wise Latent
  Variable Inference for Text Generation
Fuse It More Deeply! A Variational Transformer with Layer-Wise Latent Variable Inference for Text Generation
Jinyi Hu
Xiaoyuan Yi
Wenhao Li
Maosong Sun
Xing Xie
122
21
0
13 Jul 2022
Eliminating Gradient Conflict in Reference-based Line-Art Colorization
Eliminating Gradient Conflict in Reference-based Line-Art Colorization
Zekun Li
Zhengyang Geng
Zhao Kang
Wenyu Chen
Yibo Yang
109
37
0
13 Jul 2022
Learning Representations for CSI Adaptive Quantization and Feedback
Learning Representations for CSI Adaptive Quantization and Feedback
Valentina Rizzello
Matteo Nerini
M. Joham
B. Clerckx
Wolfgang Utschick
MQ
32
6
0
13 Jul 2022
Earthformer: Exploring Space-Time Transformers for Earth System
  Forecasting
Earthformer: Exploring Space-Time Transformers for Earth System Forecasting
Zhihan Gao
Xingjian Shi
Hao Wang
Yi Zhu
Yuyang Wang
Mu Li
Dit-Yan Yeung
AI4TS
100
159
0
12 Jul 2022
SkexGen: Autoregressive Generation of CAD Construction Sequences with
  Disentangled Codebooks
SkexGen: Autoregressive Generation of CAD Construction Sequences with Disentangled Codebooks
Xiang Xu
Karl D. D. Willis
Joseph G. Lambourne
Chin-Yi Cheng
P. Jayaraman
Yasutaka Furukawa
97
78
0
11 Jul 2022
A Comparative Study of Self-supervised Speech Representation Based Voice
  Conversion
A Comparative Study of Self-supervised Speech Representation Based Voice Conversion
Wen-Chin Huang
Shu-Wen Yang
Tomoki Hayashi
Tomoki Toda
68
17
0
10 Jul 2022
Towards Highly Expressive Machine Learning Models of Non-Melanoma Skin
  Cancer
Towards Highly Expressive Machine Learning Models of Non-Melanoma Skin Cancer
S. Thomas
J. Lefevre
Glenn W. Baxter
N. Hamilton
MedIm
87
2
0
09 Jul 2022
Generative Adversarial Networks and Other Generative Models
Generative Adversarial Networks and Other Generative Models
Markus T. Wenzel
GAN
113
12
0
08 Jul 2022
Hidden Schema Networks
Hidden Schema Networks
Ramses J. Sanchez
L. Conrads
Pascal Welke
K. Cvejoski
C. Ojeda
NAIMILM
65
3
0
08 Jul 2022
End-to-End Binaural Speech Synthesis
End-to-End Binaural Speech Synthesis
Wen-Chin Huang
Dejan Marković
Alexander Richard
I. D. Gebru
Anjali Menon
65
9
0
08 Jul 2022
NESC: Robust Neural End-2-End Speech Coding with GANs
NESC: Robust Neural End-2-End Speech Coding with GANs
N. Pia
Kishan Gupta
Srikanth Korse
M. Multrus
Guillaume Fuchs
105
16
0
07 Jul 2022
Cross-Scale Vector Quantization for Scalable Neural Speech Coding
Cross-Scale Vector Quantization for Scalable Neural Speech Coding
Xue Jiang
Xiulian Peng
Huaying Xue
Yuan Zhang
Yan Lu
MQ
117
10
0
07 Jul 2022
Ultra-Low-Bitrate Speech Coding with Pretrained Transformers
Ultra-Low-Bitrate Speech Coding with Pretrained Transformers
Ali Siahkoohi
Michael Chinen
Tom Denton
W. Kleijn
Jan Skoglund
58
9
0
05 Jul 2022
Transformer based Models for Unsupervised Anomaly Segmentation in Brain
  MR Images
Transformer based Models for Unsupervised Anomaly Segmentation in Brain MR Images
Ahmed Ghorbel
Ahmed Aldahdooh
Shadi Albarqouni
Neuherberg
ViTMedIm
77
6
0
05 Jul 2022
Vector Quantisation for Robust Segmentation
Vector Quantisation for Robust Segmentation
Ainkaran Santhirasekaram
Avinash Kori
Mathias Winkler
A. Rockall
Ben Glocker
OOD
77
9
0
05 Jul 2022
Hierarchical Symbolic Reasoning in Hyperbolic Space for Deep
  Discriminative Models
Hierarchical Symbolic Reasoning in Hyperbolic Space for Deep Discriminative Models
Ainkaran Santhirasekaram
Avinash Kori
A. Rockall
Mathias Winkler
Francesca Toni
Ben Glocker
FAtt
71
4
0
05 Jul 2022
TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of
  3D Human Motions and Texts
TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of 3D Human Motions and Texts
Chuan Guo
Xinxin Xuo
Sen Wang
Li Cheng
VGen
197
244
0
04 Jul 2022
GlowVC: Mel-spectrogram space disentangling model for
  language-independent text-free voice conversion
GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion
Magdalena Proszewska
Grzegorz Beringer
Daniel Sáez-Trigueros
Thomas Merritt
Abdelhamid Ezzerg
Roberto Barra-Chicote
70
6
0
04 Jul 2022
TMGAN-PLC: Audio Packet Loss Concealment using Temporal Memory
  Generative Adversarial Network
TMGAN-PLC: Audio Packet Loss Concealment using Temporal Memory Generative Adversarial Network
Yuansheng Guan
Guochen Yu
Andong Li
C. Zheng
Jie Wang
125
9
0
04 Jul 2022
Generating gender-ambiguous voices for privacy-preserving speech
  recognition
Generating gender-ambiguous voices for privacy-preserving speech recognition
Dimitrios Stoidis
Andrea Cavallaro
52
14
0
03 Jul 2022
Dissecting Self-Supervised Learning Methods for Surgical Computer Vision
Dissecting Self-Supervised Learning Methods for Surgical Computer Vision
Sanat Ramesh
V. Srivastav
Deepak Alapatt
Tong Yu
Aditya Murali
...
Saurav Sharma
A. Fleurentin
Georgios Exarchakis
Alexandros Karargyris
N. Padoy
134
46
0
01 Jul 2022
Towards Human-Agent Communication via the Information Bottleneck
  Principle
Towards Human-Agent Communication via the Information Bottleneck Principle
Mycal Tucker
J. Shah
R. Levy
Noga Zaslavsky
84
14
0
30 Jun 2022
iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for
  Speech Synthesis based on Disentanglement between Prosody and Timbre
iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre
Guangyan Zhang
Ying Qin
Weinan Zhang
Jialun Wu
Mei Li
Yu Gai
Feijun Jiang
Tan Lee
108
27
0
29 Jun 2022
Masked World Models for Visual Control
Masked World Models for Visual Control
Younggyo Seo
Danijar Hafner
Hao Liu
Fangchen Liu
Stephen James
Kimin Lee
Pieter Abbeel
OffRL
188
149
0
28 Jun 2022
Megapixel Image Generation with Step-Unrolled Denoising Autoencoders
Megapixel Image Generation with Step-Unrolled Denoising Autoencoders
Alex F. McKinney
Chris G. Willcocks
DiffM
47
0
0
24 Jun 2022
MaskViT: Masked Visual Pre-Training for Video Prediction
MaskViT: Masked Visual Pre-Training for Video Prediction
Agrim Gupta
Stephen Tian
Yunzhi Zhang
Jiajun Wu
Roberto Martín-Martín
Li Fei-Fei
199
121
0
23 Jun 2022
A Temporal Extension of Latent Dirichlet Allocation for Unsupervised
  Acoustic Unit Discovery
A Temporal Extension of Latent Dirichlet Allocation for Unsupervised Acoustic Unit Discovery
W. V. D. Merwe
Herman Kamper
J. D. Preez
57
2
0
23 Jun 2022
Entropy-driven Sampling and Training Scheme for Conditional Diffusion
  Generation
Entropy-driven Sampling and Training Scheme for Conditional Diffusion Generation
Sheng-liang Li
Guangcong Zheng
Haibo Wang
Taiping Yao
Yang Chen
Shoudong Ding
Xi Li
DiffM
86
22
0
23 Jun 2022
Towards Robust Blind Face Restoration with Codebook Lookup Transformer
Towards Robust Blind Face Restoration with Codebook Lookup Transformer
Shangchen Zhou
Kelvin C. K. Chan
Chongyi Li
Chen Change Loy
CVBM
100
239
0
22 Jun 2022
Automated Cancer Subtyping via Vector Quantization Mutual Information
  Maximization
Automated Cancer Subtyping via Vector Quantization Mutual Information Maximization
Zheng Chen
Lingwei Zhu
Ziwei Yang
Takashi Matsubara
116
7
0
22 Jun 2022
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Jiahui Yu
Yuanzhong Xu
Jing Yu Koh
Thang Luong
Gunjan Baid
...
Zarana Parekh
Xin Li
Han Zhang
Jason Baldridge
Yonghui Wu
EGVM
302
1,134
0
22 Jun 2022
SemMAE: Semantic-Guided Masking for Learning Masked Autoencoders
SemMAE: Semantic-Guided Masking for Learning Masked Autoencoders
Gang Li
Heliang Zheng
Daqing Liu
Chaoyue Wang
Fuchun Sun
Changwen Zheng
135
130
0
21 Jun 2022
Identifiability of deep generative models without auxiliary information
Identifiability of deep generative models without auxiliary information
Bohdan Kivva
Goutham Rajendran
Pradeep Ravikumar
Bryon Aragam
DRL
117
53
0
20 Jun 2022
Latent Variable Modelling Using Variational Autoencoders: A survey
Latent Variable Modelling Using Variational Autoencoders: A survey
Vasanth Kalingeri
CMLDRL
72
2
0
20 Jun 2022
Self-supervised speech unit discovery from articulatory and acoustic
  features using VQ-VAE
Self-supervised speech unit discovery from articulatory and acoustic features using VQ-VAE
Marc-Antoine Georges
J. Schwartz
Thomas Hueber
SSL
116
5
0
17 Jun 2022
TUSK: Task-Agnostic Unsupervised Keypoints
TUSK: Task-Agnostic Unsupervised Keypoints
Yuhe Jin
Weiwei Sun
J. Hosang
Eduard Trulls
K. M. Yi
76
5
0
16 Jun 2022
iBoot: Image-bootstrapped Self-Supervised Video Representation Learning
iBoot: Image-bootstrapped Self-Supervised Video Representation Learning
F. Saleh
Fuwen Tan
Adrian Bulat
Georgios Tzimiropoulos
Brais Martínez
SSL
104
1
0
16 Jun 2022
Pythae: Unifying Generative Autoencoders in Python -- A Benchmarking Use
  Case
Pythae: Unifying Generative Autoencoders in Python -- A Benchmarking Use Case
Clément Chadebec
Louis J. Vincent
S. Allassonnière
DRL
98
30
0
16 Jun 2022
Discrete Contrastive Diffusion for Cross-Modal Music and Image
  Generation
Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation
Ye Zhu
Yuehua Wu
Kyle Olszewski
Jian Ren
Sergey Tulyakov
Yan Yan
DiffM
109
49
0
15 Jun 2022
Write and Paint: Generative Vision-Language Models are Unified Modal
  Learners
Write and Paint: Generative Vision-Language Models are Unified Modal Learners
Shizhe Diao
Wangchunshu Zhou
Xinsong Zhang
Jiawei Wang
MLLMAI4CE
99
17
0
15 Jun 2022
A Survey of Automated Data Augmentation Algorithms for Deep
  Learning-based Image Classification Tasks
A Survey of Automated Data Augmentation Algorithms for Deep Learning-based Image Classification Tasks
Z. Yang
Richard Sinnott
James Bailey
Qiuhong Ke
86
45
0
14 Jun 2022
Previous
123...484950...646566
Next