ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.00937
  4. Cited By
Neural Discrete Representation Learning
v1v2 (latest)

Neural Discrete Representation Learning

2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
    BDLSSLOCL
ArXiv (abs)PDFHTML

Papers citing "Neural Discrete Representation Learning"

50 / 3,267 papers shown
Title
Neuromorphic Wireless Cognition: Event-Driven Semantic Communications
  for Remote Inference
Neuromorphic Wireless Cognition: Event-Driven Semantic Communications for Remote Inference
Jiechen Chen
N. Skatchkovsky
Osvaldo Simeone
80
37
0
13 Jun 2022
Comparative Snippet Generation
Comparative Snippet Generation
Saurabh Jain
Yisong Miao
Min-Yen Kan
42
0
0
11 Jun 2022
Multi-instrument Music Synthesis with Spectrogram Diffusion
Multi-instrument Music Synthesis with Spectrogram Diffusion
Curtis Hawthorne
Ian Simon
Adam Roberts
Neil Zeghidour
Josh Gardner
Ethan Manilow
Jesse Engel
DiffM
81
51
0
11 Jun 2022
PILC: Practical Image Lossless Compression with an End-to-end GPU
  Oriented Neural Framework
PILC: Practical Image Lossless Compression with an End-to-end GPU Oriented Neural Framework
Ning Kang
Shanzhao Qiu
Shifeng Zhang
Zhenguo Li
Shutao Xia
62
19
0
10 Jun 2022
Draft-and-Revise: Effective Image Generation with Contextual
  RQ-Transformer
Draft-and-Revise: Effective Image Generation with Contextual RQ-Transformer
Doyup Lee
Chiheon Kim
Saehoon Kim
Minsu Cho
Wook-Shin Han
82
29
0
09 Jun 2022
SimVP: Simpler yet Better Video Prediction
SimVP: Simpler yet Better Video Prediction
Zhangyang Gao
Cheng Tan
Lirong Wu
Stan Z. Li
109
222
0
09 Jun 2022
Robust Semantic Communications with Masked VQ-VAE Enabled Codebook
Robust Semantic Communications with Masked VQ-VAE Enabled Codebook
Qiyu Hu
Guangyi Zhang
Zhijin Qin
Yunlong Cai
Guanding Yu
Geoffrey Ye Li
AAML
96
151
0
08 Jun 2022
Patch-based Object-centric Transformers for Efficient Video Generation
Patch-based Object-centric Transformers for Efficient Video Generation
Wilson Yan
Ryogo Okumura
Stephen James
Pieter Abbeel
DiffMViT
87
6
0
08 Jun 2022
Fast Unsupervised Brain Anomaly Detection and Segmentation with
  Diffusion Models
Fast Unsupervised Brain Anomaly Detection and Segmentation with Diffusion Models
W. H. Pinaya
M. Graham
Robert J. Gray
P. F. D. Costa
Petru-Daniel Tudosiu
...
D. Werring
Geraint Rees
P. Nachev
Sebastien Ourselin
M. Jorge Cardoso
DiffMMedIm
86
107
0
07 Jun 2022
Decentralized Low-Latency Collaborative Inference via Ensembles on the
  Edge
Decentralized Low-Latency Collaborative Inference via Ensembles on the Edge
M. Malka
Erez Farhan
Hai Morgenstern
Nir Shlezinger
FedML
74
13
0
07 Jun 2022
Intra-agent speech permits zero-shot task acquisition
Intra-agent speech permits zero-shot task acquisition
Chen Yan
Federico Carnevale
Petko Georgiev
Adam Santoro
Aurelia Guy
Alistair Muldal
Chia-Chun Hung
Josh Abramson
Timothy Lillicrap
Greg Wayne
LM&Ro
97
9
0
07 Jun 2022
Recent Advances for Quantum Neural Networks in Generative Learning
Recent Advances for Quantum Neural Networks in Generative Learning
Jinkai Tian
Xiaoyun Sun
Yuxuan Du
Shanshan Zhao
Qing Liu
...
Xingyao Wu
Min-hsiu Hsieh
Tongliang Liu
Wen-Bin Yang
Dacheng Tao
AI4CE
102
85
0
07 Jun 2022
Blended Latent Diffusion
Blended Latent Diffusion
Omri Avrahami
Ohad Fried
Dani Lischinski
DiffM
196
393
0
06 Jun 2022
Variable-rate hierarchical CPC leads to acoustic unit discovery in
  speech
Variable-rate hierarchical CPC leads to acoustic unit discovery in speech
Santiago Cuervo
Adrian Lañcucki
R. Marxer
Paweł Rychlikowski
J. Chorowski
SSL
87
13
0
05 Jun 2022
DÁRTAGNAN: Counterfactual Video Generation
DÁRTAGNAN: Counterfactual Video Generation
Hadrien Reynaud
Athanasios Vlontzos
Mischa Dombrowski
Ciarán M. Gilligan-Lee
A. Beqiri
Paul Leeson
Bernhard Kainz
VGenCMLMedIm
91
20
0
03 Jun 2022
Recognition of Unseen Bird Species by Learning from Field Guides
Recognition of Unseen Bird Species by Learning from Field Guides
Andrés C. Rodríguez
Stefano Dáronco
Rodrigo Caye Daudt
Jan Dirk Wegner
Konrad Schindler
68
1
0
03 Jun 2022
Pronunciation Dictionary-Free Multilingual Speech Synthesis by Combining
  Unsupervised and Supervised Phonetic Representations
Pronunciation Dictionary-Free Multilingual Speech Synthesis by Combining Unsupervised and Supervised Phonetic Representations
Chang Liu
Zhenhua Ling
Linghui Chen
75
3
0
02 Jun 2022
Improving Diffusion Models for Inverse Problems using Manifold
  Constraints
Improving Diffusion Models for Inverse Problems using Manifold Constraints
Hyungjin Chung
Byeongsu Sim
Dohoon Ryu
J. C. Ye
DiffMMedIm
253
475
0
02 Jun 2022
Modeling Image Composition for Complex Scene Generation
Modeling Image Composition for Complex Scene Generation
Zuopeng Yang
Daqing Liu
Chaoyue Wang
J. Yang
Dacheng Tao
ViT
117
52
0
02 Jun 2022
DiVAE: Photorealistic Images Synthesis with Denoising Diffusion Decoder
DiVAE: Photorealistic Images Synthesis with Denoising Diffusion Decoder
Jie Shi
Chenfei Wu
Jian Liang
Xiang Liu
Nan Duan
DiffM
88
26
0
01 Jun 2022
PAGER: Progressive Attribute-Guided Extendable Robust Image Generation
PAGER: Progressive Attribute-Guided Extendable Robust Image Generation
Zohreh Azizi
C.-C. Jay Kuo
VLMDiffMGAN
89
9
0
01 Jun 2022
VALHALLA: Visual Hallucination for Machine Translation
VALHALLA: Visual Hallucination for Machine Translation
Yi Li
Yikang Shen
Yoon Kim
Chun-Fu Chen
Rogerio Feris
David D. Cox
Nuno Vasconcelos
MLLM
149
40
0
31 May 2022
Improved Vector Quantized Diffusion Models
Improved Vector Quantized Diffusion Models
Zhicong Tang
Shuyang Gu
Jianmin Bao
Dong Chen
Fang Wen
DiffM
260
63
0
31 May 2022
Text2Human: Text-Driven Controllable Human Image Generation
Text2Human: Text-Driven Controllable Human Image Generation
Yuming Jiang
Shuai Yang
Haonan Qiu
Wayne Wu
Chen Change Loy
Ziwei Liu
DiffM
176
48
0
31 May 2022
From Keypoints to Object Landmarks via Self-Training Correspondence: A
  novel approach to Unsupervised Landmark Discovery
From Keypoints to Object Landmarks via Self-Training Correspondence: A novel approach to Unsupervised Landmark Discovery
Dimitrios Mallis
Enrique Sanchez
Matt Bell
Georgios Tzimiropoulos
SSL3DPC
88
7
0
31 May 2022
VQ-AR: Vector Quantized Autoregressive Probabilistic Time Series
  Forecasting
VQ-AR: Vector Quantized Autoregressive Probabilistic Time Series Forecasting
Kashif Rasul
Young-Jin Park
Max Nihlén Ramström
KyungHyun Kim
BDLAI4TS
42
4
0
31 May 2022
SOM-CPC: Unsupervised Contrastive Learning with Self-Organizing Maps for
  Structured Representations of High-Rate Time Series
SOM-CPC: Unsupervised Contrastive Learning with Self-Organizing Maps for Structured Representations of High-Rate Time Series
Iris A. M. Huijben
Arthur A. Nijdam
S. Overeem
M. V. Gilst
Ruud J. G. van Sloun
AI4TS
41
7
0
31 May 2022
Graph Backup: Data Efficient Backup Exploiting Markovian Transitions
Graph Backup: Data Efficient Backup Exploiting Markovian Transitions
Zhengyao Jiang
Tianjun Zhang
Robert Kirk
Tim Rocktaschel
Edward Grefenstette
OffRL
49
2
0
31 May 2022
Do self-supervised speech models develop human-like perception biases?
Do self-supervised speech models develop human-like perception biases?
Juliette Millet
Ewan Dunbar
SSL
68
23
0
31 May 2022
CogVideo: Large-scale Pretraining for Text-to-Video Generation via
  Transformers
CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers
Wenyi Hong
Ming Ding
Wendi Zheng
Xinghan Liu
Jie Tang
DiffM
380
633
0
29 May 2022
Multimodal Masked Autoencoders Learn Transferable Representations
Multimodal Masked Autoencoders Learn Transferable Representations
Xinyang Geng
Hao Liu
Lisa Lee
Dale Schuurams
Sergey Levine
Pieter Abbeel
103
119
0
27 May 2022
3DILG: Irregular Latent Grids for 3D Generative Modeling
3DILG: Irregular Latent Grids for 3D Generative Modeling
Biao Zhang
Matthias Nießner
Peter Wonka
3DV
118
90
0
27 May 2022
A Survey on Long-Tailed Visual Recognition
A Survey on Long-Tailed Visual Recognition
Lu Yang
He Jiang
Q. Song
Jun Guo
93
135
0
27 May 2022
Green Hierarchical Vision Transformer for Masked Image Modeling
Green Hierarchical Vision Transformer for Masked Image Modeling
Lang Huang
Shan You
Mingkai Zheng
Fei Wang
Chao Qian
T. Yamasaki
139
73
0
26 May 2022
Learning What and Where: Disentangling Location and Identity Tracking
  Without Supervision
Learning What and Where: Disentangling Location and Identity Tracking Without Supervision
Manuel Traub
S. Otte
Tobias Menge
Matthias Karlbauer
Jannik Thummel
Martin Volker Butz
115
20
0
26 May 2022
Scalable Multi-Agent Model-Based Reinforcement Learning
Scalable Multi-Agent Model-Based Reinforcement Learning
Vladimir Egorov
A. Shpilman
90
27
0
25 May 2022
Structured Uncertainty in the Observation Space of Variational
  Autoencoders
Structured Uncertainty in the Observation Space of Variational Autoencoders
James A. G. Langley
M. Monteiro
Charles Jones
Nick Pawlowski
Ben Glocker
CMLOODBDLDRL
71
2
0
25 May 2022
Emergent Communication through Metropolis-Hastings Naming Game with Deep
  Generative Models
Emergent Communication through Metropolis-Hastings Naming Game with Deep Generative Models
T. Taniguchi
Yuto Yoshida
Akira Taniguchi
Y. Hagiwara
MLLM
73
25
0
24 May 2022
RevUp: Revise and Update Information Bottleneck for Event Representation
RevUp: Revise and Update Information Bottleneck for Event Representation
Mehdi Rezaee
Francis Ferraro
94
1
0
24 May 2022
Generalization Gap in Amortized Inference
Generalization Gap in Amortized Inference
Mingtian Zhang
Peter Hayes
David Barber
BDLCMLDRL
131
14
0
23 May 2022
Photorealistic Text-to-Image Diffusion Models with Deep Language
  Understanding
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
...
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
675
6,107
0
23 May 2022
Vector-Quantized Input-Contextualized Soft Prompts for Natural Language
  Understanding
Vector-Quantized Input-Contextualized Soft Prompts for Natural Language Understanding
Rishabh Bhardwaj
Amrita Saha
Guosheng Lin
Soujanya Poria
VLMVPVLM
53
7
0
23 May 2022
Transformer-based out-of-distribution detection for clinically safe
  segmentation
Transformer-based out-of-distribution detection for clinically safe segmentation
M. Graham
Petru-Daniel Tudosiu
P. Wright
W. H. Pinaya
J. U-King-im
...
H. Jäger
D. Werring
P. Nachev
Sebastien Ourselin
M. Jorge Cardoso
MedIm
90
21
0
21 May 2022
Self-Supervised Speech Representation Learning: A Review
Self-Supervised Speech Representation Learning: A Review
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSLAI4TS
293
368
0
21 May 2022
Coordinating Policies Among Multiple Agents via an Intelligent
  Communication Channel
Coordinating Policies Among Multiple Agents via an Intelligent Communication Channel
Dianbo Liu
Vedant Shah
Oussama Boussif
Cristian Meo
Anirudh Goyal
Tianmin Shu
Michael C. Mozer
N. Heess
Yoshua Bengio
73
0
0
21 May 2022
Tackling Provably Hard Representative Selection via Graph Neural
  Networks
Tackling Provably Hard Representative Selection via Graph Neural Networks
Seyed Mehran Kazemi
Anton Tsitsulin
Hossein Esfandiari
M. Bateni
Deepak Ramachandran
Bryan Perozzi
Vahab Mirrokni
127
3
0
20 May 2022
UViM: A Unified Modeling Approach for Vision with Learned Guiding Codes
UViM: A Unified Modeling Approach for Vision with Learned Guiding Codes
Alexander Kolesnikov
André Susano Pinto
Lucas Beyer
Xiaohua Zhai
Jeremiah Harmsen
N. Houlsby
171
72
0
20 May 2022
Diversity vs. Recognizability: Human-like generalization in one-shot
  generative models
Diversity vs. Recognizability: Human-like generalization in one-shot generative models
Victor Boutin
Lakshya Singhal
Xavier Thomas
Thomas Serre
83
8
0
20 May 2022
Visual Concepts Tokenization
Visual Concepts Tokenization
Tao Yang
Yuwang Wang
Yan Lu
Nanning Zheng
OCLViT
107
15
0
20 May 2022
Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision
  Transformers with Locality
Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality
Xiang Li
Wenhai Wang
Lingfeng Yang
Jian Yang
185
76
0
20 May 2022
Previous
123...495051...646566
Next