ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.00937
  4. Cited By
Neural Discrete Representation Learning
v1v2 (latest)

Neural Discrete Representation Learning

2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
    BDLSSLOCL
ArXiv (abs)PDFHTML

Papers citing "Neural Discrete Representation Learning"

50 / 3,267 papers shown
Title
An Empirical Study of Training End-to-End Vision-and-Language
  Transformers
An Empirical Study of Training End-to-End Vision-and-Language Transformers
Zi-Yi Dou
Yichong Xu
Zhe Gan
Jianfeng Wang
Shuohang Wang
...
Pengchuan Zhang
Lu Yuan
Nanyun Peng
Zicheng Liu
Michael Zeng
VLM
106
381
0
03 Nov 2021
Recent Advancements in Self-Supervised Paradigms for Visual Feature
  Representation
Recent Advancements in Self-Supervised Paradigms for Visual Feature Representation
Mrinal Anand
Aditya Garg
SSL
46
2
0
03 Nov 2021
PatchGame: Learning to Signal Mid-level Patches in Referential Games
PatchGame: Learning to Signal Mid-level Patches in Referential Games
Kamal Gupta
Gowthami Somepalli
Anubhav Gupta
Vinoj Jayasundara
Matthias Zwicker
Abhinav Shrivastava
79
4
0
02 Nov 2021
Adjacency constraint for efficient hierarchical reinforcement learning
Adjacency constraint for efficient hierarchical reinforcement learning
Tianren Zhang
Shangqi Guo
Tian Tan
Xiao M Hu
Feng Chen
106
17
0
30 Oct 2021
Neural Analysis and Synthesis: Reconstructing Speech from
  Self-Supervised Representations
Neural Analysis and Synthesis: Reconstructing Speech from Self-Supervised Representations
Hyeong-Seok Choi
Juheon Lee
W. Kim
Jie Hwan Lee
Hoon Heo
Kyogu Lee
116
158
0
27 Oct 2021
Zero-shot Voice Conversion via Self-supervised Prosody Representation
  Learning
Zero-shot Voice Conversion via Self-supervised Prosody Representation Learning
Shijun Wang
Dimche Kostadinov
Damian Borth
93
11
0
27 Oct 2021
Towards artificial general intelligence via a multimodal foundation
  model
Towards artificial general intelligence via a multimodal foundation model
Nanyi Fei
Zhiwu Lu
Yizhao Gao
Guoxing Yang
Yuqi Huo
...
Ruihua Song
Xin Gao
Tao Xiang
Haoran Sun
Jiling Wen
AI4CELRM
99
230
0
27 Oct 2021
VQ-GNN: A Universal Framework to Scale up Graph Neural Networks using
  Vector Quantization
VQ-GNN: A Universal Framework to Scale up Graph Neural Networks using Vector Quantization
Mucong Ding
Kezhi Kong
Jingling Li
Chen Zhu
John P. Dickerson
Furong Huang
Tom Goldstein
GNNMQ
107
49
0
27 Oct 2021
TopicNet: Semantic Graph-Guided Topic Discovery
TopicNet: Semantic Graph-Guided Topic Discovery
Zhibin Duan
Yishi Xu
Bo Chen
Dongsheng Wang
Chaojie Wang
Mingyuan Zhou
BDL
92
15
0
27 Oct 2021
Evidential Softmax for Sparse Multimodal Distributions in Deep
  Generative Models
Evidential Softmax for Sparse Multimodal Distributions in Deep Generative Models
Phil Chen
Masha Itkina
Ransalu Senanayake
Mykel J. Kochenderfer
69
6
0
27 Oct 2021
Object-Aware Regularization for Addressing Causal Confusion in Imitation
  Learning
Object-Aware Regularization for Addressing Causal Confusion in Imitation Learning
Jongjin Park
Younggyo Seo
Chang-Shu Liu
Li Zhao
Tao Qin
Jinwoo Shin
Tie-Yan Liu
CMLOffRL
55
16
0
27 Oct 2021
Fragment-based Sequential Translation for Molecular Optimization
Fragment-based Sequential Translation for Molecular Optimization
Benson Chen
Xiang Fu
Regina Barzilay
Tommi Jaakkola
58
7
0
26 Oct 2021
Discrete Acoustic Space for an Efficient Sampling in Neural
  Text-To-Speech
Discrete Acoustic Space for an Efficient Sampling in Neural Text-To-Speech
Mu Li
Jonas Rohnke
Antonio Bonafonte
Mateusz Lajszczak
Trevor Wood
DRL
108
2
0
24 Oct 2021
SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text
  Joint Pre-Training
SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training
Ankur Bapna
Yu-An Chung
Na Wu
Anmol Gulati
Ye Jia
J. Clark
Melvin Johnson
Jason Riesa
Alexis Conneau
Yu Zhang
VLM
139
96
0
20 Oct 2021
Momentum Contrastive Autoencoder: Using Contrastive Learning for Latent
  Space Distribution Matching in WAE
Momentum Contrastive Autoencoder: Using Contrastive Learning for Latent Space Distribution Matching in WAE
Devansh Arpit
Aadyot Bhatnagar
Huan Wang
Caiming Xiong
49
0
0
19 Oct 2021
Wideband and Entropy-Aware Deep Soft Bit Quantization
Wideband and Entropy-Aware Deep Soft Bit Quantization
Marius Arvinte
Jonathan I. Tamir
MQ
28
0
0
18 Oct 2021
CycleFlow: Purify Information Factors by Cycle Loss
CycleFlow: Purify Information Factors by Cycle Loss
Haoran Sun
Chen Chen
Lantian Li
Dong Wang
72
1
0
18 Oct 2021
Illiterate DALL-E Learns to Compose
Illiterate DALL-E Learns to Compose
Gautam Singh
Fei Deng
Sungjin Ahn
CoGeOCL
133
139
0
17 Oct 2021
3D-RETR: End-to-End Single and Multi-View 3D Reconstruction with
  Transformers
3D-RETR: End-to-End Single and Multi-View 3D Reconstruction with Transformers
Z. Shi
Zhao Meng
Yiran Xing
Yunpu Ma
Roger Wattenhofer
ViT
82
35
0
17 Oct 2021
Taming Visually Guided Sound Generation
Taming Visually Guided Sound Generation
Vladimir E. Iashin
Esa Rahtu
VLM
131
128
0
17 Oct 2021
Guiding Visual Question Generation
Guiding Visual Question Generation
Nihir Vedd
Zixu Wang
Marek Rei
Yishu Miao
Lucia Specia
140
22
0
15 Oct 2021
Towards Identity Preserving Normal to Dysarthric Voice Conversion
Towards Identity Preserving Normal to Dysarthric Voice Conversion
Wen-Chin Huang
B. Halpern
Lester Phillip Violeta
O. Scharenborg
Tomoki Toda
108
23
0
15 Oct 2021
MaGNET: Uniform Sampling from Deep Generative Network Manifolds Without
  Retraining
MaGNET: Uniform Sampling from Deep Generative Network Manifolds Without Retraining
Ahmed Imtiaz Humayun
Randall Balestriero
Richard Baraniuk
OOD
141
31
0
15 Oct 2021
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language
  Processing
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing
Junyi Ao
Rui Wang
Long Zhou
Chengyi Wang
Shuo Ren
...
Yu Zhang
Zhihua Wei
Yao Qian
Jinyu Li
Furu Wei
177
203
0
14 Oct 2021
The Deep Generative Decoder: MAP estimation of representations improves
  modeling of single-cell RNA data
The Deep Generative Decoder: MAP estimation of representations improves modeling of single-cell RNA data
Viktoria Schuster
A. Krogh
83
4
0
13 Oct 2021
DiscoDVT: Generating Long Text with Discourse-Aware Discrete Variational
  Transformer
DiscoDVT: Generating Long Text with Discourse-Aware Discrete Variational Transformer
Haozhe Ji
Minlie Huang
103
23
0
12 Oct 2021
Unsupervised Source Separation via Bayesian Inference in the Latent
  Domain
Unsupervised Source Separation via Bayesian Inference in the Latent Domain
Michele Mancusi
Emilian Postolache
Giorgio Mariani
Marco Fumero
Andrea Santilli
Luca Cosmo
Emanuele Rodolà
BDL
62
2
0
11 Oct 2021
Injecting Text and Cross-lingual Supervision in Few-shot Learning from
  Self-Supervised Models
Injecting Text and Cross-lingual Supervision in Few-shot Learning from Self-Supervised Models
Sanjeev Khudanpur
Desh Raj
Sanjeev Khudanpur
103
6
0
10 Oct 2021
Vector-quantized Image Modeling with Improved VQGAN
Vector-quantized Image Modeling with Improved VQGAN
Jiahui Yu
Xin Li
Jing Yu Koh
Han Zhang
Ruoming Pang
James Qin
Alexander Ku
Yuanzhong Xu
Jason Baldridge
Yonghui Wu
ViTVLMDRL
207
527
0
09 Oct 2021
Cognitive Coding of Speech
Cognitive Coding of Speech
Reza Lotfidereshgi
P. Gournay
60
5
0
08 Oct 2021
KaraSinger: Score-Free Singing Voice Synthesis with VQ-VAE using
  Mel-spectrograms
KaraSinger: Score-Free Singing Voice Synthesis with VQ-VAE using Mel-spectrograms
Chien-Feng Liao
Jen-Yu Liu
Yi-Hsuan Yang
69
5
0
08 Oct 2021
Attention is All You Need? Good Embeddings with Statistics are
  enough:Large Scale Audio Understanding without Transformers/ Convolutions/
  BERTs/ Mixers/ Attention/ RNNs or ....
Attention is All You Need? Good Embeddings with Statistics are enough:Large Scale Audio Understanding without Transformers/ Convolutions/ BERTs/ Mixers/ Attention/ RNNs or ....
Prateek Verma
AI4TS
102
2
0
07 Oct 2021
DiffusionCLIP: Text-Guided Diffusion Models for Robust Image
  Manipulation
DiffusionCLIP: Text-Guided Diffusion Models for Robust Image Manipulation
Gwanghyun Kim
Taesung Kwon
Jong Chul Ye
DiffM
262
657
0
06 Oct 2021
CLIP-Forge: Towards Zero-Shot Text-to-Shape Generation
CLIP-Forge: Towards Zero-Shot Text-to-Shape Generation
Aditya Sanghi
Hang Chu
Joseph G. Lambourne
Ye Wang
Chin-Yi Cheng
Marco Fumero
Kamal Rahimi Malekshan
CLIP
136
296
0
06 Oct 2021
Unsupervised Speech Segmentation and Variable Rate Representation
  Learning using Segmental Contrastive Predictive Coding
Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding
Saurabhchand Bhati
Jesús Villalba
Piotr Żelasko
Laureano Moro-Velazquez
Najim Dehak
SSL
134
23
0
05 Oct 2021
Truth-Conditional Captioning of Time Series Data
Truth-Conditional Captioning of Time Series Data
Harsh Jhamtani
Taylor Berg-Kirkpatrick
AI4TS
75
8
0
05 Oct 2021
Proxy-bridged Image Reconstruction Network for Anomaly Detection in
  Medical Images
Proxy-bridged Image Reconstruction Network for Anomaly Detection in Medical Images
Kang Zhou
Jing Li
Weixin Luo
Zhengxin Li
Jianlong Yang
Huazhu Fu
Jun Cheng
Jiang-Dong Liu
Shenghua Gao
193
34
0
05 Oct 2021
Causal Representation Learning for Context-Aware Face Transfer
Causal Representation Learning for Context-Aware Face Transfer
Gege Gao
Huaibo Huang
Chaoyou Fu
Ran He
CVBM
72
0
0
04 Oct 2021
A Review of the Gumbel-max Trick and its Extensions for Discrete
  Stochasticity in Machine Learning
A Review of the Gumbel-max Trick and its Extensions for Discrete Stochasticity in Machine Learning
Iris A. M. Huijben
W. Kool
Max B. Paulus
Ruud J. G. van Sloun
115
99
0
04 Oct 2021
An Unsupervised Video Game Playstyle Metric via State Discretization
An Unsupervised Video Game Playstyle Metric via State Discretization
Chiu-Chou Lin
W. Chiu
I-Chen Wu
32
3
0
03 Oct 2021
Calibrated Multiple-Output Quantile Regression with Representation
  Learning
Calibrated Multiple-Output Quantile Regression with Representation Learning
Shai Feldman
Stephen Bates
Yaniv Romano
210
36
0
02 Oct 2021
Audio-to-Image Cross-Modal Generation
Audio-to-Image Cross-Modal Generation
Maciej Żelaszczyk
Jacek Mańdziuk
DiffM
118
17
0
27 Sep 2021
Fully Spiking Variational Autoencoder
Fully Spiking Variational Autoencoder
Hiromichi Kamata
Yusuke Mukuta
Tatsuya Harada
BDLDRL
106
43
0
26 Sep 2021
Learnable Triangulation for Deep Learning-based 3D Reconstruction of
  Objects of Arbitrary Topology from Single RGB Images
Learnable Triangulation for Deep Learning-based 3D Reconstruction of Objects of Arbitrary Topology from Single RGB Images
Tarek Ben Charrada
Hedi Tabia
A. Chetouani
Hamid Laga
3DV
74
0
0
24 Sep 2021
Multi-view Contrastive Self-Supervised Learning of Accounting Data
  Representations for Downstream Audit Tasks
Multi-view Contrastive Self-Supervised Learning of Accounting Data Representations for Downstream Audit Tasks
Marco Schreyer
Timur Sattarov
Damian Borth
MLAU
76
15
0
23 Sep 2021
Noisy-to-Noisy Voice Conversion Framework with Denoising Model
Noisy-to-Noisy Voice Conversion Framework with Denoising Model
Chao Xie
Yi-Chiao Wu
Patrick Lumban Tobing
Wen-Chin Huang
Tomoki Toda
67
8
0
22 Sep 2021
Intuitive and Efficient Roof Modeling for Reconstruction and Synthesis
Intuitive and Efficient Roof Modeling for Reconstruction and Synthesis
Jing Ren
Biao Zhang
Bojian Wu
Jianqiang Huang
Lubin Fan
M. Ovsjanikov
Peter Wonka
70
19
0
16 Sep 2021
Disentangling Generative Factors in Natural Language with Discrete
  Variational Autoencoders
Disentangling Generative Factors in Natural Language with Discrete Variational Autoencoders
Giangiacomo Mercatali
André Freitas
CoGeDRL
59
23
0
15 Sep 2021
fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit
fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit
Changhan Wang
Wei-Ning Hsu
Yossi Adi
Adam Polyak
Ann Lee
Peng-Jen Chen
Jiatao Gu
J. Pino
VLM
106
32
0
14 Sep 2021
A Temporal Variational Model for Story Generation
A Temporal Variational Model for Story Generation
David Wilmot
Frank Keller
DRL
113
9
0
14 Sep 2021
Previous
123...545556...646566
Next