v1v2 (latest)

Neural Discrete Representation Learning

2 November 2017

Papers citing "Neural Discrete Representation Learning"

50 / 3,267 papers shown

Title
BEiT: BERT Pre-Training of Image Transformers Hangbo Bao Li Dong Songhao Piao Furu Wei ViT 423 2,858 0 15 Jun 2021
Improved Transformer for High-Resolution GANs Long Zhao Zizhao Zhang Ting Chen Dimitris N. Metaxas Han Zhang ViT 133 96 0 14 Jun 2021
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units Wei-Ning Hsu Benjamin Bolte Yao-Hung Hubert Tsai Kushal Lakhotia Ruslan Salakhutdinov Abdel-rahman Mohamed SSL 214 3,017 0 14 Jun 2021
Learning the Imaging Landmarks: Unsupervised Key point Detection in Lung Ultrasound Videos Arpan Tripathi Mahesh Raveendranatha Panicker A. Hareendranathan Yale Tung Chen Jacob L. Jaremko K. Narayan C. Kesavadas 24 0 0 13 Jun 2021
Inverting Adversarially Robust Networks for Image Synthesis Renan A. Rojas-Gomez Raymond A. Yeh Minh Do A. Nguyen 68 5 0 13 Jun 2021
D2C: Diffusion-Denoising Models for Few-shot Conditional Generation Abhishek Sinha Jiaming Song Chenlin Meng Stefano Ermon VLM DiffM 140 121 0 12 Jun 2021
Robust Representation Learning via Perceptual Similarity Metrics Saeid Asgari Taghanaki Kristy Choi Amir Khasahmadi Anirudh Goyal SSL 64 29 0 11 Jun 2021
Decoupled Greedy Learning of CNNs for Synchronous and Asynchronous Distributed Learning Eugene Belilovsky Louis Leconte Lucas Caccia Michael Eickenberg Edouard Oyallon 45 7 0 11 Jun 2021
Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech Jaehyeon Kim Jungil Kong Juhee Son DRL 170 903 0 11 Jun 2021
Score-based Generative Modeling in Latent Space Arash Vahdat Karsten Kreis Jan Kautz DiffM 142 688 0 10 Jun 2021
Cross-Modal Discrete Representation Learning Alexander H. Liu SouYoung Jin Cheng-I Jeff Lai Andrew Rouditchenko A. Oliva James R. Glass SSL 84 41 0 10 Jun 2021
Domain Specific Transporter Framework to Detect Fractures in Ultrasound Arpan Tripathi A. Hareendranathan Mahesh Raveendranatha Panicker Jack Zhang Naveenjyote Boora Jacob L. Jaremko 12 0 0 09 Jun 2021
Vector Quantized Models for Planning Sherjil Ozair Yazhe Li Ali Razavi Ioannis Antonoglou Aaron van den Oord Oriol Vinyals OffRL 96 51 0 08 Jun 2021
Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings Marcely Zanon Boito Bolaji Yusuf Lucas Ondel Aline Villavicencio Laurent Besacier 69 3 0 08 Jun 2021
NWT: Towards natural audio-to-video generation with representation learning Rayhane Mama Marc S. Tyndel Hashiam Kadhim Cole Clifford Ragavan Thurairatnam VGen 112 12 0 08 Jun 2021
Interpretable agent communication from scratch (with a generic visual processor emerging on the side) Roberto Dessì Eugene Kharitonov Marco Baroni 95 28 0 08 Jun 2021
Weakly-supervised word-level pronunciation error detection in non-native English speech Daniel Korzekwa Jaime Lorenzo-Trueba Thomas Drugman Shira Calamaro B. Kostek 37 13 0 07 Jun 2021
Neural Distributed Source Coding Jay Whang Alliot Nagle Anish Acharya Hyeji Kim A. Dimakis 93 21 0 05 Jun 2021
On Perceptual Lossy Compression: The Cost of Perceptual Reconstruction and An Optimal Training Framework Zeyu Yan Fei Wen R. Ying Chao Ma Peilin Liu 86 38 0 05 Jun 2021
The Image Local Autoregressive Transformer Chenjie Cao Yue Hong Xiang Li Chengrong Wang C. Xu Xiangyang Xue Yanwei Fu 82 13 0 04 Jun 2021
Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation Saurabhchand Bhati Jesús Villalba Piotr Żelasko Laureano Moro-Velazquez Najim Dehak SSL 86 37 0 03 Jun 2021
A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion Wen-Chin Huang Kazuhiro Kobayashi Yu-Huai Peng Ching-Feng Liu Yu Tsao Hsin-Min Wang Tomoki Toda 86 11 0 02 Jun 2021
Container: Context Aggregation Network Peng Gao Jiasen Lu Hongsheng Li Roozbeh Mottaghi Aniruddha Kembhavi ViT 108 72 0 02 Jun 2021
What Can I Do Here? Learning New Skills by Imagining Visual Affordances Alexander Khazatsky Ashvin Nair Dan Jing Sergey Levine LM&Ro 84 33 0 01 Jun 2021
DISSECT: Disentangled Simultaneous Explanations via Concept Traversals Asma Ghandeharioun Been Kim Chun-Liang Li Brendan Jou B. Eoff Rosalind W. Picard AAML 102 54 0 31 May 2021
Factorising Meaning and Form for Intent-Preserving Paraphrasing Tom Hosking Mirella Lapata OOD 86 41 0 31 May 2021
Cascaded Diffusion Models for High Fidelity Image Generation Jonathan Ho Chitwan Saharia William Chan David J. Fleet Mohammad Norouzi Tim Salimans 306 1,246 0 30 May 2021
Diffusion-Based Representation Learning K. Abstreiter Sarthak Mittal Stefan Bauer Bernhard Schölkopf Arash Mehrjou DiffM 96 58 0 29 May 2021
M6-UFC: Unifying Multi-Modal Controls for Conditional Image Synthesis via Non-Autoregressive Generative Transformers Zhu Zhang Jianxin Ma Chang Zhou Rui Men Zhikang Li Ming Ding Jie Tang Jingren Zhou Hongxia Yang 110 47 0 29 May 2021
CogView: Mastering Text-to-Image Generation via Transformers Ming Ding Zhuoyi Yang Wenyi Hong Wendi Zheng Chang Zhou ... Junyang Lin Xu Zou Zhou Shao Hongxia Yang Jie Tang ViT VLM 164 784 0 26 May 2021
Deep Neural Networks and End-to-End Learning for Audio Compression Daniela N. Rim I. Jang Heeyoul Choi 64 9 0 25 May 2021
EXoN: EXplainable encoder Network SeungHwan An Hosik Choi Jong-June Jeon BDL DRL 76 5 0 23 May 2021
Compositional Fine-Grained Low-Shot Learning Dat T. Huynh Ehsan Elhamifar 83 4 0 21 May 2021
Combining Transformer Generators with Convolutional Discriminators Ricard Durall Stanislav Frolov Jörn Hees Federico Raue Franz-Josef Pfreundt Andreas Dengel J. Keuper ViT 76 16 0 21 May 2021
Parallel and Flexible Sampling from Autoregressive Models via Langevin Dynamics V. Jayaram John Thickstun DiffM 107 25 0 17 May 2021
Priors in Bayesian Deep Learning: A Review Vincent Fortuin UQCV BDL 141 134 0 14 May 2021
High-Resolution Complex Scene Synthesis with Transformers Manuel Jahn Robin Rombach Bjorn Ommer ViT 85 37 0 13 May 2021
Autoencoding Under Normalization Constraints Sangwoong Yoon Yung-Kyun Noh Frank C. Park OODD UQCV 86 39 0 12 May 2021
Discrete representations in neural models of spoken language Bertrand Higy Lieke Gelderloos Afra Alishahi Grzegorz Chrupała 144 6 0 12 May 2021
Diffusion Models Beat GANs on Image Synthesis Prafulla Dhariwal Alex Nichol 562 8,017 0 11 May 2021
MuseMorphose: Full-Song and Fine-Grained Piano Music Style Transfer with One Transformer VAE Shih-Lun Wu Yi-Hsuan Yang ViT 109 55 0 10 May 2021
Voice Conversion Based Speaker Normalization for Acoustic Unit Discovery Thomas Glarner Janek Ebbers Reinhold Häb-Umbach DRL 30 1 0 04 May 2021
VQCPC-GAN: Variable-Length Adversarial Audio Synthesis Using Vector-Quantized Contrastive Predictive Coding J. Nistal Cyran Aouameur Stefan Lattner G. Richard 105 7 0 04 May 2021
OCTOPUS: Overcoming Performance andPrivatization Bottlenecks in Distributed Learning Shuo Wang Surya Nepal Kristen Moore M. Grobler Carsten Rudolph A. Abuadbba FedML 76 8 0 03 May 2021
GODIVA: Generating Open-DomaIn Videos from nAtural Descriptions Chenfei Wu Lun Huang Qianxi Zhang Binyang Li Lei Ji Fan Yang Guillermo Sapiro Nan Duan DiffM VGen 120 245 0 30 Apr 2021
Eccentric Regularization: Minimizing Hyperspherical Energy without explicit projection Xuefeng Li Alan Blair 62 0 0 23 Apr 2021
Protecting gender and identity with disentangled speech representations Dimitrios Stoidis Andrea Cavallaro 74 10 0 22 Apr 2021
IB-DRR: Incremental Learning with Information-Back Discrete Representation Replay Jian Jiang Edoardo Cetin Oya Celiktutan 61 9 0 21 Apr 2021
VideoGPT: Video Generation using VQ-VAE and Transformers Wilson Yan Yunzhi Zhang Pieter Abbeel A. Srinivas ViT VGen 345 514 0 20 Apr 2021
Conditional Variational Capsule Network for Open Set Recognition Yunrui Guo Guglielmo Camporese Wenjing Yang A. Sperduti Lamberto Ballan BDL 90 45 0 19 Apr 2021