
Neural Discrete Representation Learning

2 November 2017

Papers citing "Neural Discrete Representation Learning"

50 / 3,267 papers shown

Title
Neuromorphic Wireless Cognition: Event-Driven Semantic Communications for Remote Inference Jiechen Chen N. Skatchkovsky Osvaldo Simeone 80 37 0 13 Jun 2022
Comparative Snippet Generation Saurabh Jain Yisong Miao Min-Yen Kan 42 0 0 11 Jun 2022
Multi-instrument Music Synthesis with Spectrogram Diffusion Curtis Hawthorne Ian Simon Adam Roberts Neil Zeghidour Josh Gardner Ethan Manilow Jesse Engel DiffM 81 51 0 11 Jun 2022
PILC: Practical Image Lossless Compression with an End-to-end GPU Oriented Neural Framework Ning Kang Shanzhao Qiu Shifeng Zhang Zhenguo Li Shutao Xia 62 19 0 10 Jun 2022
Draft-and-Revise: Effective Image Generation with Contextual RQ-Transformer Doyup Lee Chiheon Kim Saehoon Kim Minsu Cho Wook-Shin Han 82 29 0 09 Jun 2022
SimVP: Simpler yet Better Video Prediction Zhangyang Gao Cheng Tan Lirong Wu Stan Z. Li 109 222 0 09 Jun 2022
Robust Semantic Communications with Masked VQ-VAE Enabled Codebook Qiyu Hu Guangyi Zhang Zhijin Qin Yunlong Cai Guanding Yu Geoffrey Ye Li AAML 96 151 0 08 Jun 2022
Patch-based Object-centric Transformers for Efficient Video Generation Wilson Yan Ryogo Okumura Stephen James Pieter Abbeel DiffM ViT 87 6 0 08 Jun 2022
Fast Unsupervised Brain Anomaly Detection and Segmentation with Diffusion Models W. H. Pinaya M. Graham Robert J. Gray P. F. D. Costa Petru-Daniel Tudosiu ... D. Werring Geraint Rees P. Nachev Sebastien Ourselin M. Jorge Cardoso DiffM MedIm 86 107 0 07 Jun 2022
Decentralized Low-Latency Collaborative Inference via Ensembles on the Edge M. Malka Erez Farhan Hai Morgenstern Nir Shlezinger FedML 74 13 0 07 Jun 2022
Intra-agent speech permits zero-shot task acquisition Chen Yan Federico Carnevale Petko Georgiev Adam Santoro Aurelia Guy Alistair Muldal Chia-Chun Hung Josh Abramson Timothy Lillicrap Greg Wayne LM&Ro 97 9 0 07 Jun 2022
Recent Advances for Quantum Neural Networks in Generative Learning Jinkai Tian Xiaoyun Sun Yuxuan Du Shanshan Zhao Qing Liu ... Xingyao Wu Min-hsiu Hsieh Tongliang Liu Wen-Bin Yang Dacheng Tao AI4CE 102 85 0 07 Jun 2022
Blended Latent Diffusion Omri Avrahami Ohad Fried Dani Lischinski DiffM 196 393 0 06 Jun 2022
Variable-rate hierarchical CPC leads to acoustic unit discovery in speech Santiago Cuervo Adrian Łańcucki R. Marxer Paweł Rychlikowski J. Chorowski SSL 87 13 0 05 Jun 2022
D'ARTAGNAN: Counterfactual Video Generation Hadrien Reynaud Athanasios Vlontzos Mischa Dombrowski Ciarán M. Gilligan-Lee A. Beqiri Paul Leeson Bernhard Kainz VGen CML MedIm 91 20 0 03 Jun 2022
Recognition of Unseen Bird Species by Learning from Field Guides Andrés C. Rodríguez Stefano D'Aronco Rodrigo Caye Daudt Jan Dirk Wegner Konrad Schindler 68 1 0 03 Jun 2022
Pronunciation Dictionary-Free Multilingual Speech Synthesis by Combining Unsupervised and Supervised Phonetic Representations Chang Liu Zhenhua Ling Linghui Chen 75 3 0 02 Jun 2022
Improving Diffusion Models for Inverse Problems using Manifold Constraints Hyungjin Chung Byeongsu Sim Dohoon Ryu J. C. Ye DiffM MedIm 253 475 0 02 Jun 2022
Modeling Image Composition for Complex Scene Generation Zuopeng Yang Daqing Liu Chaoyue Wang J. Yang Dacheng Tao ViT 117 52 0 02 Jun 2022
DiVAE: Photorealistic Images Synthesis with Denoising Diffusion Decoder Jie Shi Chenfei Wu Jian Liang Xiang Liu Nan Duan DiffM 88 26 0 01 Jun 2022
PAGER: Progressive Attribute-Guided Extendable Robust Image Generation Zohreh Azizi C.-C. Jay Kuo VLM DiffM GAN 89 9 0 01 Jun 2022
VALHALLA: Visual Hallucination for Machine Translation Yi Li Yikang Shen Yoon Kim Chun-Fu Chen Rogerio Feris David D. Cox Nuno Vasconcelos MLLM 149 40 0 31 May 2022
Improved Vector Quantized Diffusion Models Zhicong Tang Shuyang Gu Jianmin Bao Dong Chen Fang Wen DiffM 260 63 0 31 May 2022
Text2Human: Text-Driven Controllable Human Image Generation Yuming Jiang Shuai Yang Haonan Qiu Wayne Wu Chen Change Loy Ziwei Liu DiffM 176 48 0 31 May 2022
From Keypoints to Object Landmarks via Self-Training Correspondence: A novel approach to Unsupervised Landmark Discovery Dimitrios Mallis Enrique Sanchez Matt Bell Georgios Tzimiropoulos SSL 3DPC 88 7 0 31 May 2022
VQ-AR: Vector Quantized Autoregressive Probabilistic Time Series Forecasting Kashif Rasul Young-Jin Park Max Nihlén Ramström KyungHyun Kim BDL AI4TS 42 4 0 31 May 2022
SOM-CPC: Unsupervised Contrastive Learning with Self-Organizing Maps for Structured Representations of High-Rate Time Series Iris A. M. Huijben Arthur A. Nijdam S. Overeem M. V. Gilst Ruud J. G. van Sloun AI4TS 41 7 0 31 May 2022
Graph Backup: Data Efficient Backup Exploiting Markovian Transitions Zhengyao Jiang Tianjun Zhang Robert Kirk Tim Rocktaschel Edward Grefenstette OffRL 49 2 0 31 May 2022
Do self-supervised speech models develop human-like perception biases? Juliette Millet Ewan Dunbar SSL 68 23 0 31 May 2022
CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers Wenyi Hong Ming Ding Wendi Zheng Xinghan Liu Jie Tang DiffM 380 633 0 29 May 2022
Multimodal Masked Autoencoders Learn Transferable Representations Xinyang Geng Hao Liu Lisa Lee Dale Schuurmans Sergey Levine Pieter Abbeel 103 119 0 27 May 2022
3DILG: Irregular Latent Grids for 3D Generative Modeling Biao Zhang Matthias Nießner Peter Wonka 3DV 118 90 0 27 May 2022
A Survey on Long-Tailed Visual Recognition Lu Yang He Jiang Q. Song Jun Guo 93 135 0 27 May 2022
Green Hierarchical Vision Transformer for Masked Image Modeling Lang Huang Shan You Mingkai Zheng Fei Wang Chao Qian T. Yamasaki 139 73 0 26 May 2022
Learning What and Where: Disentangling Location and Identity Tracking Without Supervision Manuel Traub S. Otte Tobias Menge Matthias Karlbauer Jannik Thummel Martin Volker Butz 115 20 0 26 May 2022
Scalable Multi-Agent Model-Based Reinforcement Learning Vladimir Egorov A. Shpilman 90 27 0 25 May 2022
Structured Uncertainty in the Observation Space of Variational Autoencoders James A. G. Langley M. Monteiro Charles Jones Nick Pawlowski Ben Glocker CML OOD BDL DRL 71 2 0 25 May 2022
Emergent Communication through Metropolis-Hastings Naming Game with Deep Generative Models T. Taniguchi Yuto Yoshida Akira Taniguchi Y. Hagiwara MLLM 73 25 0 24 May 2022
RevUp: Revise and Update Information Bottleneck for Event Representation Mehdi Rezaee Francis Ferraro 94 1 0 24 May 2022
Generalization Gap in Amortized Inference Mingtian Zhang Peter Hayes David Barber BDL CML DRL 131 14 0 23 May 2022
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding Chitwan Saharia William Chan Saurabh Saxena Lala Li Jay Whang ... Raphael Gontijo-Lopes Tim Salimans Jonathan Ho David J Fleet Mohammad Norouzi VLM 675 6,107 0 23 May 2022
Vector-Quantized Input-Contextualized Soft Prompts for Natural Language Understanding Rishabh Bhardwaj Amrita Saha Guosheng Lin Soujanya Poria VLM VPVLM 53 7 0 23 May 2022
Transformer-based out-of-distribution detection for clinically safe segmentation M. Graham Petru-Daniel Tudosiu P. Wright W. H. Pinaya J. U-King-im ... H. Jäger D. Werring P. Nachev Sebastien Ourselin M. Jorge Cardoso MedIm 90 21 0 21 May 2022
Self-Supervised Speech Representation Learning: A Review Abdel-rahman Mohamed Hung-yi Lee Lasse Borgholt Jakob Drachmann Havtorn Joakim Edin ... Shang-Wen Li Karen Livescu Lars Maaløe Tara N. Sainath Shinji Watanabe SSL AI4TS 293 368 0 21 May 2022
Coordinating Policies Among Multiple Agents via an Intelligent Communication Channel Dianbo Liu Vedant Shah Oussama Boussif Cristian Meo Anirudh Goyal Tianmin Shu Michael C. Mozer N. Heess Yoshua Bengio 73 0 0 21 May 2022
Tackling Provably Hard Representative Selection via Graph Neural Networks Seyed Mehran Kazemi Anton Tsitsulin Hossein Esfandiari M. Bateni Deepak Ramachandran Bryan Perozzi Vahab Mirrokni 127 3 0 20 May 2022
UViM: A Unified Modeling Approach for Vision with Learned Guiding Codes Alexander Kolesnikov André Susano Pinto Lucas Beyer Xiaohua Zhai Jeremiah Harmsen N. Houlsby 171 72 0 20 May 2022
Diversity vs. Recognizability: Human-like generalization in one-shot generative models Victor Boutin Lakshya Singhal Xavier Thomas Thomas Serre 83 8 0 20 May 2022
Visual Concepts Tokenization Tao Yang Yuwang Wang Yan Lu Nanning Zheng OCL ViT 107 15 0 20 May 2022
Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality Xiang Li Wenhai Wang Lingfeng Yang Jian Yang 185 76 0 20 May 2022