Neural Discrete Representation Learning

2 November 2017

Papers citing "Neural Discrete Representation Learning"

50 / 3,267 papers shown

Title
Multiresolution Signal Processing of Financial Market Objects Ioana Boier 35 2 0 28 Oct 2022
One-Shot Acoustic Matching Of Audio Signals -- Learning to Hear Music In Any Room/ Concert Hall Prateek Verma C. Chafe J. Berger 67 1 0 27 Oct 2022
Broken Neural Scaling Laws Ethan Caballero Kshitij Gupta Irina Rish David M. Krueger 194 76 0 26 Oct 2022
Leveraging Demonstrations with Latent Space Priors Jonas Gehring Deepak Gopinath Jungdam Won Andreas Krause Gabriel Synnaeve Nicolas Usunier 81 6 0 26 Oct 2022
Planning with Occluded Traffic Agents using Bi-Level Variational Occlusion Models Filippos Christianos Peter Karkus Boris Ivanovic Stefano V. Albrecht Marco Pavone 97 10 0 26 Oct 2022
Discovering Design Concepts for CAD Sketches Yuezhi Yang Hao Pan 73 12 0 26 Oct 2022
A Survey on Artificial Intelligence for Music Generation: Agents, Domains and Perspectives Carlos Hernandez-Olivan Javier Hernandez-Olivan J. R. Beltrán MGen 103 7 0 25 Oct 2022
High Fidelity Neural Audio Compression Alexandre Défossez Jade Copet Gabriel Synnaeve Yossi Adi 136 674 0 24 Oct 2022
Language Model Pre-Training with Sparse Latent Typing Liliang Ren Zixuan Zhang H. Wang Clare R. Voss Chengxiang Zhai Heng Ji 101 3 0 23 Oct 2022
VIOLA: Imitation Learning for Vision-Based Manipulation with Object Proposal Priors Yifeng Zhu Abhishek Joshi Peter Stone Yuke Zhu LM&Ro 97 134 0 20 Oct 2022
Coordinates Are NOT Lonely -- Codebook Prior Helps Implicit Neural 3D Representations Fukun Yin Wen Liu Zilong Huang Pei Cheng Tao Chen Gang Yu 63 19 0 20 Oct 2022
DisC-VC: Disentangled and F0-Controllable Neural Voice Conversion Chihiro Watanabe Hirokazu Kameoka DRL 116 0 0 20 Oct 2022
Representation Learning with Diffusion Models Jeremias Traub DiffM 97 8 0 20 Oct 2022
OCR-VQGAN: Taming Text-within-Image Generation Juan A. Rodriguez David Vazquez I. Laradji M. Pedersoli Pau Rodríguez López 154 20 0 19 Oct 2022
PoseGPT: Quantization-based 3D Human Motion Generation and Forecasting Thomas Lucas Fabien Baradel Philippe Weinzaepfel Grégory Rogez 109 75 0 19 Oct 2022
FedForgery: Generalized Face Forgery Detection with Residual Federated Learning Decheng Liu Zhan Dang Chunlei Peng Yu Zheng Shuang Li N. Wang Xinbo Gao FedML 88 35 0 18 Oct 2022
Explaining Image Classification with Visual Debates Avinash Kori Ben Glocker Francesca Toni 69 1 0 17 Oct 2022
Improving Object-centric Learning with Query Optimization Baoxiong Jia Yu Liu Siyuan Huang OCL 99 52 0 17 Oct 2022
DiffGAR: Model-Agnostic Restoration from Generative Artifacts Using Image-to-Image Diffusion Models Yueqin Yin Lianghua Huang Yu Liu Kaiqiang Huang DiffM 76 12 0 16 Oct 2022
Character-Centric Story Visualization via Visual Planning and Token Alignment Hong Chen Rujun Han Te-Lin Wu Hideki Nakayama Nanyun Peng DiffM VGen 96 32 0 16 Oct 2022
Generalization with Lossy Affordances: Leveraging Broad Offline Data for Learning Visuomotor Tasks Kuan Fang Patrick Yin Ashvin Nair Homer Walke Ge Yan Sergey Levine OffRL 96 25 0 12 Oct 2022
Anomaly Detection using Generative Models and Sum-Product Networks in Mammography Scans Marc Dietrichstein David Major Martin Trapp M. Wimmer Dimitrios Lenis Philip Winter Astrid Berg Theresa Neubauer Katja Bühler MedIm 49 4 0 12 Oct 2022
JukeDrummer: Conditional Beat-aware Audio-domain Drum Accompaniment Generation via Transformer VQ-VAE Yueh-Kao Wu Ching-Yu Chiu Yi-Hsuan Yang ViT 79 15 0 12 Oct 2022
3D Brain and Heart Volume Generative Models: A Survey Yanbin Liu Girish Dwivedi F. Boussaïd Bennamoun MedIm AI4CE 107 6 0 12 Oct 2022
Underspecification in Scene Description-to-Depiction Tasks Ben Hutchinson Jason Baldridge Vinodkumar Prabhakaran DiffM 128 34 0 11 Oct 2022
Automatic Speech Recognition of Low-Resource Languages Based on Chukchi Anastasia N. Safonova Tatiana Yudina Emil Nadimanov Cydnie Davenport 54 3 0 11 Oct 2022
Style-Guided Inference of Transformer for High-resolution Image Synthesis Jonghwa Yim Minjae Kim ViT 103 0 0 11 Oct 2022
Hybrid Inverted Index Is a Robust Accelerator for Dense Retrieval Peitian Zhang Zheng Liu Shitao Xiao Zhicheng Dou Jing Yao 98 6 0 11 Oct 2022
LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward DaeJin Jo Sungwoong Kim D. W. Nam Taehwan Kwon Seungeun Rho Jongmin Kim Donghoon Lee OffRL 76 10 0 11 Oct 2022
Continual Learning by Modeling Intra-Class Variation L. Yu Tianyang Hu Lanqing Hong Zhen Liu Adrian Weller Weiyang Liu CLL 83 13 0 11 Oct 2022
Race Bias Analysis of Bona Fide Errors in face anti-spoofing Latifah Abduh I. Ivrissimtzis CVBM 98 2 0 11 Oct 2022
It Takes Two: Masked Appearance-Motion Modeling for Self-supervised Video Transformer Pre-training Yuxin Song Min Yang Wenhao Wu Dongliang He Fu Li Jingdong Wang ViT 150 9 0 11 Oct 2022
Markup-to-Image Diffusion Models with Scheduled Sampling Yuntian Deng Noriyuki Kojima Alexander M. Rush DiffM 86 4 0 11 Oct 2022
ConchShell: A Generative Adversarial Networks that Turns Pictures into Piano Music Wanshu Fan Yu-Chuan Su Yuxin Huang GAN 36 2 0 11 Oct 2022
f-DM: A Multi-stage Diffusion Model via Progressive Signal Transformation Jiatao Gu Shuangfei Zhai Yizhe Zhang Miguel Angel Bautista J. Susskind DiffM 103 27 0 10 Oct 2022
LMQFormer: A Laplace-Prior-Guided Mask Query Transformer for Lightweight Snow Removal Junhong Lin Nanfeng Jiang Zhentao Zhang Weiling Chen Tiesong Zhao 88 21 0 10 Oct 2022
HORIZON: High-Resolution Semantically Controlled Panorama Synthesis Kun Yan Lei Ji Chenfei Wu Jian Liang Ming Zhou Nan Duan Shuai Ma 80 0 0 10 Oct 2022
Scaling Up Probabilistic Circuits by Latent Variable Distillation Hoang Trung-Dung Honghua Zhang Guy Van den Broeck TPM 73 27 0 10 Oct 2022
Dual-distribution discrepancy with self-supervised refinement for anomaly detection in medical images Yu Cai Hao Chen Xin Yang Yu Zhou Kwang-Ting Cheng 133 51 0 09 Oct 2022
Visualize Before You Write: Imagination-Guided Open-Ended Text Generation Wanrong Zhu An Yan Yujie Lu Wenda Xu Xinze Wang Miguel P. Eckstein William Yang Wang 137 36 0 07 Oct 2022
SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training Zi-Hua Zhang Long Zhou Junyi Ao Shujie Liu Lirong Dai Jinyu Li Furu Wei 134 58 0 07 Oct 2022
Dynamic Latent Separation for Deep Learning Yi-Lin Tuan Zih-Yun Chiu William Yang Wang 89 0 0 07 Oct 2022
A deep learning approach for detection and localization of leaf anomalies David M. Calabro Massimiliano Lupo Pasini Nicola Ferro S. Perotto 44 1 0 07 Oct 2022
Compressed Vision for Efficient Video Understanding Olivia Wiles João Carreira Iain Barr Andrew Zisserman Mateusz Malinowski 56 7 0 06 Oct 2022
Phenaki: Variable Length Video Generation From Open Domain Textual Description Ruben Villegas Mohammad Babaeizadeh Pieter-Jan Kindermans Hernan Moraldo Han Zhang M. Saffar Santiago Castro Julius Kunze D. Erhan DiffM VGen 173 396 0 05 Oct 2022
Temporally Consistent Transformers for Video Generation Wilson Yan Danijar Hafner Stephen James Pieter Abbeel DiffM 94 31 0 05 Oct 2022
Progressive Text-to-Image Generation Zhengcong Fei Mingyuan Fan Li Zhu Junshi Huang 185 4 0 05 Oct 2022
Vision+X: A Survey on Multimodal Learning in the Light of Data Ye Zhu Yuehua Wu N. Sebe Yan Yan 119 19 0 05 Oct 2022
Stateful active facilitator: Coordination and Environmental Heterogeneity in Cooperative Multi-Agent Reinforcement Learning Dianbo Liu Vedant Shah Oussama Boussif Cristian Meo Anirudh Goyal Tianmin Shu Michael C. Mozer N. Heess Yoshua Bengio 134 8 0 04 Oct 2022
Enhancing Spatiotemporal Prediction Model using Modular Design and Beyond Haoyu Pan Hao Wu Tan Yang AI4TS 80 0 0 04 Oct 2022