
Neural Discrete Representation Learning

2 November 2017

Papers citing "Neural Discrete Representation Learning"

50 / 3,267 papers shown

Physics Driven Domain Specific Transporter Framework with Attention Mechanism for Ultrasound Imaging Arpan Tripathi A. Rakkunedeth Mahesh Raveendranatha Panicker Jack Zhang Naveenjyote Boora Jessica Knight Jacob L. Jaremko Yale Tung Chen K. Narayan C. Kesavadas OOD MedIm 97 0 0 13 Sep 2021
Incorporating Real-world Noisy Speech in Neural-network-based Speech Enhancement Systems Yangyang Xia Buye Xu Anurag Kumar 40 7 0 11 Sep 2021
StreamHover: Livestream Transcript Summarization and Annotation Sangwoo Cho Franck Dernoncourt Timothy Jeewun Ganter Trung Bui Nedim Lipka Walter Chang Hailin Jin Jonathan Brandt H. Foroosh Fei Liu 3DGS AI4TS 79 29 0 11 Sep 2021
Text-Free Prosody-Aware Generative Spoken Language Modeling Eugene Kharitonov Ann Lee Adam Polyak Yossi Adi Jade Copet ... Tu Nguyen M. Rivière Abdel-rahman Mohamed Emmanuel Dupoux Wei-Ning Hsu 118 122 0 07 Sep 2021
Self-supervised Tumor Segmentation through Layer Decomposition Xiaoman Zhang Weidi Xie Chaoqin Huang Yanfeng Wang Ya Zhang Xin Chen Qi Tian 83 6 0 07 Sep 2021
Aspect-Controllable Opinion Summarization Reinald Kim Amplayo Stefanos Angelidis Mirella Lapata 76 75 0 07 Sep 2021
Multi-Agent Variational Occlusion Inference Using People as Sensors Masha Itkina Ye-Ji Mun Katherine Driggs-Campbell Mykel J. Kochenderfer 96 25 0 05 Sep 2021
What Users Want? WARHOL: A Generative Model for Recommendation Jules Samaran Ugo Tanielian Romain Beaumont Flavian Vasile HAI 42 0 0 02 Sep 2021
Learning Disentangled Representations in the Imaging Domain Xiao Liu Pedro Sanchez Spyridon Thermos Alison Q. O'Neil Sotirios A. Tsaftaris OOD DRL 202 72 0 26 Aug 2021
Adversarially Robust One-class Novelty Detection Shao-Yuan Lo Poojan Oza Vishal M. Patel AAML 78 32 0 25 Aug 2021
ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis Patrick Esser Robin Rombach A. Blattmann Bjorn Ommer DiffM 119 162 0 19 Aug 2021
Transformers predicting the future. Applying attention in next-frame and time series forecasting Radostin Cholakov T. Kolev AI4TS 55 17 0 18 Aug 2021
Cross-modal Spectrum Transformation Network For Acoustic Scene classification Yang Liu A. Neophytou Sunando Sengupta Eric Sommerlade 97 9 0 13 Aug 2021
PixelSynth: Generating a 3D-Consistent Experience from a Single Image C. Rockwell David Fouhey Justin Johnson VGen 146 86 0 12 Aug 2021
Analysis of ODE2VAE with Examples Batuhan Koyuncu DRL 38 0 0 10 Aug 2021
StarGAN-VC+ASR: StarGAN-based Non-Parallel Voice Conversion Regularized by Automatic Speech Recognition Shoki Sakamoto Akira Taniguchi T. Taniguchi Hirokazu Kameoka BDL 63 5 0 10 Aug 2021
Information Bottleneck Approach to Spatial Attention Learning Qiuxia Lai Yu Li Ailing Zeng Minhao Liu Hanqiu Sun Qiang Xu 98 9 0 07 Aug 2021
Applying the Information Bottleneck Principle to Prosodic Representation Learning Guangyan Zhang Ying Qin Daxin Tan Tan Lee 79 4 0 05 Aug 2021
RockGPT: Reconstructing three-dimensional digital rocks from single two-dimensional slice from the perspective of video generation Qi Zheng Dongxiao Zhang 33 22 0 05 Aug 2021
Deep Quantized Representation for Enhanced Reconstruction Akash Gupta Abhishek Aich Kevin Rodriguez G. Reddy Amit K. Roy-Chowdhury 15 5 0 29 Jul 2021
Fast and Scalable Image Search For Histology Chengkuan Chen Ming Y. Lu Drew F. K. Williamson Tiffany Y. Chen A. J. Schaumberg Faisal Mahmood 34 2 0 28 Jul 2021
Unsupervised Learning of Neurosymbolic Encoders Eric Zhan Jennifer J. Sun Ann Kennedy Yisong Yue Swarat Chaudhuri 102 14 0 28 Jul 2021
Improving Robot Localisation by Ignoring Visual Distraction Oscar Alejandro Mendez Maldonado M. Vowels Richard Bowden 35 1 0 25 Jul 2021
Introducing: DeepHead, Wide-band Electromagnetic Imaging Paradigm Ahmed Al-Saffar L. Guo A. Abbosh MedIm 21 0 0 23 Jul 2021
Abstract Reasoning via Logic-guided Generation Sihyun Yu Sangwoo Mo SungSoo Ahn Jinwoo Shin 72 6 0 22 Jul 2021
Data synthesis and adversarial networks: A review and meta-analysis in cancer imaging Richard Osuala Kaisar Kushibar Lidia Garrucho Akis Linardos Zuzanna Szafranowska Stefan Klein Ben Glocker Oliver Díaz Karim Lekadir MedIm 106 45 0 20 Jul 2021
Generative Video Transformer: Can Objects be the Words? Yi-Fu Wu Jaesik Yoon Sungjin Ahn ViT 114 34 0 20 Jul 2021
GenRadar: Self-supervised Probabilistic Camera Synthesis based on Radar Frequencies Carsten Ditzel Klaus C. J. Dietmayer 56 3 0 19 Jul 2021
Translatotron 2: High-quality direct speech-to-speech translation with voice preservation Ye Jia Michelle Tadmor Ramanovich Tal Remez Roi Pomerantz 107 73 0 19 Jul 2021
Unsupervised Skill-Discovery and Skill-Learning in Minecraft J. J. Nieto Roger Creus Xavier Giró-i-Nieto SSL DRL 71 4 0 18 Jul 2021
Learning De-identified Representations of Prosody from Raw Audio J. Weston R. Lenain U. Meepegama E. Fristed SSL 68 17 0 17 Jul 2021
CCVS: Context-aware Controllable Video Synthesis G. L. Moing Jean Ponce Cordelia Schmid 105 81 0 16 Jul 2021
Codified audio language modeling learns useful representations for music information retrieval Rodrigo Castellon Chris Donahue Percy Liang 148 91 0 12 Jul 2021
PocketVAE: A Two-step Model for Groove Generation and Control Kyungyun Lee Wonil Kim Juhan Nam 46 1 0 11 Jul 2021
SoundStream: An End-to-End Neural Audio Codec Neil Zeghidour Alejandro Luebs Ahmed Omran Jan Skoglund Marco Tagliasacchi AI4TS 120 806 0 07 Jul 2021
Discrete-Valued Neural Communication Dianbo Liu Alex Lamb Kenji Kawaguchi Anirudh Goyal Chen Sun Michael C. Mozer Yoshua Bengio 95 52 0 06 Jul 2021
Classical Planning in Deep Latent Space Masataro Asai Hiroshi Kajino A. Fukunaga Christian Muise VLM 98 19 0 30 Jun 2021
A Generative Model for Raw Audio Using Transformer Architectures Prateek Verma C. Chafe 89 29 0 30 Jun 2021
CLIPDraw: Exploring Text-to-Drawing Synthesis through Language-Image Encoders Kevin Frans Lisa Soros Olaf Witkowski CLIP 105 212 0 28 Jun 2021
Transflower: probabilistic autoregressive dance generation with multimodal attention Guillermo Valle Pérez G. Henter Jonas Beskow A. Holzapfel Pierre-Yves Oudeyer Simon Alexanderson 133 43 0 25 Jun 2021
On Incorporating Inductive Biases into VAEs Ning Miao Emile Mathieu N. Siddharth Yee Whye Teh Tom Rainforth CML DRL 99 11 0 25 Jun 2021
Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance Hieu-Thi Luong Junichi Yamagishi 85 0 0 25 Jun 2021
NP-DRAW: A Non-Parametric Structured Latent Variable Model for Image Generation Xiaohui Zeng R. Urtasun R. Zemel Sanja Fidler Renjie Liao DiffM 45 2 0 25 Jun 2021
Generative Modeling for Multi-task Visual Learning Zhipeng Bao M. Hebert Yu-Xiong Wang 64 17 0 25 Jun 2021
Handling Data Heterogeneity with Generative Replay in Collaborative Learning for Medical Imaging Liangqiong Qu N. Balachandar Miao Zhang D. Rubin MedIm 89 23 0 24 Jun 2021
VIMPAC: Video Pre-Training via Masked Token Prediction and Contrastive Learning Hao Tan Jie Lei Thomas Wolf Joey Tianyi Zhou 120 67 0 21 Jun 2021
Trainable Class Prototypes for Few-Shot Learning Jianyi Li Guizhong Liu VLM 49 2 0 21 Jun 2021
Discrete Auto-regressive Variational Attention Models for Text Modeling Xianghong Fang Haoli Bai Jian Li Zenglin Xu Michael Lyu Irwin King 73 3 0 16 Jun 2021
Test Sample Accuracy Scales with Training Sample Density in Neural Networks Xu Ji Razvan Pascanu Devon Hjelm Balaji Lakshminarayanan Andrea Vedaldi 70 8 0 15 Jun 2021
Self-Supervised Learning with Kernel Dependence Maximization Yazhe Li Roman Pogodin Danica J. Sutherland Arthur Gretton SSL 100 85 0 15 Jun 2021