1711.00937
Cited By
Neural Discrete Representation Learning
2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
Papers citing "Neural Discrete Representation Learning"
50 / 2,839 papers shown
Title
Bidirectional Autoregressive Diffusion Model for Dance Generation
Canyu Zhang
Youbao Tang
Ning Zhang
Ruei-Sung Lin
Mei Han
Jing Xiao
Song Wang
38
8
0
06 Feb 2024
MOMENT: A Family of Open Time-series Foundation Models
Mononito Goswami
Konrad Szafer
Arjun Choudhry
Yifu Cai
Shuo Li
Artur Dubrawski
AIFin
AI4TS
74
119
0
06 Feb 2024
Modeling Spatio-temporal Dynamical Systems with Neural Discrete Learning and Levels-of-Experts
Kun Wang
Hao Wu
Guibin Zhang
Sihang Li
Yuxuan Liang
Yuankai Wu
Roger Zimmermann
Yang Wang
32
9
0
06 Feb 2024
Professional Agents -- Evolving Large Language Models into Autonomous Experts with Human-Level Competencies
Zhixuan Chu
Yan Wang
Feng Zhu
Lu Yu
Longfei Li
Jinjie Gu
LLMAG
30
8
0
06 Feb 2024
Revisiting the Dataset Bias Problem from a Statistical Perspective
Kien Do
D. Nguyen
Hung Le
T. Le
Dang Nguyen
Haripriya Harikumar
T. Tran
Santu Rana
Svetha Venkatesh
34
0
0
05 Feb 2024
Denoising Diffusion via Image-Based Rendering
Titas Anciukevicius
Fabian Manhardt
Federico Tombari
Paul Henderson
67
11
0
05 Feb 2024
InstanceDiffusion: Instance-level Control for Image Generation
Xudong Wang
Trevor Darrell
Sai Saketh Rambhatla
Rohit Girdhar
Ishan Misra
VLM
DiffM
39
91
0
05 Feb 2024
ISPA: Inter-Species Phonetic Alphabet for Transcribing Animal Sounds
Masato Hagiwara
Marius Miron
Jen-Yu Liu
39
1
0
05 Feb 2024
Minimum Description Length and Generalization Guarantees for Representation Learning
Romain Chor
Abdellatif Zaidi
Piotr Krasnowski
81
8
0
05 Feb 2024
Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization
Yang Jin
Zhicheng Sun
Kun Xu
Kun Xu
Liwei Chen
...
Yuliang Liu
Di Zhang
Yang Song
Kun Gai
Yadong Mu
VGen
55
44
0
05 Feb 2024
Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives
Sheng Luo
Wei Chen
Wanxin Tian
Rui Liu
Luanxuan Hou
...
Ling Shao
Yi Yang
Bojun Gao
Qun Li
Guobin Wu
81
14
0
05 Feb 2024
Trinity: Syncretizing Multi-/Long-tail/Long-term Interests All in One
Jing Yan
Liu Jiang
Jianfei Cui
Zhichen Zhao
Xingyan Bin
Feng Zhang
Zuotao Liu
17
2
0
05 Feb 2024
Focal Modulation Networks for Interpretable Sound Classification
Luca Della Libera
Cem Subakan
Mirco Ravanelli
41
2
0
05 Feb 2024
Can Large Language Models Learn Independent Causal Mechanisms?
Gaël Gendron
Bao Trung Nguyen
A. Peng
Michael Witbrock
Gillian Dobbie
LRM
43
4
0
04 Feb 2024
FoldToken: Learning Protein Language via Vector Quantization and Beyond
Zhangyang Gao
Cheng Tan
Jue Wang
Yufei Huang
Lirong Wu
Stan Z. Li
35
9
0
04 Feb 2024
MixedNUTS: Training-Free Accuracy-Robustness Balance via Nonlinearly Mixed Classifiers
Yatong Bai
Mo Zhou
Vishal M. Patel
Somayeh Sojoudi
AAML
39
8
0
03 Feb 2024
Position: Graph Foundation Models are Already Here
Haitao Mao
Zhikai Chen
Wenzhuo Tang
Jianan Zhao
Yao Ma
Tong Zhao
Neil Shah
Mikhail Galkin
Jiliang Tang
AI4CE
69
30
0
03 Feb 2024
S-NeRF++: Autonomous Driving Simulation via Neural Reconstruction and Generation
Yurui Chen
Jing Zhang
Ziyang Xie
Wenye Li
Feihu Zhang
Jiachen Lu
Li Zhang
65
12
0
03 Feb 2024
Spiking Music: Audio Compression with Event Based Auto-encoders
Martim Lisboa
Guillaume Bellec
53
2
0
02 Feb 2024
Can MLLMs Perform Text-to-Image In-Context Learning?
Yuchen Zeng
Wonjun Kang
Yicong Chen
Hyung Il Koo
Kangwook Lee
MLLM
41
11
0
02 Feb 2024
An Intra-BRNN and GB-RVQ Based END-TO-END Neural Audio Codec
Linping Xu
Jiawei Jiang
Dejun Zhang
Xianjun Xia
Li Chen
Yijian Xiao
Piao Ding
Shenyi Song
Sixing Yin
Ferdous Sohel
42
6
0
02 Feb 2024
Neural Language of Thought Models
Yi-Fu Wu
Minseung Lee
Sungjin Ahn
MLLM
VLM
85
6
0
02 Feb 2024
Large Language Models for Time Series: A Survey
Xiyuan Zhang
Ranak Roy Chowdhury
Rajesh K. Gupta
Jingbo Shang
AI4TS
90
55
0
02 Feb 2024
A Survey for Foundation Models in Autonomous Driving
Haoxiang Gao
Yaqian Li
Kaiwen Long
Ming Yang
Yiqing Shen
VLM
LRM
58
29
0
02 Feb 2024
IMUGPT 2.0: Language-Based Cross Modality Transfer for Sensor-Based Human Activity Recognition
Zi-Jian Leng
Amitrajit Bhattacharjee
Hrudhai Rajasekhar
Lizhe Zhang
Elizabeth Bruda
Hyeokhyen Kwon
Thomas Plötz
VLM
59
13
0
01 Feb 2024
Improving Semantic Control in Discrete Latent Spaces with Transformer Quantized Variational Autoencoders
Yingji Zhang
Danilo S. Carvalho
Marco Valentino
Ian Pratt-Hartmann
André Freitas
DRL
61
5
0
01 Feb 2024
Dynamic Texture Transfer using PatchMatch and Transformers
Guo Pu
Shiyao Xu
Xixin Cao
Zhouhui Lian
49
2
0
01 Feb 2024
AEROBLADE: Training-Free Detection of Latent Diffusion Images Using Autoencoder Reconstruction Error
Jonas Ricker
Denis Lukovnikov
Asja Fischer
72
32
0
31 Jan 2024
Robustly overfitting latents for flexible neural image compression
Yura Perugachi-Diaz
Arwin Gansekoele
Sandjai Bhulai
51
1
0
31 Jan 2024
StrokeNUWA: Tokenizing Strokes for Vector Graphic Synthesis
Zecheng Tang
Chenfei Wu
Zekai Zhang
Mingheng Ni
Sheng-Siang Yin
...
Zhengyuan Yang
Lijuan Wang
Zicheng Liu
Juntao Li
Nan Duan
32
10
0
30 Jan 2024
Effective Communication with Dynamic Feature Compression
Pietro Talli
Francesco Pase
Federico Chiariotti
Andrea Zanella
Michele Zorzi
53
3
0
29 Jan 2024
Image-Text Out-Of-Context Detection Using Synthetic Multimodal Misinformation
Fatma Shalabi
H. Nguyen
Hichem Felouat
Ching-Chun Chang
Isao Echizen
54
5
0
29 Jan 2024
Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors
Shiyin Dong
Mingrui Zhu
Kun Cheng
Nannan Wang
Xinbo Gao
DiffM
30
3
0
29 Jan 2024
Diffusion Facial Forgery Detection
Harry Cheng
Yangyang Guo
Tianyi Wang
L. Nie
Mohan S. Kankanhalli
74
18
0
29 Jan 2024
Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance
Qingcheng Zhao
Pengyu Long
Qixuan Zhang
Dafei Qin
Hanming Liang
Longwen Zhang
Yingliang Zhang
Jingyi Yu
Lan Xu
DiffM
3DH
38
25
0
28 Jan 2024
Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach
Shaofeng Zhang
Jinfa Huang
Qiang-feng Zhou
Zhibin Wang
Fan Wang
Jiebo Luo
Junchi Yan
DiffM
58
12
0
28 Jan 2024
A Survey on Neural Topic Models: Methods, Applications, and Challenges
Xiaobao Wu
Thong Nguyen
Anh Tuan Luu
BDL
36
36
0
27 Jan 2024
Annotated Hands for Generative Models
Yue Yang
Atith N Gandhi
Greg Turk
DiffM
GAN
31
3
0
26 Jan 2024
Residual Quantization with Implicit Neural Codebooks
Iris A. M. Huijben
Matthijs Douze
Matthew Muckley
Ruud J. G. van Sloun
Jakob Verbeek
MQ
34
11
0
26 Jan 2024
Deep Joint Source-Channel Coding for Efficient and Reliable Cross-Technology Communication
Shumin Yao
Xiaodong Xu
Hao Chen
Yaping Sun
Qinglin Zhao
33
1
0
26 Jan 2024
Within-basket Recommendation via Neural Pattern Associator
Kai Luo
Tianshu Shen
Lan Yao
Ga Wu
A. Liblong
Istvan Fehervari
Ruijian An
Jawad Ahmed
Harshit Mishra
Charu Pujari
AI4TS
32
2
0
25 Jan 2024
Deconstructing Denoising Diffusion Models for Self-Supervised Learning
Xinlei Chen
Zhuang Liu
Saining Xie
Kaiming He
DiffM
43
54
0
25 Jan 2024
Improving Antibody Humanness Prediction using Patent Data
Talip Uçar
Aubin Ramon
Dino Oglic
Rebecca Croasdale-wood
Tom Diethe
Pietro Sormanni
38
1
0
25 Jan 2024
Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild
Fanghua Yu
Jinjin Gu
Zheyuan Li
Jinfan Hu
Xiangtao Kong
Xintao Wang
Jingwen He
Yu Qiao
Chao Dong
47
134
0
24 Jan 2024
Masked Particle Modeling on Sets: Towards Self-Supervised High Energy Physics Foundation Models
T. Golling
Lukas Heinrich
Michael Kagan
Samuel Klein
Matthew Leigh
Margarita Osadchy
J. A. Raine
47
24
0
24 Jan 2024
Generative Human Motion Stylization in Latent Space
Chuan Guo
Yuxuan Mu
Wei Ji
Peng Dai
Youliang Yan
Juwei Lu
Li Cheng
VGen
43
11
0
24 Jan 2024
PA-SAM: Prompt Adapter SAM for High-Quality Image Segmentation
Zhaozhi Xie
Bochen Guan
Weihao Jiang
Muyang Yi
Yue Ding
Hongtao Lu
Lei Zhang
VLM
46
13
0
23 Jan 2024
CloSe: A 3D Clothing Segmentation Dataset and Model
Dimitrije Antic
Garvita Tiwari
Batuhan Ozcomlekci
R. Marin
Gerard Pons-Moll
3DPC
3DH
22
9
0
22 Jan 2024
Empowering Communication: Speech Technology for Indian and Western Accents through AI-powered Speech Synthesis
R. Vinotha
D. Hepsiba
L. D. V. Anand
Deepak John Reji
13
1
0
22 Jan 2024
Text-to-Image Cross-Modal Generation: A Systematic Review
Maciej Żelaszczyk
Jacek Mańdziuk
60
4
0
21 Jan 2024