arXiv: 1711.00937
Neural Discrete Representation Learning

2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
    BDL
    SSL
    OCL

Papers citing "Neural Discrete Representation Learning"

50 / 2,839 papers shown
Bidirectional Autoregressive Diffusion Model for Dance Generation
Canyu Zhang
Youbao Tang
Ning Zhang
Ruei-Sung Lin
Mei Han
Jing Xiao
Song Wang
38
8
0
06 Feb 2024
MOMENT: A Family of Open Time-series Foundation Models
Mononito Goswami
Konrad Szafer
Arjun Choudhry
Yifu Cai
Shuo Li
Artur Dubrawski
AIFin
AI4TS
74
119
0
06 Feb 2024
Modeling Spatio-temporal Dynamical Systems with Neural Discrete Learning and Levels-of-Experts
Kun Wang
Hao Wu
Guibin Zhang
Sihang Li
Yuxuan Liang
Yuankai Wu
Roger Zimmermann
Yang Wang
32
9
0
06 Feb 2024
Professional Agents -- Evolving Large Language Models into Autonomous Experts with Human-Level Competencies
Zhixuan Chu
Yan Wang
Feng Zhu
Lu Yu
Longfei Li
Jinjie Gu
LLMAG
30
8
0
06 Feb 2024
Revisiting the Dataset Bias Problem from a Statistical Perspective
Kien Do
D. Nguyen
Hung Le
T. Le
Dang Nguyen
Haripriya Harikumar
T. Tran
Santu Rana
Svetha Venkatesh
34
0
0
05 Feb 2024
Denoising Diffusion via Image-Based Rendering
Titas Anciukevicius
Fabian Manhardt
Federico Tombari
Paul Henderson
67
11
0
05 Feb 2024
InstanceDiffusion: Instance-level Control for Image Generation
Xudong Wang
Trevor Darrell
Sai Saketh Rambhatla
Rohit Girdhar
Ishan Misra
VLM
DiffM
39
91
0
05 Feb 2024
ISPA: Inter-Species Phonetic Alphabet for Transcribing Animal Sounds
Masato Hagiwara
Marius Miron
Jen-Yu Liu
39
1
0
05 Feb 2024
Minimum Description Length and Generalization Guarantees for Representation Learning
Romain Chor
Abdellatif Zaidi
Piotr Krasnowski
81
8
0
05 Feb 2024
Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization
Yang Jin
Zhicheng Sun
Kun Xu
Kun Xu
Liwei Chen
...
Yuliang Liu
Di Zhang
Yang Song
Kun Gai
Yadong Mu
VGen
55
44
0
05 Feb 2024
Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives
Sheng Luo
Wei Chen
Wanxin Tian
Rui Liu
Luanxuan Hou
...
Ling Shao
Yi Yang
Bojun Gao
Qun Li
Guobin Wu
81
14
0
05 Feb 2024
Trinity: Syncretizing Multi-/Long-tail/Long-term Interests All in One
Jing Yan
Liu Jiang
Jianfei Cui
Zhichen Zhao
Xingyan Bin
Feng Zhang
Zuotao Liu
17
2
0
05 Feb 2024
Focal Modulation Networks for Interpretable Sound Classification
Luca Della Libera
Cem Subakan
Mirco Ravanelli
41
2
0
05 Feb 2024
Can Large Language Models Learn Independent Causal Mechanisms?
Gaël Gendron
Bao Trung Nguyen
A. Peng
Michael Witbrock
Gillian Dobbie
LRM
43
4
0
04 Feb 2024
FoldToken: Learning Protein Language via Vector Quantization and Beyond
Zhangyang Gao
Cheng Tan
Jue Wang
Yufei Huang
Lirong Wu
Stan Z. Li
35
9
0
04 Feb 2024
MixedNUTS: Training-Free Accuracy-Robustness Balance via Nonlinearly Mixed Classifiers
Yatong Bai
Mo Zhou
Vishal M. Patel
Somayeh Sojoudi
AAML
39
8
0
03 Feb 2024
Position: Graph Foundation Models are Already Here
Haitao Mao
Zhikai Chen
Wenzhuo Tang
Jianan Zhao
Yao Ma
Tong Zhao
Neil Shah
Mikhail Galkin
Jiliang Tang
AI4CE
69
30
0
03 Feb 2024
S-NeRF++: Autonomous Driving Simulation via Neural Reconstruction and Generation
Yurui Chen
Jing Zhang
Ziyang Xie
Wenye Li
Feihu Zhang
Jiachen Lu
Li Zhang
65
12
0
03 Feb 2024
Spiking Music: Audio Compression with Event Based Auto-encoders
Martim Lisboa
Guillaume Bellec
53
2
0
02 Feb 2024
Can MLLMs Perform Text-to-Image In-Context Learning?
Yuchen Zeng
Wonjun Kang
Yicong Chen
Hyung Il Koo
Kangwook Lee
MLLM
41
11
0
02 Feb 2024
An Intra-BRNN and GB-RVQ Based END-TO-END Neural Audio Codec
Linping Xu
Jiawei Jiang
Dejun Zhang
Xianjun Xia
Li Chen
Yijian Xiao
Piao Ding
Shenyi Song
Sixing Yin
Ferdous Sohel
42
6
0
02 Feb 2024
Neural Language of Thought Models
Yi-Fu Wu
Minseung Lee
Sungjin Ahn
MLLM
VLM
85
6
0
02 Feb 2024
Large Language Models for Time Series: A Survey
Xiyuan Zhang
Ranak Roy Chowdhury
Rajesh K. Gupta
Jingbo Shang
AI4TS
90
55
0
02 Feb 2024
A Survey for Foundation Models in Autonomous Driving
Haoxiang Gao
Yaqian Li
Kaiwen Long
Ming Yang
Yiqing Shen
VLM
LRM
58
29
0
02 Feb 2024
IMUGPT 2.0: Language-Based Cross Modality Transfer for Sensor-Based Human Activity Recognition
Zi-Jian Leng
Amitrajit Bhattacharjee
Hrudhai Rajasekhar
Lizhe Zhang
Elizabeth Bruda
Hyeokhyen Kwon
Thomas Plötz
VLM
59
13
0
01 Feb 2024
Improving Semantic Control in Discrete Latent Spaces with Transformer Quantized Variational Autoencoders
Yingji Zhang
Danilo S. Carvalho
Marco Valentino
Ian Pratt-Hartmann
André Freitas
DRL
61
5
0
01 Feb 2024
Dynamic Texture Transfer using PatchMatch and Transformers
Guo Pu
Shiyao Xu
Xixin Cao
Zhouhui Lian
49
2
0
01 Feb 2024
AEROBLADE: Training-Free Detection of Latent Diffusion Images Using Autoencoder Reconstruction Error
Jonas Ricker
Denis Lukovnikov
Asja Fischer
72
32
0
31 Jan 2024
Robustly overfitting latents for flexible neural image compression
Yura Perugachi-Diaz
Arwin Gansekoele
Sandjai Bhulai
51
1
0
31 Jan 2024
StrokeNUWA: Tokenizing Strokes for Vector Graphic Synthesis
Zecheng Tang
Chenfei Wu
Zekai Zhang
Mingheng Ni
Sheng-Siang Yin
...
Zhengyuan Yang
Lijuan Wang
Zicheng Liu
Juntao Li
Nan Duan
32
10
0
30 Jan 2024
Effective Communication with Dynamic Feature Compression
Pietro Talli
Francesco Pase
Federico Chiariotti
Andrea Zanella
Michele Zorzi
53
3
0
29 Jan 2024
Image-Text Out-Of-Context Detection Using Synthetic Multimodal Misinformation
Fatma Shalabi
H. Nguyen
Hichem Felouat
Ching-Chun Chang
Isao Echizen
54
5
0
29 Jan 2024
Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors
Shiyin Dong
Mingrui Zhu
Kun Cheng
Nannan Wang
Xinbo Gao
DiffM
30
3
0
29 Jan 2024
Diffusion Facial Forgery Detection
Harry Cheng
Yangyang Guo
Tianyi Wang
L. Nie
Mohan S. Kankanhalli
74
18
0
29 Jan 2024
Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance
Qingcheng Zhao
Pengyu Long
Qixuan Zhang
Dafei Qin
Hanming Liang
Longwen Zhang
Yingliang Zhang
Jingyi Yu
Lan Xu
DiffM
3DH
38
25
0
28 Jan 2024
Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach
Shaofeng Zhang
Jinfa Huang
Qiang-feng Zhou
Zhibin Wang
Fan Wang
Jiebo Luo
Junchi Yan
DiffM
58
12
0
28 Jan 2024
A Survey on Neural Topic Models: Methods, Applications, and Challenges
Xiaobao Wu
Thong Nguyen
Anh Tuan Luu
BDL
36
36
0
27 Jan 2024
Annotated Hands for Generative Models
Yue Yang
Atith N Gandhi
Greg Turk
DiffM
GAN
31
3
0
26 Jan 2024
Residual Quantization with Implicit Neural Codebooks
Iris A. M. Huijben
Matthijs Douze
Matthew Muckley
Ruud J. G. van Sloun
Jakob Verbeek
MQ
34
11
0
26 Jan 2024
Deep Joint Source-Channel Coding for Efficient and Reliable Cross-Technology Communication
Shumin Yao
Xiaodong Xu
Hao Chen
Yaping Sun
Qinglin Zhao
33
1
0
26 Jan 2024
Within-basket Recommendation via Neural Pattern Associator
Kai Luo
Tianshu Shen
Lan Yao
Ga Wu
A. Liblong
Istvan Fehervari
Ruijian An
Jawad Ahmed
Harshit Mishra
Charu Pujari
AI4TS
32
2
0
25 Jan 2024
Deconstructing Denoising Diffusion Models for Self-Supervised Learning
Xinlei Chen
Zhuang Liu
Saining Xie
Kaiming He
DiffM
43
54
0
25 Jan 2024
Improving Antibody Humanness Prediction using Patent Data
Talip Uçar
Aubin Ramon
Dino Oglic
Rebecca Croasdale-wood
Tom Diethe
Pietro Sormanni
38
1
0
25 Jan 2024
Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild
Fanghua Yu
Jinjin Gu
Zheyuan Li
Jinfan Hu
Xiangtao Kong
Xintao Wang
Jingwen He
Yu Qiao
Chao Dong
47
134
0
24 Jan 2024
Masked Particle Modeling on Sets: Towards Self-Supervised High Energy Physics Foundation Models
T. Golling
Lukas Heinrich
Michael Kagan
Samuel Klein
Matthew Leigh
Margarita Osadchy
J. A. Raine
47
24
0
24 Jan 2024
Generative Human Motion Stylization in Latent Space
Chuan Guo
Yuxuan Mu
Wei Ji
Peng Dai
Youliang Yan
Juwei Lu
Li Cheng
VGen
43
11
0
24 Jan 2024
PA-SAM: Prompt Adapter SAM for High-Quality Image Segmentation
Zhaozhi Xie
Bochen Guan
Weihao Jiang
Muyang Yi
Yue Ding
Hongtao Lu
Lei Zhang
VLM
46
13
0
23 Jan 2024
CloSe: A 3D Clothing Segmentation Dataset and Model
Dimitrije Antic
Garvita Tiwari
Batuhan Ozcomlekci
R. Marin
Gerard Pons-Moll
3DPC
3DH
22
9
0
22 Jan 2024
Empowering Communication: Speech Technology for Indian and Western Accents through AI-powered Speech Synthesis
R. Vinotha
D. Hepsiba
L. D. V. Anand
Deepak John Reji
13
1
0
22 Jan 2024
Text-to-Image Cross-Modal Generation: A Systematic Review
Maciej Żelaszczyk
Jacek Mańdziuk
60
4
0
21 Jan 2024