ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.00937
  4. Cited By
Neural Discrete Representation Learning

Neural Discrete Representation Learning

2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
    BDL
    SSL
    OCL
ArXivPDFHTML

Papers citing "Neural Discrete Representation Learning"

50 / 2,826 papers shown
Title
ProtoP-OD: Explainable Object Detection with Prototypical Parts
ProtoP-OD: Explainable Object Detection with Prototypical Parts
Pavlos Rath-Manakidis
Frederik Strothmann
Tobias Glasmachers
Laurenz Wiskott
ViT
45
1
0
29 Feb 2024
Uncertainty-Based Extensible Codebook for Discrete Federated Learning in
  Heterogeneous Data Silos
Uncertainty-Based Extensible Codebook for Discrete Federated Learning in Heterogeneous Data Silos
Tianyi Zhang
Yu Cao
Dianbo Liu
FedML
29
0
0
29 Feb 2024
Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming
Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming
Hany Hamed
Subin Kim
Dongyeong Kim
Jaesik Yoon
Sungjin Ahn
64
4
0
29 Feb 2024
Generalizability Under Sensor Failure: Tokenization + Transformers
  Enable More Robust Latent Spaces
Generalizability Under Sensor Failure: Tokenization + Transformers Enable More Robust Latent Spaces
Geeling Chau
Yujin An
Ahamed Raffey Iqbal
Soon-Jo Chung
Yisong Yue
Sabera Talukder
OOD
51
4
0
28 Feb 2024
Latent Neural PDE Solver: a reduced-order modelling framework for partial differential equations
Latent Neural PDE Solver: a reduced-order modelling framework for partial differential equations
Zijie Li
Saurabh Patil
Francis Ogoke
Dule Shu
Wilson Zhen
Michael Schneier
John R. Buchanan
A. Farimani
AI4CE
45
5
0
27 Feb 2024
Rethinking Mutual Information for Language Conditioned Skill Discovery
  on Imitation Learning
Rethinking Mutual Information for Language Conditioned Skill Discovery on Imitation Learning
Zhaoxun Ju
Chao Yang
Hongbo Wang
Yu Qiao
Gang Hua
LM&Ro
54
3
0
27 Feb 2024
BiVRec: Bidirectional View-based Multimodal Sequential Recommendation
BiVRec: Bidirectional View-based Multimodal Sequential Recommendation
Jiaxi Hu
Jingtong Gao
Xiangyu Zhao
Yuehong Hu
Yuxuan Liang
Yiqi Wang
Ming He
Zitao Liu
Hongzhi Yin
HAI
63
1
0
27 Feb 2024
Inpainting Computational Fluid Dynamics with Deep Learning
Inpainting Computational Fluid Dynamics with Deep Learning
Dule Shu
Wilson Zhen
Zijie Li
A. Farimani
AI4CE
44
0
0
27 Feb 2024
Sora: A Review on Background, Technology, Limitations, and Opportunities
  of Large Vision Models
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
Yixin Liu
Kai Zhang
Yuan Li
Zhiling Yan
Chujie Gao
...
Yue Huang
Hanchi Sun
Jianfeng Gao
Lifang He
Lichao Sun
VLM
VGen
EGVM
84
267
0
27 Feb 2024
Video as the New Language for Real-World Decision Making
Video as the New Language for Real-World Decision Making
Sherry Yang
Jacob Walker
Jack Parker-Holder
Yilun Du
Jake Bruce
Andre Barreto
Pieter Abbeel
Dale Schuurmans
VGen
44
47
0
27 Feb 2024
Diffusion Model-Based Image Editing: A Survey
Diffusion Model-Based Image Editing: A Survey
Yi Huang
Jiancheng Huang
Yifan Liu
Mingfu Yan
Jiaxi Lv
Jianzhuang Liu
Wei Xiong
He Zhang
Liangliang Cao
Liangliang Cao
EGVM
79
89
0
27 Feb 2024
Self-Supervised Speech Quality Estimation and Enhancement Using Only
  Clean Speech
Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech
Szu-Wei Fu
Kuo-Hsuan Hung
Yu Tsao
Yu-Chiang Frank Wang
SSL
45
11
0
26 Feb 2024
TMT: Tri-Modal Translation between Speech, Image, and Text by Processing
  Different Modalities as Different Languages
TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages
Minsu Kim
Jee-weon Jung
Hyeongseop Rha
Soumi Maiti
Siddhant Arora
Xuankai Chang
Shinji Watanabe
Y. Ro
51
7
0
25 Feb 2024
A Statistical Analysis of Wasserstein Autoencoders for Intrinsically
  Low-dimensional Data
A Statistical Analysis of Wasserstein Autoencoders for Intrinsically Low-dimensional Data
Saptarshi Chakraborty
Peter L. Bartlett
46
1
0
24 Feb 2024
Genie: Generative Interactive Environments
Genie: Generative Interactive Environments
Jake Bruce
Michael Dennis
Ashley D. Edwards
Jack Parker-Holder
Yuge Shi
...
Konrad Zolna
Jeff Clune
Nando de Freitas
Satinder Singh
Tim Rocktaschel
VGen
VLM
74
151
0
23 Feb 2024
Constraint Latent Space Matters: An Anti-anomalous Waveform
  Transformation Solution from Photoplethysmography to Arterial Blood Pressure
Constraint Latent Space Matters: An Anti-anomalous Waveform Transformation Solution from Photoplethysmography to Arterial Blood Pressure
Cheng Bian
Xiaoyu Li
Qi Bi
Guangpu Zhu
Jiegeng Lyu
Weile Zhang
Yelei Li
Zijing Zeng
48
0
0
23 Feb 2024
MAPE-PPI: Towards Effective and Efficient Protein-Protein Interaction
  Prediction via Microenvironment-Aware Protein Embedding
MAPE-PPI: Towards Effective and Efficient Protein-Protein Interaction Prediction via Microenvironment-Aware Protein Embedding
Lirong Wu
Yijun Tian
Yufei Huang
Siyuan Li
Haitao Lin
Nitesh Chawla
Stan Z. Li
39
22
0
22 Feb 2024
The Effect of Batch Size on Contrastive Self-Supervised Speech
  Representation Learning
The Effect of Batch Size on Contrastive Self-Supervised Speech Representation Learning
Nik Vaessen
David A. van Leeuwen
49
3
0
21 Feb 2024
Generative AI for Secure Physical Layer Communications: A Survey
Generative AI for Secure Physical Layer Communications: A Survey
Changyuan Zhao
Hongyang Du
Dusit Niyato
Jiawen Kang
Zehui Xiong
Dong In Kim
Xuemin
X. Shen
K. B. Letaief
71
20
0
21 Feb 2024
Learning Highly Dynamic Behaviors for Quadrupedal Robots
Learning Highly Dynamic Behaviors for Quadrupedal Robots
Chong Zhang
Jiapeng Sheng
Tingguang Li
He Zhang
Cheng Zhou
Qing Zhu
Rui Zhao
Yizheng Zhang
Lei Han
39
5
0
21 Feb 2024
Unsupervised Concept Discovery Mitigates Spurious Correlations
Unsupervised Concept Discovery Mitigates Spurious Correlations
Md Rifat Arefin
Yan Zhang
A. Baratin
Francesco Locatello
Irina Rish
Dianbo Liu
Kenji Kawaguchi
61
6
0
20 Feb 2024
Skill or Luck? Return Decomposition via Advantage Functions
Skill or Luck? Return Decomposition via Advantage Functions
Hsiao-Ru Pan
Bernhard Schölkopf
OffRL
30
3
0
20 Feb 2024
Two-stage Rainfall-Forecasting Diffusion Model
Two-stage Rainfall-Forecasting Diffusion Model
Xudong Ling
Chaorong Li
Fengqing Qin
Lihong Zhu
Yuanyuan Huang
DiffM
27
5
0
20 Feb 2024
The Revolution of Multimodal Large Language Models: A Survey
The Revolution of Multimodal Large Language Models: A Survey
Davide Caffagni
Federico Cocchi
Luca Barsellotti
Nicholas Moratelli
Sara Sarto
Lorenzo Baraldi
Lorenzo Baraldi
Marcella Cornia
Rita Cucchiara
LRM
VLM
66
44
0
19 Feb 2024
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
Jun Zhan
Junqi Dai
Jiasheng Ye
Yunhua Zhou
Dong Zhang
...
Jie Fu
Tao Gui
Tianxiang Sun
Yugang Jiang
Xipeng Qiu
MLLM
37
124
0
19 Feb 2024
Pushing Auto-regressive Models for 3D Shape Generation at Capacity and
  Scalability
Pushing Auto-regressive Models for 3D Shape Generation at Capacity and Scalability
Xue-Qing Qian
Yu Wang
Simian Luo
Yinda Zhang
Ying Tai
...
Xiangyang Xue
Bo Zhao
Tiejun Huang
Yunsheng Wu
Yanwei Fu
38
6
0
19 Feb 2024
Integrating Pre-Trained Language Model with Physical Layer
  Communications
Integrating Pre-Trained Language Model with Physical Layer Communications
Ju-Hyung Lee
Dong-Ho Lee
Joohan Lee
Jay Pujara
42
4
0
18 Feb 2024
SDiT: Spiking Diffusion Model with Transformer
SDiT: Spiking Diffusion Model with Transformer
Shu Yang
Hanzhi Ma
Chengting Yu
Aili Wang
Er-ping Li
DiffM
43
4
0
18 Feb 2024
CoLLaVO: Crayon Large Language and Vision mOdel
CoLLaVO: Crayon Large Language and Vision mOdel
Byung-Kwan Lee
Beomchan Park
Chae Won Kim
Yonghyun Ro
VLM
MLLM
53
16
0
17 Feb 2024
Generative Cross-Modal Retrieval: Memorizing Images in Multimodal
  Language Models for Retrieval and Beyond
Generative Cross-Modal Retrieval: Memorizing Images in Multimodal Language Models for Retrieval and Beyond
Chak Tou Leong
Wenjie Wang
Leigang Qu
Liqiang Nie
Wenjie Li
Tat-Seng Chua
34
19
0
16 Feb 2024
Symbolic Autoencoding for Self-Supervised Sequence Learning
Symbolic Autoencoding for Self-Supervised Sequence Learning
Mohammad Hossein Amani
Nicolas Mario Baldwin
Amin Mansouri
Martin Josifoski
Maxime Peyrard
Robert West
31
1
0
16 Feb 2024
APCodec: A Neural Audio Codec with Parallel Amplitude and Phase Spectrum
  Encoding and Decoding
APCodec: A Neural Audio Codec with Parallel Amplitude and Phase Spectrum Encoding and Decoding
Yang Ai
Xiao-Hang Jiang
Ye-Xin Lu
Hui-Peng Du
Zhenhua Ling
31
21
0
16 Feb 2024
Privacy for Fairness: Information Obfuscation for Fair Representation
  Learning with Local Differential Privacy
Privacy for Fairness: Information Obfuscation for Fair Representation Learning with Local Differential Privacy
Songjie Xie
Youlong Wu
Jiaxuan Li
Ming Ding
Khaled B. Letaief
AAML
47
1
0
16 Feb 2024
PRISE: LLM-Style Sequence Compression for Learning Temporal Action
  Abstractions in Control
PRISE: LLM-Style Sequence Compression for Learning Temporal Action Abstractions in Control
Ruijie Zheng
Ching-An Cheng
Hal Daumé
Furong Huang
Andrey Kolobov
38
9
0
16 Feb 2024
Seed Optimization with Frozen Generator for Superior Zero-shot Low-light
  Enhancement
Seed Optimization with Frozen Generator for Superior Zero-shot Low-light Enhancement
Yuxuan Gu
Yi Jin
Ben Wang
Zhixiang Wei
Xiaoxiao Ma
Pengyang Ling
Haoxuan Wang
H. Chen
Enhong Chen
32
0
0
15 Feb 2024
Arrange, Inpaint, and Refine: Steerable Long-term Music Audio Generation
  and Editing via Content-based Controls
Arrange, Inpaint, and Refine: Steerable Long-term Music Audio Generation and Editing via Content-based Controls
Liwei Lin
Gus Xia
Yixiao Zhang
Junyan Jiang
32
12
0
14 Feb 2024
One-shot Imitation in a Non-Stationary Environment via Multi-Modal Skill
One-shot Imitation in a Non-Stationary Environment via Multi-Modal Skill
Sangwoo Shin
Daehee Lee
Minjong Yoo
Woo Kyung Kim
Honguk Woo
41
10
0
13 Feb 2024
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model
  on 100K hours of data
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data
Mateusz Lajszczak
Guillermo Cámbara
Yang Li
Fatih Beyhan
Arent van Korlaar
...
Bartosz Putrycz
Soledad López Gambino
Kayeon Yoo
Elena Sokolova
Thomas Drugman
LM&MA
43
75
0
12 Feb 2024
SemTra: A Semantic Skill Translator for Cross-Domain Zero-Shot Policy
  Adaptation
SemTra: A Semantic Skill Translator for Cross-Domain Zero-Shot Policy Adaptation
Sangwoo Shin
Minjong Yoo
Jeongwoo Lee
Honguk Woo
56
4
0
12 Feb 2024
SpeechCLIP+: Self-supervised multi-task representation learning for
  speech via CLIP and speech-image data
SpeechCLIP+: Self-supervised multi-task representation learning for speech via CLIP and speech-image data
Hsuan-Fu Wang
Yi-Jen Shih
Heng-Jui Chang
Layne Berry
Puyuan Peng
Hung-yi Lee
Hsin-Min Wang
David Harwath
VLM
56
2
0
10 Feb 2024
Inducing Systematicity in Transformers by Attending to Structurally
  Quantized Embeddings
Inducing Systematicity in Transformers by Attending to Structurally Quantized Embeddings
Yichen Jiang
Xiang Zhou
Mohit Bansal
44
1
0
09 Feb 2024
Sparse-VQ Transformer: An FFN-Free Framework with Vector Quantization
  for Enhanced Time Series Forecasting
Sparse-VQ Transformer: An FFN-Free Framework with Vector Quantization for Enhanced Time Series Forecasting
Yanjun Zhao
Tian Zhou
Chao Chen
Liang Sun
Yi Qian
Rong Jin
AI4TS
37
2
0
08 Feb 2024
FusionSF: Fuse Heterogeneous Modalities in a Vector Quantized Framework
  for Robust Solar Power Forecasting
FusionSF: Fuse Heterogeneous Modalities in a Vector Quantized Framework for Robust Solar Power Forecasting
Ziqing Ma
Wen-wu Wang
Tian Zhou
Chao Chen
Bingqing Peng
Liang Sun
Rong Jin
38
2
0
08 Feb 2024
SpiRit-LM: Interleaved Spoken and Written Language Model
SpiRit-LM: Interleaved Spoken and Written Language Model
Tu Nguyen
Benjamin Muller
Bokai Yu
Marta R. Costa-jussá
Maha Elbayad
...
Itai Gat
Gabriel Synnaeve
Juan Pino
Benoît Sagot
Emmanuel Dupoux
AuLLM
VLM
66
37
0
08 Feb 2024
Improving Token-Based World Models with Parallel Observation Prediction
Improving Token-Based World Models with Parallel Observation Prediction
Lior Cohen
Kaixin Wang
Bingyi Kang
Shie Mannor
57
4
0
08 Feb 2024
Compression of Structured Data with Autoencoders: Provable Benefit of
  Nonlinearities and Depth
Compression of Structured Data with Autoencoders: Provable Benefit of Nonlinearities and Depth
Kevin Kögler
Aleksandr Shevchenko
Hamed Hassani
Marco Mondelli
MLT
48
0
0
07 Feb 2024
Bidirectional Autoregressive Diffusion Model for Dance Generation
Bidirectional Autoregressive Diffusion Model for Dance Generation
Canyu Zhang
Youbao Tang
Ning Zhang
Ruei-Sung Lin
Mei Han
Jing Xiao
Song Wang
38
8
0
06 Feb 2024
MOMENT: A Family of Open Time-series Foundation Models
MOMENT: A Family of Open Time-series Foundation Models
Mononito Goswami
Konrad Szafer
Arjun Choudhry
Yifu Cai
Shuo Li
Artur Dubrawski
AIFin
AI4TS
74
119
0
06 Feb 2024
Modeling Spatio-temporal Dynamical Systems with Neural Discrete Learning
  and Levels-of-Experts
Modeling Spatio-temporal Dynamical Systems with Neural Discrete Learning and Levels-of-Experts
Kun Wang
Hao Wu
Guibin Zhang
Sihang Li
Yuxuan Liang
Yuankai Wu
Roger Zimmermann
Yang Wang
32
9
0
06 Feb 2024
Professional Agents -- Evolving Large Language Models into Autonomous
  Experts with Human-Level Competencies
Professional Agents -- Evolving Large Language Models into Autonomous Experts with Human-Level Competencies
Zhixuan Chu
Yan Wang
Feng Zhu
Lu Yu
Longfei Li
Jinjie Gu
LLMAG
30
8
0
06 Feb 2024
Previous
123...232425...555657
Next