1412.6980
Cited By
Adam: A Method for Stochastic Optimization
22 December 2014
Diederik P. Kingma
Jimmy Ba
ODL
Papers citing
"Adam: A Method for Stochastic Optimization"
50 / 1,154 papers shown
Title
TRADE: Transfer of Distributions between External Conditions with Normalizing Flows
Stefan Wahl
Armand Rousselot
Felix Dräxler
Ullrich Köthe
160
1
0
25 Oct 2024
Bio2Token: All-atom tokenization of any biomolecular structure with Mamba
Andrew Liu
Axel Elaldi
Nathan Russell
Olivia Viessmann
Mamba
96
3
0
24 Oct 2024
Monge-Ampere Regularization for Learning Arbitrary Shapes from Point Clouds
Chuanxiang Yang
Yuanfeng Zhou
Guangshun Wei
Long Ma
Junhui Hou
Yuan Liu
Wenping Wang
82
0
0
24 Oct 2024
Unified Microphone Conversion: Many-to-Many Device Mapping via Feature-wise Linear Modulation
Myeonghoon Ryu
Hongseok Oh
Suji Lee
Han Park
54
0
0
23 Oct 2024
VoiceTextBlender: Augmenting Large Language Models with Speech Capabilities via Single-Stage Joint Speech-Text Supervised Fine-Tuning
Yifan Peng
Krishna Puvvada
Zhehuai Chen
Piotr Żelasko
He Huang
Kunal Dhawan
Ke Hu
Shinji Watanabe
Jagadeesh Balam
Boris Ginsburg
123
5
0
23 Oct 2024
Estimating the Spectral Moments of the Kernel Integral Operator from Finite Sample Matrices
Chanwoo Chun
SueYeon Chung
Daniel D. Lee
54
1
0
23 Oct 2024
MotionGlot: A Multi-Embodied Motion Generation Model
Sudarshan Harithas
Srinath Sridhar
146
2
0
22 Oct 2024
Graph Sampling for Scalable and Expressive Graph Neural Networks on Homophilic Graphs
Haolin Li
Luana Ruiz
73
0
0
22 Oct 2024
Science Out of Its Ivory Tower: Improving Accessibility with Reinforcement Learning
Haining Wang
Jason Clark
Hannah McKelvey
Leila Sterman
Zheng Gao
Zuoyu Tian
Sandra Kübler
Xiaozhong Liu
91
1
0
22 Oct 2024
EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting
Bohao Liao
Wei-dong Zhai
Zengyu Wan
Tianzhu Zhang
Wenfei Yang
Yang Cao
Zheng-Jun Zha
3DGS
276
4
0
20 Oct 2024
Offline-to-online Reinforcement Learning for Image-based Grasping with Scarce Demonstrations
Bryan Chan
Anson Leung
James Bergstra
OffRL
OnRL
118
0
0
19 Oct 2024
Predictive variational inference: Learn the predictively optimal posterior distribution
Jinlin Lai
Yuling Yao
BDL
73
0
0
18 Oct 2024
Improving Vision Transformers by Overlapping Heads in Multi-Head Self-Attention
Tianxiao Zhang
Bo Luo
G. Wang
ViT
76
1
0
18 Oct 2024
In-context learning and Occam's razor
Eric Elmoznino
Tom Marty
Tejas Kasetty
Léo Gagnon
Sarthak Mittal
Mahan Fathi
Dhanya Sridhar
Guillaume Lajoie
119
1
0
17 Oct 2024
AutoAL: Automated Active Learning with Differentiable Query Strategy Search
Yifeng Wang
Xueying Zhan
Siyu Huang
OOD
107
0
0
17 Oct 2024
Text-Guided Multi-Property Molecular Optimization with a Diffusion Language Model
Yida Xiong
Kun Li
Jiameng Chen
Hongzhi Zhang
Di Lin
Shirui Pan
Wenbin Hu
87
3
0
17 Oct 2024
PiLocNet: Physics-informed neural network on 3D localization with rotating point spread function
Mingda Lu
Zitian Ao
Chao Wang
S. Prasad
Raymond H. F. Chan
3DPC
24
0
0
17 Oct 2024
Inductive Gradient Adjustment For Spectral Bias In Implicit Neural Representations
Kexuan Shi
Hai Chen
Leheng Zhang
Shuhang Gu
84
1
0
17 Oct 2024
A Data-driven Contact Estimation Method for Wheeled-Biped Robots
Ü. Bora Gökbakan
Frederike Dümbgen
Stéphane Caron
74
0
0
16 Oct 2024
Bayesian Experimental Design via Contrastive Diffusions
Jacopo Iollo
Christophe Heinkelé
Pierre Alliez
Florence Forbes
109
0
0
15 Oct 2024
High-Resolution Frame Interpolation with Patch-based Cascaded Diffusion
Junhwa Hur
Charles Herrmann
Saurabh Saxena
Janne Kontkanen
Wei-Sheng Lai
Yichang Shih
Michael Rubinstein
David J. Fleet
Deqing Sun
106
2
0
15 Oct 2024
Spatio-Temporal Distortion Aware Omnidirectional Video Super-Resolution
Hongyu An
Xinfeng Zhang
Shijie Zhao
Ruiqin Xiong
SupR
419
1
0
15 Oct 2024
Use Random Selection for Now: Investigation of Few-Shot Selection Strategies in LLM-based Text Augmentation for Classification
Ján Cegin
Branislav Pecher
Jakub Simko
Ivan Srba
Maria Bielikova
Peter Brusilovsky
31
0
0
14 Oct 2024
The Epochal Sawtooth Phenomenon: Unveiling Training Loss Oscillations in Adam and Other Optimizers
Qi Liu
Wanjing Ma
67
0
0
14 Oct 2024
Diversity-Aware Reinforcement Learning for de novo Drug Design
Hampus Gummesson Svensson
C. Tyrchan
Ola Engkvist
M. Chehreghani
56
2
0
14 Oct 2024
Emulators for stellar profiles in binary population modeling
Elizabeth Teng
Ugur Demir
Zoheyr Doctor
Philipp M. Srivastava
Shamal Lalvani
...
Matthias U. Kruckow
K. A. Rocha
Meng Sun
Zepei Xing
E. Zapartas
81
0
0
14 Oct 2024
SAPIENT: Mastering Multi-turn Conversational Recommendation with Strategic Planning and Monte Carlo Tree Search
Hanwen Du
Bo Peng
Xia Ning
71
0
0
12 Oct 2024
Yesterday's News: Benchmarking Multi-Dimensional Out-of-Distribution Generalization of Misinformation Detection Models
Ivo Verhoeven
Pushkar Mishra
Ekaterina Shutova
84
1
0
12 Oct 2024
radarODE-MTL: A Multi-Task Learning Framework with Eccentric Gradient Alignment for Robust Radar-Based ECG Reconstruction
Yanmei Zhang
Rui Yang
Yutao Yue
Eng Gee Lim
92
1
0
11 Oct 2024
IGNN-Solver: A Graph Neural Solver for Implicit Graph Neural Networks
Junchao Lin
Zenan Ling
Zhanbo Feng
Feng Zhou
Jingwen Xu
Tianqi Hou
Zhenyu Liao
Robert C. Qiu
GNN
AI4CE
153
0
0
11 Oct 2024
Scaling Laws for Predicting Downstream Performance in LLMs
Yangyi Chen
Binxuan Huang
Yifan Gao
Zhengyang Wang
Jingfeng Yang
Heng Ji
LRM
102
12
0
11 Oct 2024
Neuroplastic Expansion in Deep Reinforcement Learning
Jiashun Liu
J. Obando-Ceron
Rameswar Panda
L. Pan
87
5
0
10 Oct 2024
Deep Generative Quantile Bayes
Jungeum Kim
Percy S. Zhai
Veronika Rockova
180
0
0
10 Oct 2024
Phase Diagram from Nonlinear Interaction between Superconducting Order and Density: Toward Data-Based Holographic Superconductor
Sejin Kim
Kyung Kiu Kim
Yunseok Seo
43
0
0
09 Oct 2024
Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
Sihyun Yu
Sangkyung Kwak
Huiwon Jang
Jongheon Jeong
Jonathan Huang
Jinwoo Shin
Saining Xie
OCL
151
98
0
09 Oct 2024
Estimating the Number of HTTP/3 Responses in QUIC Using Deep Learning
Barak Gahtan
Robert J. Shahla
R. Cohen
A. Bronstein
80
0
0
08 Oct 2024
A Parameter Update Balancing Algorithm for Multi-task Ranking Models in Recommendation Systems
Jun Yuan
Guohao Cai
Zhenhua Dong
179
0
0
08 Oct 2024
Adaptive Random Fourier Features Training Stabilized By Resampling With Applications in Image Regression
Aku Kammonen
Anamika Pandey
E. von Schwerin
Raúl Tempone
60
0
0
08 Oct 2024
Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning
Claire Chen
Shuze Liu
Shangtong Zhang
OffRL
360
1
0
08 Oct 2024
Amortized Control of Continuous State Space Feynman-Kac Model for Irregular Time Series
Byoungwoo Park
Hyungi Lee
Juho Lee
AI4TS
154
1
0
08 Oct 2024
From Sparse Dependence to Sparse Attention: Unveiling How Chain-of-Thought Enhances Transformer Sample Efficiency
Kaiyue Wen
Huaqing Zhang
Hongzhou Lin
Jingzhao Zhang
MoE
LRM
135
7
0
07 Oct 2024
Fast Training of Sinusoidal Neural Fields via Scaling Initialization
Taesun Yeom
Sangyoon Lee
Jaeho Lee
96
3
0
07 Oct 2024
Automated Detection of Defects on Metal Surfaces using Vision Transformers
Toqa Alaa
Mostafa Kotb
Arwa Zakaria
Mariam Diab
Walid Gomaa
144
1
0
06 Oct 2024
Upsample or Upweight? Balanced Training on Heavily Imbalanced Datasets
Tianjian Li
Haoran Xu
Weiting Tan
Kenton Murray
Daniel Khashabi
91
1
0
06 Oct 2024
Learning Object Properties Using Robot Proprioception via Differentiable Robot-Object Interaction
Julius Berner
Chao Liu
Pingchuan Ma
John Eastman
Daniela Rus
Dylan Randle
Yuri Ivanov
Wojciech Matusik
79
0
0
04 Oct 2024
Error Correction Code Transformer: From Non-Unified to Unified
Yongli Yan
Jieao Zhu
Tianyue Zheng
Jiaqi He
Linglong Dai
53
1
0
04 Oct 2024
Towards Universal Certified Robustness with Multi-Norm Training
Enyi Jiang
Gagandeep Singh
AAML
132
1
0
03 Oct 2024
ControlAR: Controllable Image Generation with Autoregressive Models
Zongming Li
Tianheng Cheng
Shoufa Chen
Peize Sun
Haocheng Shen
Longjin Ran
Xiaoxin Chen
Wenyu Liu
Xinggang Wang
DiffM
200
19
0
03 Oct 2024
Doubly Optimal Policy Evaluation for Reinforcement Learning
Shuze Liu
Claire Chen
Shangtong Zhang
OffRL
177
3
0
03 Oct 2024
OmniSR: Shadow Removal under Direct and Indirect Lighting
Jiamin Xu
Zelong Li
Yuxin Zheng
Chenyu Huang
Renshu Gu
Weiwei Xu
Gang Xu
3DV
135
2
0
02 Oct 2024