Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1312.5602
Cited By
Playing Atari with Deep Reinforcement Learning
19 December 2013
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Playing Atari with Deep Reinforcement Learning"
50 / 122 papers shown
Title
BOFormer: Learning to Solve Multi-Objective Bayesian Optimization via Non-Markovian RL
Yu-Heng Hung
Kai-Jie Lin
Yu-Heng Lin
Chien-Yi Wang
Cheng Sun
Ping-Chun Hsieh
18
0
0
28 May 2025
Pessimism Principle Can Be Effective: Towards a Framework for Zero-Shot Transfer Reinforcement Learning
Chi Zhang
Ziying Jia
George Atia
Sihong He
Yue Wang
29
0
0
24 May 2025
VideoGameBench: Can Vision-Language Models complete popular video games?
Alex Zhang
Thomas Griffiths
Karthik Narasimhan
Ofir Press
VLM
35
0
0
23 May 2025
Rainbow Delay Compensation: A Multi-Agent Reinforcement Learning Framework for Mitigating Delayed Observation
Songchen Fu
Siang Chen
Shaojing Zhao
Letian Bai
Ta Li
Yonghong Yan
86
0
0
06 May 2025
Cracking the Code of Action: a Generative Approach to Affordances for Reinforcement Learning
Lynn Cherif
Flemming Kondrup
David Venuto
Ankit Anand
Doina Precup
Khimya Khetarpal
LM&Ro
105
0
0
24 Apr 2025
MARFT: Multi-Agent Reinforcement Fine-Tuning
Junwei Liao
Muning Wen
Jun Wang
Weinan Zhang
OffRL
64
1
0
21 Apr 2025
Learning to Reason under Off-Policy Guidance
Jianhao Yan
Yafu Li
Zican Hu
Zhi Wang
Ganqu Cui
Xiaoye Qu
Yu Cheng
Yue Zhang
OffRL
LRM
52
8
0
21 Apr 2025
Generative Auto-Bidding with Value-Guided Explorations
Jingtong Gao
Yewen Li
Shuai Mao
Peng Jiang
Nan Jiang
...
Fei Pan
Peng Jiang
Kun Gai
Bo An
Xiangyu Zhao
OffRL
87
0
0
20 Apr 2025
Generative Framework for Personalized Persuasion: Inferring Causal, Counterfactual, and Latent Knowledge
Donghuo Zeng
Roberto Legaspi
Yuewen Sun
Xinshuai Dong
Kazushi Ikeda
Peter Spirtes
Kun Zhang
CML
52
0
0
08 Apr 2025
Deep Reinforcement Learning Algorithms for Option Hedging
Andrei Neagu
Frédéric Godin
Leila Kosseim
43
0
0
07 Apr 2025
I Can Hear You Coming: RF Sensing for Uncooperative Satellite Evasion
Cameron Mehlman
Gregory Falco
63
0
0
04 Apr 2025
CTRL-O: Language-Controllable Object-Centric Visual Representation Learning
Aniket Didolkar
Andrii Zadaianchuk
Rabiul Awal
Maximilian Seitzer
E. Gavves
Aishwarya Agrawal
OCL
VLM
128
3
0
27 Mar 2025
Design of Reward Function on Reinforcement Learning for Automated Driving
Takeru Goto
Yuki Kizumi
Shun Iwasaki
49
4
0
20 Mar 2025
Probabilistic Shielding for Safe Reinforcement Learning
Edwin Hamel-De le Court
Francesco Belardinelli
Alex W. Goodall
49
0
0
09 Mar 2025
Assessing Autonomous Inspection Regimes: Active Versus Passive Satellite Inspection
Joshua Aurand
Christopher Pang
Sina Mokhtar
Henry Lei
Steven Cutlip
S. Phillips
142
0
0
26 Feb 2025
Yes, Q-learning Helps Offline In-Context RL
Denis Tarasov
Alexander Nikulin
Ilya Zisman
Albina Klepach
Andrei Polubarov
Nikita Lyubaykin
Alexander Derevyagin
Igor Kiselev
Vladislav Kurenkov
OffRL
OnRL
340
1
0
24 Feb 2025
IGN : Implicit Generative Networks
Haozheng Luo
Tianyi Wu
Feiyu Han
Zhijun Yan
OffRL
55
1
0
24 Feb 2025
Towards a Reward-Free Reinforcement Learning Framework for Vehicle Control
Jielong Yang
Daoyuan Huang
57
0
0
21 Feb 2025
Reinforcement Learning-based Receding Horizon Control using Adaptive Control Barrier Functions for Safety-Critical Systems
Ehsan Sabouni
Hijaz Ahmad
Vittorio Giammarino
Christos G. Cassandras
I. Paschalidis
Wenchao Li
134
2
0
21 Feb 2025
Question Answering with Texts and Tables through Deep Reinforcement Learning
M. M. José
Flávio Nakasato Cação
Maria F. Ribeiro
Rafael M. Cheang
Paulo Pirozelli
Fabio Gagliardi Cozman
LMTD
RALM
196
0
0
21 Feb 2025
TeLL-Drive: Enhancing Autonomous Driving with Teacher LLM-Guided Deep Reinforcement Learning
Chengkai Xu
Jiaqi Liu
Shiyu Fang
Jian Sun
Dong Chen
Peng Hang
Jian Sun
133
1
0
21 Feb 2025
Uncertainty Representations in State-Space Layers for Deep Reinforcement Learning under Partial Observability
Carlos E. Luis
A. Bottero
Julia Vinogradska
Felix Berkenkamp
Jan Peters
124
1
0
20 Feb 2025
Learning Strategy Representation for Imitation Learning in Multi-Agent Games
Shiqi Lei
Kanghon Lee
Linjing Li
Jinkyoo Park
OffRL
65
0
0
17 Feb 2025
DECAF: Learning to be Fair in Multi-agent Resource Allocation
Ashwin Kumar
William Yeoh
101
1
0
06 Feb 2025
RLOMM: An Efficient and Robust Online Map Matching Framework with Reinforcement Learning
Minxiao Chen
Haitao Yuan
Nan Jiang
Zhihan Zheng
Sai Wu
Ao Zhou
Shuaiqiang Wang
109
0
0
05 Feb 2025
Reinforcement Teaching
Alex Lewandowski
Calarina Muslimani
Dale Schuurmans
Matthew E. Taylor
Jun Luo
112
1
0
28 Jan 2025
ABPT: Amended Backpropagation through Time with Partially Differentiable Rewards
Fanxing Li
Fangyu Sun
Tianbao Zhang
Danping Zou
48
0
0
24 Jan 2025
Provably Efficient Reinforcement Learning with Multinomial Logit Function Approximation
Long-Fei Li
Yu Zhang
Peng Zhao
Zhi Zhou
136
5
0
17 Jan 2025
Dynamic Portfolio Optimization via Augmented DDPG with Quantum Price Levels-Based Trading Strategy
Runsheng Lin
Zihan Xing
Mingze Ma
Raymond S.T. Lee
54
2
0
15 Jan 2025
CuRLA: Curriculum Learning Based Deep Reinforcement Learning for Autonomous Driving
Bhargava Uppuluri
Anjel Patel
Neil Mehta
Sridhar Kamath
Pratyush Chakraborty
69
0
0
10 Jan 2025
Integrating Multi-Modal Input Token Mixer Into Mamba-Based Decision Models: Decision MetaMamba
Wall Kim
Mamba
80
0
0
10 Jan 2025
On the role of Artificial Intelligence methods in modern force-controlled manufacturing robotic tasks
Vincenzo Petrone
Enrico Ferrentino
Pasquale Chiacchio
54
1
0
10 Jan 2025
Explainable Reinforcement Learning for Formula One Race Strategy
Devin Thomas
Junqi Jiang
Avinash Kori
Aaron Russo
Steffen Winkler
Stuart Sale
Joseph McMillan
Francesco Belardinelli
Antonio Rago
LRM
42
0
0
07 Jan 2025
ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze
Chunyu Xuan
Yazhe Niu
Yuan Pu
Shuai Hu
Yu Liu
Jing Yang
101
0
0
03 Jan 2025
Human-like Bots for Tactical Shooters Using Compute-Efficient Sensors
Niels Justesen
Maria Kaselimi
Sam Snodgrass
Miruna Vozaru
Matthew Schlegel
...
Albert Wang
Christoffer Holmgård
Georgios N. Yannakakis
S. Risi
Julian Togelius
124
0
0
03 Jan 2025
Sample-efficient Unsupervised Policy Cloning from Ensemble Self-supervised Labeled Videos
Xin Liu
Yaran Chen
Haoran Li
SSL
109
0
0
14 Dec 2024
Reward Fine-Tuning Two-Step Diffusion Models via Learning Differentiable Latent-Space Surrogate Reward
Zhiwei Jia
Yuesong Nan
Huixi Zhao
Gengdai Liu
EGVM
106
1
0
22 Nov 2024
Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC
Tyler Clark
Mark Towers
Christine Evers
Jonathon Hare
OffRL
82
1
0
06 Nov 2024
GraphXForm: Graph transformer for computer-aided molecular design
Jonathan Pirnay
Jan G. Rittig
Alexander B. Wolf
Martin Grohe
Jakob Burger
Alexander Mitsos
D. G. Grimm
AI4CE
73
1
0
03 Nov 2024
LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban Simulation
Bowen Li
Zhaoyu Li
Qiwei Du
Jinqi Luo
Wenshan Wang
...
Katia Sycara
Pradeep Kumar Ravikumar
Alexander G. Gray
X. Si
Sebastian A. Scherer
AI4CE
LRM
100
3
0
01 Nov 2024
Toward Automated Algorithm Design: A Survey and Practical Guide to Meta-Black-Box-Optimization
Zeyuan Ma
Hongshu Guo
Yue-Jiao Gong
Jun Zhang
Kay Chen Tan
194
4
0
01 Nov 2024
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
Jing Zhang
Linjiajie Fang
Kexin Shi
Wenjia Wang
Bing-Yi Jing
OffRL
75
0
0
27 Oct 2024
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning
Kaiyan Zhao
Yiming Wang
Yuyang Chen
Yan Li
Leong Hou U
Xiaoguang Niu
56
1
0
27 Oct 2024
OGBench: Benchmarking Offline Goal-Conditioned RL
Seohong Park
Kevin Frans
Benjamin Eysenbach
Sergey Levine
OffRL
92
18
0
26 Oct 2024
Entity-based Reinforcement Learning for Autonomous Cyber Defence
Isaac Symes Thompson
Alberto Caron
Chris Hicks
V. Mavroudis
AAML
78
3
0
23 Oct 2024
Bridging Swarm Intelligence and Reinforcement Learning
Karthik Soma
Yann Bouteiller
Heiko Hamann
Giovanni Beltrame
42
0
0
23 Oct 2024
Offline-to-online Reinforcement Learning for Image-based Grasping with Scarce Demonstrations
Bryan Chan
Anson Leung
James Bergstra
OffRL
OnRL
80
0
0
19 Oct 2024
Reinfier and Reintrainer: Verification and Interpretation-Driven Safe Deep Reinforcement Learning Frameworks
Zixuan Yang
Jiaqi Zheng
Guihai Chen
OffRL
55
0
0
19 Oct 2024
Process Reward Model with Q-Value Rankings
W. Li
Yixuan Li
LRM
88
17
0
15 Oct 2024
Large Language Model-Enhanced Reinforcement Learning for Generic Bus Holding Control Strategies
Jiajie Yu
Yuhong Wang
Wei Ma
OffRL
87
1
0
14 Oct 2024
1
2
3
Next