1611.01843
Learning to Perform Physics Experiments via Deep Reinforcement Learning
6 November 2016
Misha Denil
Pulkit Agrawal
Tejas D. Kulkarni
Tom Erez
Peter W. Battaglia
Nando de Freitas
AI4CE
Papers citing
"Learning to Perform Physics Experiments via Deep Reinforcement Learning"
50 / 102 papers shown
Title
MUST: Multi-Scale Structural-Temporal Link Prediction Model for UAV Ad Hoc Networks
Cunlai Pu
Fangrui Wu
Rajput Ramiz Sharafat
Guangzhao Dai
Xiangbo Shu
21
0
0
14 May 2025
Extending a Quantum Reinforcement Learning Exploration Policy with Flags to Connect Four
Filipe Santos
João Paulo Fernandes
Luís Macedo
16
0
0
07 May 2025
Application of Deep Reinforcement Learning for Intrusion Detection in Internet of Things: A Systematic Review
Saeid Jamshidi
Amin Nikanjam
Kawser Wazed Nafi
Foutse Khomh
Rasoul Rasta
25
2
0
20 Apr 2025
Human-AI Experience in Integrated Development Environments: A Systematic Literature Review
Agnia Sergeyuk
Ilya Zakharov
Ekaterina Koshchenko
M. Izadi
58
0
0
08 Mar 2025
Highly Parallelized Reinforcement Learning Training with Relaxed Assignment Dependencies
Zhouyu He
Peng Qiao
Rongchun Li
Yong Dou
Yusong Tan
OffRL
57
0
0
27 Feb 2025
Portable Reward Tuning: Towards Reusable Fine-Tuning across Different Pretrained Models
Daiki Chijiwa
Taku Hasegawa
Kyosuke Nishida
Kuniko Saito
Susumu Takeuchi
47
0
0
18 Feb 2025
Proposing Hierarchical Goal-Conditioned Policy Planning in Multi-Goal Reinforcement Learning
Gavin B. Rens
43
0
0
03 Jan 2025
MetaTrading: An Immersion-Aware Model Trading Framework for Vehicular Metaverse Services
Hongjia Wu
Hui Zeng
Zehui Xiong
Jiawen Kang
Zhiping Cai
Tse-Tin Chan
Dusit Niyato
Zhu Han
30
0
0
25 Oct 2024
How to Choose a Reinforcement-Learning Algorithm
Fabian Bongratz
Vladimir Golkov
Lukas Mautner
Luca Della Libera
Frederik Heetmeyer
Felix Czaja
Julian Rodemann
Daniel Cremers
34
1
0
30 Jul 2024
PrimeGuard: Safe and Helpful LLMs through Tuning-Free Routing
Blazej Manczak
Eliott Zemour
Eric Lin
Vaikkunth Mugunthan
26
2
0
23 Jul 2024
$\mathrm{E^{2}CFD}$: Towards Effective and Efficient Cost Function Design for Safe Reinforcement Learning via Large Language Model
Zepeng Wang
Chao Ma
Linjiang Zhou
Libing Wu
Lei Yang
Xiaochuan Shi
Guojun Peng
OffRL
37
0
0
08 Jul 2024
Artificial intelligence, rationalization, and the limits of control in the public sector: the case of tax policy optimization
Jakob Mokander
Ralph Schroeder
36
6
0
07 Jul 2024
Model-Free Active Exploration in Reinforcement Learning
Alessio Russo
Alexandre Proutière
OffRL
21
2
0
30 Jun 2024
REBEL: Reinforcement Learning via Regressing Relative Rewards
Zhaolin Gao
Jonathan D. Chang
Wenhao Zhan
Owen Oertell
Gokul Swamy
Kianté Brantley
Thorsten Joachims
J. Andrew Bagnell
Jason D. Lee
Wen Sun
OffRL
40
31
0
25 Apr 2024
A Review of Reward Functions for Reinforcement Learning in the context of Autonomous Driving
Ahmed Abouelazm
Jonas Michel
J. M. Zöllner
40
6
0
12 Apr 2024
A proximal policy optimization based intelligent home solar management
Kode Creer
Imtiaz Parvez
21
0
0
05 Apr 2024
Gradient Shaping for Multi-Constraint Safe Reinforcement Learning
Yi-Fan Yao
Zuxin Liu
Zhepeng Cen
Peide Huang
Tingnan Zhang
Wenhao Yu
Ding Zhao
OffRL
71
6
0
23 Dec 2023
Partial End-to-end Reinforcement Learning for Robustness Against Modelling Error in Autonomous Racing
Andrew Murdoch
J. C. Schoeman
H. W. Jordaan
19
2
0
11 Dec 2023
Learning active tactile perception through belief-space control
J. Tremblay
D. Meger
F. Hogan
Gregory Dudek
37
1
0
30 Nov 2023
Closed Drafting as a Case Study for First-Principle Interpretability, Memory, and Generalizability in Deep Reinforcement Learning
Ryan Rezai
Jason Wang
27
0
0
31 Oct 2023
Fundamental Limits of Deep Learning-Based Binary Classifiers Trained with Hinge Loss
T. Getu
Georges Kaddoum
M. Bennis
40
1
0
13 Sep 2023
Robotic Ultrasound Imaging: State-of-the-Art and Future Perspectives
Zhongliang Jiang
Septimiu E. Salcudean
Nassir Navab
19
85
0
08 Jul 2023
Plan, Eliminate, and Track -- Language Models are Good Teachers for Embodied Agents
Yu-Chih Chen
So Yeon Min
Chase Davis
Ruslan Salakhutdinov
A. Azaria
Yuan-Fang Li
Tom Michael Mitchell
A. Bovik
LM&Ro
LLMAG
78
33
0
03 May 2023
A Review of Symbolic, Subsymbolic and Hybrid Methods for Sequential Decision Making
Carlos Núnez-Molina
Pablo Mesejo
Juan Fernández-Olivares
30
3
0
20 Apr 2023
Multi-Agent Reinforcement Learning with Action Masking for UAV-enabled Mobile Communications
D. Rizvi
David P. Boyle
21
4
0
29 Mar 2023
Multi-Agent Interplay in a Competitive Survival Environment
Andrea Fanti
18
0
0
19 Jan 2023
On the Challenges of using Reinforcement Learning in Precision Drug Dosing: Delay and Prolongedness of Action Effects
Sumana Basu
M. Legault
Adriana Romero Soriano
Doina Precup
OffRL
26
3
0
02 Jan 2023
Instrumental Variables in Causal Inference and Machine Learning: A Survey
Anpeng Wu
Kun Kuang
Ruoxuan Xiong
Fei Wu
SyDa
CML
25
6
0
12 Dec 2022
Actively Learning Costly Reward Functions for Reinforcement Learning
André Eberhard
Houssam Metni
G. Fahland
A. Stroh
Pascal Friederich
OffRL
35
0
0
23 Nov 2022
Reinforcement Learning-based Defect Mitigation for Quality Assurance of Additive Manufacturing
Jihoon Chung
Bo Shen
A. C. Law
Zhenyu
Zhen Kong
OffRL
OnRL
AI4CE
8
22
0
28 Oct 2022
AACHER: Assorted Actor-Critic Deep Reinforcement Learning with Hindsight Experience Replay
Adarsh Sehgal
Muskan Sehgal
Hung M. La
14
2
0
24 Oct 2022
A model-based approach to meta-Reinforcement Learning: Transformers and tree search
Brieuc Pinon
Jean-Charles Delvenne
Raphaël Jungers
OffRL
29
3
0
24 Aug 2022
Towards Robust On-Ramp Merging via Augmented Multimodal Reinforcement Learning
Gaurav R. Bagwe
Xianhao Chen
Xiaoyong Yuan
Lan Zhang
38
4
0
21 Jul 2022
Explainability in Deep Reinforcement Learning, a Review into Current Methods and Applications
Tom Hickling
Abdelhafid Zenati
Nabil Aouf
P. Spencer
XAI
AI4TS
43
22
0
05 Jul 2022
Innovations in Integrating Machine Learning and Agent-Based Modeling of Biomedical Systems
N. Sivakumar
C. Mura
S. Peirce
AI4CE
45
21
0
02 Jun 2022
Hyperparameter Tuning for Deep Reinforcement Learning Applications
M. Kiran
Melis Ozyildirim
37
22
0
26 Jan 2022
CoMPS: Continual Meta Policy Search
Glen Berseth
Zhiwei Zhang
Grace Zhang
Chelsea Finn
Sergey Levine
CLL
OffRL
28
16
0
08 Dec 2021
Off-policy Reinforcement Learning with Optimistic Exploration and Distribution Correction
A. Ahmad
Shuo Cheng
D. Saraswat
Aly El Gamal
Luu Anh Tuan
Gurmukh Johal
OffRL
OnRL
20
1
0
22 Oct 2021
Shifting Capsule Networks from the Cloud to the Deep Edge
Miguel Costa
Diogo Costa
T. Gomes
Sandro Pinto
21
5
0
06 Oct 2021
Secure Bayesian Federated Analytics for Privacy-Preserving Trend Detection
Amit Chaulwar
M. Huth
FedML
4
3
0
28 Jul 2021
Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation
Yaowen Yao
Li Xiao
Zhicheng An
Wanpeng Zhang
Dijun Luo
66
20
0
05 Jul 2021
Parameter-free Gradient Temporal Difference Learning
Andrew Jacobsen
Alan Chan
OffRL
21
2
0
10 May 2021
Reducing Bus Bunching with Asynchronous Multi-Agent Reinforcement Learning
Changyin Sun
Lijun Sun
19
16
0
02 May 2021
Network Defense is Not a Game
Andres Molina-Markham
Ransom K. Winder
Ahmad Ridley
AAML
25
12
0
20 Apr 2021
Intuitive Physics Guided Exploration for Sample Efficient Sim2real Transfer
B. L. Semage
Thommen George Karimpanal
Santu Rana
Svetha Venkatesh
PINN
17
0
0
18 Apr 2021
Replay in Deep Learning: Current Approaches and Missing Biological Elements
Tyler L. Hayes
G. Krishnan
M. Bazhenov
H. Siegelmann
T. Sejnowski
Christopher Kanan
CLL
36
130
0
01 Apr 2021
Deep Hedging of Derivatives Using Reinforcement Learning
Jay Cao
Jacky Chen
J. Hull
Zissis Poulos
17
71
0
29 Mar 2021
Sample-efficient Reinforcement Learning Representation Learning with Curiosity Contrastive Forward Dynamics Model
Thanh Nguyen
Tung M. Luu
Thang Vu
Chang D. Yoo
15
17
0
15 Mar 2021
Comparing Popular Simulation Environments in the Scope of Robotics and Reinforcement Learning
Marian Korber
Johann Lange
S. Rediske
Simon Steinmann
Roland Glück
11
50
0
08 Mar 2021
Bridge the Vision Gap from Field to Command: A Deep Learning Network Enhancing Illumination and Details
Zhuqing Jiang
Chang Liu
Yanan Wang
Kai Li
Aidong Men
Haiying Wang
Haiyong Luo
6
2
0
20 Jan 2021