2205.15656
Cited By
Sample-Efficient, Exploration-Based Policy Optimisation for Routing Problems
31 May 2022
N. Sultana
Jeffrey Chan
Tabinda Sarwar
A. K. Qin
Papers citing
"Sample-Efficient, Exploration-Based Policy Optimisation for Routing Problems"
13 / 13 papers shown
Title
Deep Policy Dynamic Programming for Vehicle Routing Problems
W. Kool
H. V. Hoof
J. Gromicho
Max Welling
72
120
0
23 Feb 2021
POMO: Policy Optimization with Multiple Optima for Reinforcement Learning
Yeong-Dae Kwon
Jinho Choo
Byoungjip Kim
Iljoo Yoon
Youngjune Gwon
Seungjai Min
94
336
0
30 Oct 2020
Learning 2-opt Heuristics for the Traveling Salesman Problem via Deep Reinforcement Learning
P. Costa
Jason Rhuggenaath
Yingqian Zhang
A. Akçay
69
141
0
03 Apr 2020
Soft Actor-Critic for Discrete Action Settings
Petros Christodoulou
OffRL
141
299
0
16 Oct 2019
An Efficient Graph Convolutional Network Technique for the Travelling Salesman Problem
Chaitanya K. Joshi
T. Laurent
Xavier Bresson
GNN
100
374
0
04 Jun 2019
Soft Actor-Critic Algorithms and Applications
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
...
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
136
2,445
0
13 Dec 2018
Learning to Perform Local Rewriting for Combinatorial Optimization
Xinyun Chen
Yuandong Tian
NAI
OffRL
118
348
0
30 Sep 2018
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation
Yuhuai Wu
Elman Mansimov
Shun Liao
Roger C. Grosse
Jimmy Ba
OffRL
57
629
0
17 Aug 2017
Learning Combinatorial Optimization Algorithms over Graphs
H. Dai
Elias Boutros Khalil
Yuyu Zhang
B. Dilkina
Le Song
114
1,472
0
05 Apr 2017
Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
110
1,342
0
27 Feb 2017
Neural Combinatorial Optimization with Reinforcement Learning
Irwan Bello
Hieu H. Pham
Quoc V. Le
Mohammad Norouzi
Samy Bengio
158
1,492
0
29 Nov 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
202
8,875
0
04 Feb 2016
Pointer Networks
Oriol Vinyals
Meire Fortunato
Navdeep Jaitly
121
3,059
0
09 Jun 2015