Sample-Efficient, Exploration-Based Policy Optimisation for Routing Problems

31 May 2022
N. Sultana
Jeffrey Chan
Tabinda Sarwar
A. K. Qin

Papers citing "Sample-Efficient, Exploration-Based Policy Optimisation for Routing Problems"

13 / 13 papers shown
Deep Policy Dynamic Programming for Vehicle Routing Problems
W. Kool
H. V. Hoof
J. Gromicho
Max Welling
72
120
0
23 Feb 2021
POMO: Policy Optimization with Multiple Optima for Reinforcement Learning
Yeong-Dae Kwon
Jinho Choo
Byoungjip Kim
Iljoo Yoon
Youngjune Gwon
Seungjai Min
94
336
0
30 Oct 2020
Learning 2-opt Heuristics for the Traveling Salesman Problem via Deep Reinforcement Learning
P. Costa
Jason Rhuggenaath
Yingqian Zhang
A. Akçay
69
141
0
03 Apr 2020
Soft Actor-Critic for Discrete Action Settings
Petros Christodoulou
OffRL
141
299
0
16 Oct 2019
An Efficient Graph Convolutional Network Technique for the Travelling Salesman Problem
Chaitanya K. Joshi
T. Laurent
Xavier Bresson
GNN
100
374
0
04 Jun 2019
Soft Actor-Critic Algorithms and Applications
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
...
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
136
2,445
0
13 Dec 2018
Learning to Perform Local Rewriting for Combinatorial Optimization
Xinyun Chen
Yuandong Tian
NAI
OffRL
118
348
0
30 Sep 2018
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation
Yuhuai Wu
Elman Mansimov
Shun Liao
Roger C. Grosse
Jimmy Ba
OffRL
57
629
0
17 Aug 2017
Learning Combinatorial Optimization Algorithms over Graphs
H. Dai
Elias Boutros Khalil
Yuyu Zhang
B. Dilkina
Le Song
114
1,472
0
05 Apr 2017
Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
110
1,342
0
27 Feb 2017
Neural Combinatorial Optimization with Reinforcement Learning
Irwan Bello
Hieu H. Pham
Quoc V. Le
Mohammad Norouzi
Samy Bengio
158
1,492
0
29 Nov 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
202
8,875
0
04 Feb 2016
Pointer Networks
Oriol Vinyals
Meire Fortunato
Navdeep Jaitly
121
3,059
0
09 Jun 2015