ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.10149
  4. Cited By
Continuous Control with Action Quantization from Demonstrations

Continuous Control with Action Quantization from Demonstrations

19 October 2021
Robert Dadashi
Léonard Hussenot
Damien Vincent
Sertan Girgin
Anton Raichuk
Matthieu Geist
Olivier Pietquin
    OffRL
ArXivPDFHTML

Papers citing "Continuous Control with Action Quantization from Demonstrations"

50 / 52 papers shown
Title
Is Bang-Bang Control All You Need? Solving Continuous Control with
  Bernoulli Policies
Is Bang-Bang Control All You Need? Solving Continuous Control with Bernoulli Policies
Tim Seyde
Igor Gilitschenski
Wilko Schwarting
Bartolomeo Stellato
Martin Riedmiller
Markus Wulfmeier
Daniela Rus
50
44
0
03 Nov 2021
Offline Reinforcement Learning with Implicit Q-Learning
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
251
874
0
12 Oct 2021
Implicit Behavioral Cloning
Implicit Behavioral Cloning
Peter R. Florence
Corey Lynch
Andy Zeng
Oscar Ramirez
Ayzaan Wahid
Laura Downs
Adrian S. Wong
Johnny Lee
Igor Mordatch
Jonathan Tompson
OffRL
91
378
0
01 Sep 2021
Implicitly Regularized RL with Implicit Q-Values
Implicitly Regularized RL with Implicit Q-Values
Nino Vieillard
Marcin Andrychowicz
Anton Raichuk
Olivier Pietquin
Matthieu Geist
OffRL
45
9
0
16 Aug 2021
What Matters in Learning from Offline Human Demonstrations for Robot
  Manipulation
What Matters in Learning from Offline Human Demonstrations for Robot Manipulation
Ajay Mandlekar
Danfei Xu
J. Wong
Soroush Nasiriany
Chen Wang
Rohun Kulkarni
Li Fei-Fei
Silvio Savarese
Yuke Zhu
Roberto Martín-Martín
OffRL
232
491
0
06 Aug 2021
A Minimalist Approach to Offline Reinforcement Learning
A Minimalist Approach to Offline Reinforcement Learning
Scott Fujimoto
S. Gu
OffRL
101
804
0
12 Jun 2021
What Matters for Adversarial Imitation Learning?
What Matters for Adversarial Imitation Learning?
Manu Orsini
Anton Raichuk
Léonard Hussenot
Damien Vincent
Robert Dadashi
Sertan Girgin
Matthieu Geist
Olivier Bachem
Olivier Pietquin
Marcin Andrychowicz
74
77
0
01 Jun 2021
Hyperparameter Selection for Imitation Learning
Hyperparameter Selection for Imitation Learning
Léonard Hussenot
Marcin Andrychowicz
Damien Vincent
Robert Dadashi
Anton Raichuk
...
Sabela Ramos
Manu Orsini
Olivier Bachem
Matthieu Geist
Olivier Pietquin
67
18
0
25 May 2021
Parrot: Data-Driven Behavioral Priors for Reinforcement Learning
Parrot: Data-Driven Behavioral Priors for Reinforcement Learning
Avi Singh
Huihan Liu
G. Zhou
Albert Yu
Nicholas Rhinehart
Sergey Levine
OffRL
OnRL
46
140
0
19 Nov 2020
Learning to Represent Action Values as a Hypergraph on the Action
  Vertices
Learning to Represent Action Values as a Hypergraph on the Action Vertices
Arash Tavakoli
Mehdi Fatemi
Petar Kormushev
36
23
0
28 Oct 2020
Munchausen Reinforcement Learning
Munchausen Reinforcement Learning
Nino Vieillard
Olivier Pietquin
Matthieu Geist
OffRL
22
90
0
28 Jul 2020
Strictly Batch Imitation Learning by Energy-based Distribution Matching
Strictly Batch Imitation Learning by Energy-based Distribution Matching
Daniel Jarrett
Ioana Bica
M. Schaar
OffRL
28
62
0
25 Jun 2020
Conservative Q-Learning for Offline Reinforcement Learning
Conservative Q-Learning for Offline Reinforcement Learning
Aviral Kumar
Aurick Zhou
George Tucker
Sergey Levine
OffRL
OnRL
104
1,780
0
08 Jun 2020
Primal Wasserstein Imitation Learning
Primal Wasserstein Imitation Learning
Robert Dadashi
Léonard Hussenot
Matthieu Geist
Olivier Pietquin
35
124
0
08 Jun 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
102
225
0
01 Jun 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on
  Open Problems
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
477
1,994
0
04 May 2020
Continuous-Discrete Reinforcement Learning for Hybrid Control in
  Robotics
Continuous-Discrete Reinforcement Learning for Hybrid Control in Robotics
Michael Neunert
A. Abdolmaleki
Markus Wulfmeier
Thomas Lampe
Jost Tobias Springenberg
Roland Hafner
Francesco Romano
J. Buchli
N. Heess
Martin Riedmiller
53
91
0
02 Jan 2020
Dota 2 with Large Scale Deep Reinforcement Learning
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN
VLM
CLL
AI4CE
LRM
94
1,811
0
13 Dec 2019
Behavior Regularized Offline Reinforcement Learning
Behavior Regularized Offline Reinforcement Learning
Yifan Wu
George Tucker
Ofir Nachum
OffRL
68
678
0
26 Nov 2019
A Divergence Minimization Perspective on Imitation Learning Methods
A Divergence Minimization Perspective on Imitation Learning Methods
Seyed Kamyar Seyed Ghasemipour
R. Zemel
S. Gu
51
248
0
06 Nov 2019
Relay Policy Learning: Solving Long-Horizon Tasks via Imitation and
  Reinforcement Learning
Relay Policy Learning: Solving Long-Horizon Tasks via Imitation and Reinforcement Learning
Abhishek Gupta
Vikash Kumar
Corey Lynch
Sergey Levine
Karol Hausman
54
429
0
25 Oct 2019
CAQL: Continuous Action Q-Learning
CAQL: Continuous Action Q-Learning
Moonkyung Ryu
Yinlam Chow
Ross Anderson
Christian Tjandraatmadja
Craig Boutilier
215
42
0
26 Sep 2019
Goal-conditioned Imitation Learning
Goal-conditioned Imitation Learning
Yiming Ding
Carlos Florensa
Mariano Phielipp
Pieter Abbeel
47
221
0
13 Jun 2019
Distributional Policy Optimization: An Alternative Approach for
  Continuous Control
Distributional Policy Optimization: An Alternative Approach for Continuous Control
Chen Tessler
Guy Tennenholtz
Shie Mannor
OffRL
32
44
0
23 May 2019
Random Expert Distillation: Imitation Learning via Expert Policy Support
  Estimation
Random Expert Distillation: Imitation Learning via Expert Policy Support Estimation
Ruohan Wang
C. Ciliberto
P. Amadori
Y. Demiris
44
62
0
16 May 2019
Q-Learning for Continuous Actions with Cross-Entropy Guided Policies
Q-Learning for Continuous Actions with Cross-Entropy Guided Policies
Riley Simmons-Edler
Ben Eisner
E. Mitchell
Sebastian Seung
Daniel D. Lee
56
28
0
25 Mar 2019
Learning Latent Plans from Play
Learning Latent Plans from Play
Corey Lynch
Mohi Khansari
Ted Xiao
Vikash Kumar
Jonathan Tompson
Sergey Levine
P. Sermanet
SSL
LM&Ro
71
396
0
05 Mar 2019
Discretizing Continuous Action Space for On-Policy Optimization
Discretizing Continuous Action Space for On-Policy Optimization
Yunhao Tang
Shipra Agrawal
OffRL
55
119
0
29 Jan 2019
Soft Actor-Critic Algorithms and Applications
Soft Actor-Critic Algorithms and Applications
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
...
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
105
2,391
0
13 Dec 2018
Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward
  Bias in Adversarial Imitation Learning
Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning
Ilya Kostrikov
Kumar Krishna Agrawal
Debidatta Dwibedi
Sergey Levine
Jonathan Tompson
71
257
0
09 Sep 2018
Learning Dexterous In-Hand Manipulation
Learning Dexterous In-Hand Manipulation
OpenAI OpenAI
Marcin Andrychowicz
Bowen Baker
Maciek Chociej
Rafal Jozefowicz
...
Szymon Sidor
Joshua Tobin
Peter Welinder
Lilian Weng
Wojciech Zaremba
86
1,865
0
01 Aug 2018
QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic
  Manipulation
QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation
Dmitry Kalashnikov
A. Irpan
P. Pastor
Julian Ibarz
Alexander Herzog
...
Deirdre Quillen
E. Holly
Mrinal Kalakrishnan
Vincent Vanhoucke
Sergey Levine
98
1,454
0
27 Jun 2018
Hierarchical Imitation and Reinforcement Learning
Hierarchical Imitation and Reinforcement Learning
Hoang Minh Le
Nan Jiang
Alekh Agarwal
Miroslav Dudík
Yisong Yue
Hal Daumé
42
191
0
01 Mar 2018
Addressing Function Approximation Error in Actor-Critic Methods
Addressing Function Approximation Error in Actor-Critic Methods
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
157
5,121
0
26 Feb 2018
One-Shot Imitation from Observing Humans via Domain-Adaptive
  Meta-Learning
One-Shot Imitation from Observing Humans via Domain-Adaptive Meta-Learning
Tianhe Yu
Chelsea Finn
Annie Xie
Sudeep Dasari
Tianhao Zhang
Pieter Abbeel
Sergey Levine
47
359
0
05 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
227
8,236
0
04 Jan 2018
Action Branching Architectures for Deep Reinforcement Learning
Action Branching Architectures for Deep Reinforcement Learning
Arash Tavakoli
Fabio Pardo
Petar Kormushev
39
260
0
24 Nov 2017
DDCO: Discovery of Deep Continuous Options for Robot Learning from
  Demonstrations
DDCO: Discovery of Deep Continuous Options for Robot Learning from Demonstrations
S. Krishnan
Roy Fox
Ion Stoica
Ken Goldberg
53
82
0
15 Oct 2017
Learning Complex Dexterous Manipulation with Deep Reinforcement Learning
  and Demonstrations
Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations
Aravind Rajeswaran
Vikash Kumar
Abhishek Gupta
Giulia Vezzani
John Schulman
E. Todorov
Sergey Levine
109
1,079
0
28 Sep 2017
Leveraging Demonstrations for Deep Reinforcement Learning on Robotics
  Problems with Sparse Rewards
Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards
Matej Vecerík
Todd Hester
Jonathan Scholz
Fumin Wang
Olivier Pietquin
Bilal Piot
N. Heess
Thomas Rothörl
Thomas Lampe
Martin Riedmiller
OffRL
57
661
0
27 Jul 2017
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
243
18,685
0
20 Jul 2017
Discrete Sequential Prediction of Continuous Actions for Deep RL
Discrete Sequential Prediction of Continuous Actions for Deep RL
Luke Metz
Julian Ibarz
Navdeep Jaitly
James Davidson
BDL
OffRL
57
118
0
14 May 2017
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement
  Learning
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
Haoran Tang
Rein Houthooft
Davis Foote
Adam Stooke
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
OffRL
84
764
0
15 Nov 2016
Input Convex Neural Networks
Input Convex Neural Networks
Brandon Amos
Lei Xu
J. Zico Kolter
262
611
0
22 Sep 2016
Generative Adversarial Imitation Learning
Generative Adversarial Imitation Learning
Jonathan Ho
Stefano Ermon
GAN
114
3,089
0
10 Jun 2016
Continuous Deep Q-Learning with Model-based Acceleration
Continuous Deep Q-Learning with Model-based Acceleration
S. Gu
Timothy Lillicrap
Ilya Sutskever
Sergey Levine
62
1,010
0
02 Mar 2016
Deep Reinforcement Learning in Large Discrete Action Spaces
Deep Reinforcement Learning in Large Discrete Action Spaces
Gabriel Dulac-Arnold
Richard Evans
H. V. Hasselt
P. Sunehag
Timothy Lillicrap
Jonathan J. Hunt
Timothy A. Mann
T. Weber
T. Degris
Ben Coppin
OffRL
52
573
0
24 Dec 2015
Prioritized Experience Replay
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
198
3,781
0
18 Nov 2015
Deep Reinforcement Learning with Double Q-learning
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
134
7,590
0
22 Sep 2015
Continuous control with deep reinforcement learning
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
210
13,174
0
09 Sep 2015
12
Next