1910.02787
Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping
1 October 2019
Cristian Bodnar
A. Li
Karol Hausman
P. Pastor
Mrinal Kalakrishnan
OffRL
Papers citing
"Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping"
34 / 34 papers shown
Title
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
Jing Zhang
Linjiajie Fang
Kexin Shi
Wenjia Wang
Bing-Yi Jing
OffRL
44
0
0
27 Oct 2024
Bellman Unbiasedness: Toward Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation
Taehyun Cho
Seung Han
Kyungjae Lee
Seokhun Ju
Dohyeong Kim
Jungwoo Lee
72
0
0
31 Jul 2024
Octopus v3: Technical Report for On-device Sub-billion Multimodal AI Agent
Wei Chen
Zhiyuan Li
LLMAG
30
5
0
17 Apr 2024
Robustifying a Policy in Multi-Agent RL with Diverse Cooperative Behaviors and Adversarial Style Sampling for Assistive Tasks
Takayuki Osa
Tatsuya Harada
42
2
0
01 Mar 2024
Near-Minimax-Optimal Distributional Reinforcement Learning with a Generative Model
Mark Rowland
Wenliang Kevin Li
Rémi Munos
Clare Lyle
Yunhao Tang
Will Dabney
OOD
OffRL
30
1
0
12 Feb 2024
More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning
Kaiwen Wang
Owen Oertell
Alekh Agarwal
Nathan Kallus
Wen Sun
OffRL
88
12
0
11 Feb 2024
Distributional Bellman Operators over Mean Embeddings
Wenliang Kevin Li
Grégoire Delétang
Matthew Aitchison
Marcus Hutter
Anian Ruoss
Arthur Gretton
Mark Rowland
OffRL
15
4
0
09 Dec 2023
Learning Risk-Aware Quadrupedal Locomotion using Distributional Reinforcement Learning
Lukas Schneider
Jonas Frey
Takahiro Miki
Marco Hutter
32
9
0
25 Sep 2023
Safe Reinforcement Learning with Dead-Ends Avoidance and Recovery
Xiao Zhang
Hai Zhang
Hongtu Zhou
Chang Huang
Di Zhang
Chen Ye
Junqiao Zhao
OffRL
35
4
0
24 Jun 2023
The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation
Mark Rowland
Yunhao Tang
Clare Lyle
Rémi Munos
Marc G. Bellemare
Will Dabney
15
10
0
28 May 2023
The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning
Kaiwen Wang
Kevin Zhou
Runzhe Wu
Nathan Kallus
Wen Sun
OffRL
34
18
0
25 May 2023
Learn to Grasp via Intention Discovery and its Application to Challenging Clutter
Chao Zhao
Chunli Jiang
Junhao Cai
Hongyu Yu
M. Y. Wang
Qifeng Chen
24
0
0
05 Apr 2023
An Analysis of Quantile Temporal-Difference Learning
Mark Rowland
Rémi Munos
M. G. Azar
Yunhao Tang
Georg Ostrovski
Anna Harutyunyan
K. Tuyls
Marc G. Bellemare
Will Dabney
16
23
0
11 Jan 2023
All the Feels: A dexterous hand with large-area tactile sensing
Raunaq M. Bhirangi
Abigail DeFranco
Jacob Adkins
Carmel Majidi
Abhi Gupta
Tess Hellebrekers
Vikash Kumar
33
12
0
27 Oct 2022
The Nature of Temporal Difference Errors in Multi-step Distributional Reinforcement Learning
Yunhao Tang
Mark Rowland
Rémi Munos
Bernardo Avila-Pires
Will Dabney
Marc G. Bellemare
OffRL
27
11
0
15 Jul 2022
Efficient Risk-Averse Reinforcement Learning
Ido Greenberg
Yinlam Chow
Mohammad Ghavamzadeh
Shie Mannor
36
39
0
10 May 2022
Efficient and Accurate Candidate Generation for Grasp Pose Detection in SE(3)
A. T. Pas
Colin Keil
Robert W. Platt
26
4
0
03 Apr 2022
Let's Handle It: Generalizable Manipulation of Articulated Objects
Zhutian Yang
Aidan Curtis
30
1
0
23 Feb 2022
Learn to Grasp with Less Supervision: A Data-Efficient Maximum Likelihood Grasp Sampling Loss
Xinghao Zhu
Yefan Zhou
Yongxiang Fan
Lingfeng Sun
Jianyu Chen
Masayoshi Tomizuka
32
15
0
10 Aug 2021
Risk-Conditioned Distributional Soft Actor-Critic for Risk-Sensitive Navigation
Jinyoung Choi
C. Dance
Jung-Eun Kim
Seulbin Hwang
Kyungsik Park
UQCV
15
26
0
07 Apr 2021
Discovering Diverse Solutions in Deep Reinforcement Learning by Maximizing State-Action-Based Mutual Information
Takayuki Osa
Voot Tangkaratt
Masashi Sugiyama
21
31
0
12 Mar 2021
RMIX: Learning Risk-Sensitive Policies for Cooperative Reinforcement Learning Agents
Wei Qiu
Xinrun Wang
Runsheng Yu
Xu He
R. Wang
Bo An
S. Obraztsova
Zinovi Rabinovich
35
50
0
16 Feb 2021
COCOI: Contact-aware Online Context Inference for Generalizable Non-planar Pushing
Zhuo Xu
Wenhao Yu
Alexander Herzog
Wenlong Lu
Chuyuan Fu
Masayoshi Tomizuka
Yunfei Bai
Chenxi Liu
Daniel Ho
OffRL
27
17
0
23 Nov 2020
A Geometric Perspective on Self-Supervised Policy Adaptation
Cristian Bodnar
Karol Hausman
Gabriel Dulac-Arnold
Rico Jonschkowski
SSL
44
5
0
14 Nov 2020
ROLL: Visual Self-Supervised Reinforcement Learning with Object Reasoning
Yufei Wang
G. Narasimhan
Xingyu Lin
Brian Okorn
David Held
OffRL
LRM
30
13
0
13 Nov 2020
Accelerating Grasp Exploration by Leveraging Learned Priors
Han Yu Li
Michael Danielczuk
Ashwin Balakrishna
V. Satish
Ken Goldberg
11
10
0
11 Nov 2020
RetinaGAN: An Object-aware Approach to Sim-to-Real Transfer
Daniel Ho
Kanishka Rao
Zhuo Xu
Eric Jang
Mohi Khansari
Yunfei Bai
GAN
LM&Ro
45
97
0
06 Nov 2020
Learning Vision-based Reactive Policies for Obstacle Avoidance
Elie Aljalbout
Ji Chen
Konstantin Ritt
Maximilian Ulmer
Sami Haddadin
13
21
0
30 Oct 2020
One Solution is Not All You Need: Few-Shot Extrapolation via Structured MaxEnt RL
Saurabh Kumar
Aviral Kumar
Sergey Levine
Chelsea Finn
OffRL
16
90
0
27 Oct 2020
Non-Markov Policies to Reduce Sequential Failures in Robot Bin Picking
Kate Sanders
Michael Danielczuk
Jeffrey Mahler
A. Tanwani
Ken Goldberg
OffRL
21
6
0
20 Jul 2020
Representations for Stable Off-Policy Reinforcement Learning
Dibya Ghosh
Marc G. Bellemare
OffRL
SSL
OOD
14
43
0
10 Jul 2020
Learning to Play Table Tennis From Scratch using Muscular Robots
Le Chen
Simon Guist
Roberto Calandra
V. Berenz
Bernhard Schölkopf
Jan Peters
17
88
0
10 Jun 2020
Controlling Overestimation Bias with Truncated Mixture of Continuous Distributional Quantile Critics
Arsenii Kuznetsov
Pavel Shvechikov
Alexander Grishin
Dmitry Vetrov
136
188
0
08 May 2020
DSAC: Distributional Soft Actor Critic for Risk-Sensitive Reinforcement Learning
Xiaoteng Ma
Li Xia
Zhengyuan Zhou
Jun Yang
Qianchuan Zhao
32
17
0
30 Apr 2020