ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.02787
  4. Cited By
Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping

Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping

1 October 2019
Cristian Bodnar
A. Li
Karol Hausman
P. Pastor
Mrinal Kalakrishnan
    OffRL
ArXivPDFHTML

Papers citing "Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping"

34 / 34 papers shown
Title
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
Jing Zhang
Linjiajie Fang
Kexin Shi
Wenjia Wang
Bing-Yi Jing
OffRL
44
0
0
27 Oct 2024
Bellman Unbiasedness: Toward Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation
Bellman Unbiasedness: Toward Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation
Taehyun Cho
Seung Han
Kyungjae Lee
Seokhun Ju
Dohyeong Kim
Jungwoo Lee
72
0
0
31 Jul 2024
Octopus v3: Technical Report for On-device Sub-billion Multimodal AI
  Agent
Octopus v3: Technical Report for On-device Sub-billion Multimodal AI Agent
Wei Chen
Zhiyuan Li
LLMAG
30
5
0
17 Apr 2024
Robustifying a Policy in Multi-Agent RL with Diverse Cooperative
  Behaviors and Adversarial Style Sampling for Assistive Tasks
Robustifying a Policy in Multi-Agent RL with Diverse Cooperative Behaviors and Adversarial Style Sampling for Assistive Tasks
Takayuki Osa
Tatsuya Harada
42
2
0
01 Mar 2024
Near-Minimax-Optimal Distributional Reinforcement Learning with a
  Generative Model
Near-Minimax-Optimal Distributional Reinforcement Learning with a Generative Model
Mark Rowland
Wenliang Kevin Li
Rémi Munos
Clare Lyle
Yunhao Tang
Will Dabney
OOD
OffRL
30
1
0
12 Feb 2024
More Benefits of Being Distributional: Second-Order Bounds for
  Reinforcement Learning
More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning
Kaiwen Wang
Owen Oertell
Alekh Agarwal
Nathan Kallus
Wen Sun
OffRL
88
12
0
11 Feb 2024
Distributional Bellman Operators over Mean Embeddings
Distributional Bellman Operators over Mean Embeddings
Wenliang Kevin Li
Grégoire Delétang
Matthew Aitchison
Marcus Hutter
Anian Ruoss
Arthur Gretton
Mark Rowland
OffRL
15
4
0
09 Dec 2023
Learning Risk-Aware Quadrupedal Locomotion using Distributional
  Reinforcement Learning
Learning Risk-Aware Quadrupedal Locomotion using Distributional Reinforcement Learning
Lukas Schneider
Jonas Frey
Takahiro Miki
Marco Hutter
32
9
0
25 Sep 2023
Safe Reinforcement Learning with Dead-Ends Avoidance and Recovery
Safe Reinforcement Learning with Dead-Ends Avoidance and Recovery
Xiao Zhang
Hai Zhang
Hongtu Zhou
Chang Huang
Di Zhang
Chen Ye
Junqiao Zhao
OffRL
35
4
0
24 Jun 2023
The Statistical Benefits of Quantile Temporal-Difference Learning for
  Value Estimation
The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation
Mark Rowland
Yunhao Tang
Clare Lyle
Rémi Munos
Marc G. Bellemare
Will Dabney
15
10
0
28 May 2023
The Benefits of Being Distributional: Small-Loss Bounds for
  Reinforcement Learning
The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning
Kaiwen Wang
Kevin Zhou
Runzhe Wu
Nathan Kallus
Wen Sun
OffRL
34
18
0
25 May 2023
Learn to Grasp via Intention Discovery and its Application to
  Challenging Clutter
Learn to Grasp via Intention Discovery and its Application to Challenging Clutter
Chao Zhao
Chunli Jiang
Junhao Cai
Hongyu Yu
M. Y. Wang
Qifeng Chen
24
0
0
05 Apr 2023
An Analysis of Quantile Temporal-Difference Learning
An Analysis of Quantile Temporal-Difference Learning
Mark Rowland
Rémi Munos
M. G. Azar
Yunhao Tang
Georg Ostrovski
Anna Harutyunyan
K. Tuyls
Marc G. Bellemare
Will Dabney
16
23
0
11 Jan 2023
All the Feels: A dexterous hand with large-area tactile sensing
All the Feels: A dexterous hand with large-area tactile sensing
Raunaq M. Bhirangi
Abigail DeFranco
Jacob Adkins
Carmel Majidi
Abhi Gupta
Tess Hellebrekers
Vikash Kumar
33
12
0
27 Oct 2022
The Nature of Temporal Difference Errors in Multi-step Distributional
  Reinforcement Learning
The Nature of Temporal Difference Errors in Multi-step Distributional Reinforcement Learning
Yunhao Tang
Mark Rowland
Rémi Munos
Bernardo Avila-Pires
Will Dabney
Marc G. Bellemare
OffRL
27
11
0
15 Jul 2022
Efficient Risk-Averse Reinforcement Learning
Efficient Risk-Averse Reinforcement Learning
Ido Greenberg
Yinlam Chow
Mohammad Ghavamzadeh
Shie Mannor
36
39
0
10 May 2022
Efficient and Accurate Candidate Generation for Grasp Pose Detection in
  SE(3)
Efficient and Accurate Candidate Generation for Grasp Pose Detection in SE(3)
A. T. Pas
Colin Keil
Robert W. Platt
26
4
0
03 Apr 2022
Let's Handle It: Generalizable Manipulation of Articulated Objects
Let's Handle It: Generalizable Manipulation of Articulated Objects
Zhutian Yang
Aidan Curtis
30
1
0
23 Feb 2022
Learn to Grasp with Less Supervision: A Data-Efficient Maximum
  Likelihood Grasp Sampling Loss
Learn to Grasp with Less Supervision: A Data-Efficient Maximum Likelihood Grasp Sampling Loss
Xinghao Zhu
Yefan Zhou
Yongxiang Fan
Lingfeng Sun
Jianyu Chen
Masayoshi Tomizuka
32
15
0
10 Aug 2021
Risk-Conditioned Distributional Soft Actor-Critic for Risk-Sensitive
  Navigation
Risk-Conditioned Distributional Soft Actor-Critic for Risk-Sensitive Navigation
Jinyoung Choi
C. Dance
Jung-Eun Kim
Seulbin Hwang
Kyungsik Park
UQCV
15
26
0
07 Apr 2021
Discovering Diverse Solutions in Deep Reinforcement Learning by
  Maximizing State-Action-Based Mutual Information
Discovering Diverse Solutions in Deep Reinforcement Learning by Maximizing State-Action-Based Mutual Information
Takayuki Osa
Voot Tangkaratt
Masashi Sugiyama
21
31
0
12 Mar 2021
RMIX: Learning Risk-Sensitive Policies for Cooperative Reinforcement
  Learning Agents
RMIX: Learning Risk-Sensitive Policies for Cooperative Reinforcement Learning Agents
Wei Qiu
Xinrun Wang
Runsheng Yu
Xu He
R. Wang
Bo An
S. Obraztsova
Zinovi Rabinovich
35
50
0
16 Feb 2021
COCOI: Contact-aware Online Context Inference for Generalizable
  Non-planar Pushing
COCOI: Contact-aware Online Context Inference for Generalizable Non-planar Pushing
Zhuo Xu
Wenhao Yu
Alexander Herzog
Wenlong Lu
Chuyuan Fu
Masayoshi Tomizuka
Yunfei Bai
Chenxi Liu
Daniel Ho
OffRL
27
17
0
23 Nov 2020
A Geometric Perspective on Self-Supervised Policy Adaptation
A Geometric Perspective on Self-Supervised Policy Adaptation
Cristian Bodnar
Karol Hausman
Gabriel Dulac-Arnold
Rico Jonschkowski
SSL
44
5
0
14 Nov 2020
ROLL: Visual Self-Supervised Reinforcement Learning with Object
  Reasoning
ROLL: Visual Self-Supervised Reinforcement Learning with Object Reasoning
Yufei Wang
G. Narasimhan
Xingyu Lin
Brian Okorn
David Held
OffRL
LRM
30
13
0
13 Nov 2020
Accelerating Grasp Exploration by Leveraging Learned Priors
Accelerating Grasp Exploration by Leveraging Learned Priors
Han Yu Li
Michael Danielczuk
Ashwin Balakrishna
V. Satish
Ken Goldberg
11
10
0
11 Nov 2020
RetinaGAN: An Object-aware Approach to Sim-to-Real Transfer
RetinaGAN: An Object-aware Approach to Sim-to-Real Transfer
Daniel Ho
Kanishka Rao
Zhuo Xu
Eric Jang
Mohi Khansari
Yunfei Bai
GAN
LM&Ro
45
97
0
06 Nov 2020
Learning Vision-based Reactive Policies for Obstacle Avoidance
Learning Vision-based Reactive Policies for Obstacle Avoidance
Elie Aljalbout
Ji Chen
Konstantin Ritt
Maximilian Ulmer
Sami Haddadin
13
21
0
30 Oct 2020
One Solution is Not All You Need: Few-Shot Extrapolation via Structured
  MaxEnt RL
One Solution is Not All You Need: Few-Shot Extrapolation via Structured MaxEnt RL
Saurabh Kumar
Aviral Kumar
Sergey Levine
Chelsea Finn
OffRL
16
90
0
27 Oct 2020
Non-Markov Policies to Reduce Sequential Failures in Robot Bin Picking
Non-Markov Policies to Reduce Sequential Failures in Robot Bin Picking
Kate Sanders
Michael Danielczuk
Jeffrey Mahler
A. Tanwani
Ken Goldberg
OffRL
21
6
0
20 Jul 2020
Representations for Stable Off-Policy Reinforcement Learning
Representations for Stable Off-Policy Reinforcement Learning
Dibya Ghosh
Marc G. Bellemare
OffRL
SSL
OOD
14
43
0
10 Jul 2020
Learning to Play Table Tennis From Scratch using Muscular Robots
Learning to Play Table Tennis From Scratch using Muscular Robots
Le Chen
Simon Guist
Roberto Calandra
V. Berenz
Bernhard Schölkopf
Jan Peters
17
88
0
10 Jun 2020
Controlling Overestimation Bias with Truncated Mixture of Continuous
  Distributional Quantile Critics
Controlling Overestimation Bias with Truncated Mixture of Continuous Distributional Quantile Critics
Arsenii Kuznetsov
Pavel Shvechikov
Alexander Grishin
Dmitry Vetrov
136
188
0
08 May 2020
DSAC: Distributional Soft Actor Critic for Risk-Sensitive Reinforcement
  Learning
DSAC: Distributional Soft Actor Critic for Risk-Sensitive Reinforcement Learning
Xiaoteng Ma
Li Xia
Zhengyuan Zhou
Jun Yang
Qianchuan Zhao
32
17
0
30 Apr 2020
1