ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1806.10293
  4. Cited By
QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic
  Manipulation

QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation

27 June 2018
Dmitry Kalashnikov
A. Irpan
P. Pastor
Julian Ibarz
Alexander Herzog
Eric Jang
Deirdre Quillen
E. Holly
Mrinal Kalakrishnan
Vincent Vanhoucke
Sergey Levine
ArXivPDFHTML

Papers citing "QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation"

50 / 378 papers shown
Title
Challenges and Opportunities in Offline Reinforcement Learning from
  Visual Observations
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
Cong Lu
Philip J. Ball
Tim G. J. Rudner
Jack Parker-Holder
Michael A. Osborne
Yee Whye Teh
OffRL
32
52
0
09 Jun 2022
Action Noise in Off-Policy Deep Reinforcement Learning: Impact on
  Exploration and Performance
Action Noise in Off-Policy Deep Reinforcement Learning: Impact on Exploration and Performance
Jakob J. Hollenstein
Sayantan Auddy
Matteo Saveriano
Erwan Renaudo
J. Piater
46
17
0
08 Jun 2022
Efficient Scheduling of Data Augmentation for Deep Reinforcement
  Learning
Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning
Byungchan Ko
Jungseul Ok
OnRL
27
5
0
01 Jun 2022
Play it by Ear: Learning Skills amidst Occlusion through Audio-Visual
  Imitation Learning
Play it by Ear: Learning Skills amidst Occlusion through Audio-Visual Imitation Learning
Maximilian Du
Olivia Y. Lee
Suraj Nair
Chelsea Finn
OffRL
62
33
0
30 May 2022
BulletArm: An Open-Source Robotic Manipulation Benchmark and Learning
  Framework
BulletArm: An Open-Source Robotic Manipulation Benchmark and Learning Framework
Dian Wang
Colin Kohler
Xu Zhu
Ming Jia
Robert W. Platt
32
9
0
28 May 2022
Grasping as Inference: Reactive Grasping in Heavily Cluttered
  Environment
Grasping as Inference: Reactive Grasping in Heavily Cluttered Environment
Dongwon Son
30
9
0
26 May 2022
Learning Task-relevant Representations for Generalization via
  Characteristic Functions of Reward Sequence Distributions
Learning Task-relevant Representations for Generalization via Characteristic Functions of Reward Sequence Distributions
Rui Yang
Jie Wang
Zijie Geng
Mingxuan Ye
Shuiwang Ji
Bin Li
Fengli Wu
OOD
36
20
0
20 May 2022
Planning to Practice: Efficient Online Fine-Tuning by Composing Goals in
  Latent Space
Planning to Practice: Efficient Online Fine-Tuning by Composing Goals in Latent Space
Kuan Fang
Patrick Yin
Ashvin Nair
Sergey Levine
OffRL
58
29
0
17 May 2022
Robotic Maintenance of Road Infrastructures: The HERON Project
Robotic Maintenance of Road Infrastructures: The HERON Project
Iason Katsamenis
M. Bimpas
Eftychios E. Protopapadakis
Charalampos Zafeiropoulos
D. Kalogeras
...
Yannis Handanos
Franziska Schmidt
Lionel Ott
Miquel Cantero
Rafael Lopez
23
22
0
09 May 2022
Coarse-to-fine Q-attention with Tree Expansion
Coarse-to-fine Q-attention with Tree Expansion
Stephen James
Pieter Abbeel
26
11
0
26 Apr 2022
Training and Evaluation of Deep Policies using Reinforcement Learning
  and Generative Models
Training and Evaluation of Deep Policies using Reinforcement Learning and Generative Models
Ali Ghadirzadeh
Petra Poklukar
Karol Arndt
Chelsea Finn
Ville Kyrki
Danica Kragic
Mårten Björkman
OffRL
22
1
0
18 Apr 2022
Synthesizing Adversarial Visual Scenarios for Model-Based Robotic
  Control
Synthesizing Adversarial Visual Scenarios for Model-Based Robotic Control
Shubhankar Agarwal
Sandeep Chinchali
AAML
40
4
0
13 Apr 2022
When Should We Prefer Offline Reinforcement Learning Over Behavioral
  Cloning?
When Should We Prefer Offline Reinforcement Learning Over Behavioral Cloning?
Aviral Kumar
Joey Hong
Anika Singh
Sergey Levine
OffRL
47
77
0
12 Apr 2022
Learning Design and Construction with Varying-Sized Materials via
  Prioritized Memory Resets
Learning Design and Construction with Varying-Sized Materials via Prioritized Memory Resets
Yunfei Li
Tao Kong
Lei Li
Yi Wu
58
4
0
12 Apr 2022
Learning to Drive by Watching YouTube Videos: Action-Conditioned
  Contrastive Policy Pretraining
Learning to Drive by Watching YouTube Videos: Action-Conditioned Contrastive Policy Pretraining
Qihang Zhang
Zhenghao Peng
Bolei Zhou
SSL
30
38
0
05 Apr 2022
Jump-Start Reinforcement Learning
Jump-Start Reinforcement Learning
Ikechukwu Uchendu
Ted Xiao
Yao Lu
Banghua Zhu
Mengyuan Yan
...
Chuyuan Fu
Cong Ma
Jiantao Jiao
Sergey Levine
Karol Hausman
OffRL
OnRL
44
109
0
05 Apr 2022
RL4ReAl: Reinforcement Learning for Register Allocation
RL4ReAl: Reinforcement Learning for Register Allocation
S. VenkataKeerthy
Siddhartha Jain
Anilava Kundu
Rohit Aggarwal
Albert Cohen
Ramakrishna Upadrasta
OffRL
41
5
0
05 Apr 2022
Coarse-to-Fine Q-attention with Learned Path Ranking
Coarse-to-Fine Q-attention with Learned Path Ranking
Stephen James
Pieter Abbeel
32
15
0
04 Apr 2022
Asynchronous Reinforcement Learning for Real-Time Control of Physical
  Robots
Asynchronous Reinforcement Learning for Real-Time Control of Physical Robots
Yufeng Yuan
Rupam Mahmood
OffRL
36
19
0
23 Mar 2022
Bi-Manual Manipulation and Attachment via Sim-to-Real Reinforcement
  Learning
Bi-Manual Manipulation and Attachment via Sim-to-Real Reinforcement Learning
Satoshi Kataoka
Seyed Kamyar Seyed Ghasemipour
Daniel Freeman
Igor Mordatch
24
19
0
15 Mar 2022
Vision-Based Manipulators Need to Also See from Their Hands
Vision-Based Manipulators Need to Also See from Their Hands
Kyle Hsu
Moo Jin Kim
Rafael Rafailov
Jiajun Wu
Chelsea Finn
37
45
0
15 Mar 2022
Blocks Assemble! Learning to Assemble with Large-Scale Structured
  Reinforcement Learning
Blocks Assemble! Learning to Assemble with Large-Scale Structured Reinforcement Learning
Seyed Kamyar Seyed Ghasemipour
Daniel Freeman
Byron David
S. Gu
Satoshi Kataoka
Igor Mordatch
OffRL
32
25
0
15 Mar 2022
Masked Visual Pre-training for Motor Control
Masked Visual Pre-training for Motor Control
Tete Xiao
Ilija Radosavovic
Trevor Darrell
Jitendra Malik
SSL
34
242
0
11 Mar 2022
Temporal Difference Learning for Model Predictive Control
Temporal Difference Learning for Model Predictive Control
Nicklas Hansen
Xiaolong Wang
H. Su
PINN
MU
41
226
0
09 Mar 2022
Investigation of Factorized Optical Flows as Mid-Level Representations
Investigation of Factorized Optical Flows as Mid-Level Representations
Hsuan-Kung Yang
Tsu-Ching Hsiao
Tingbo Liao
Hsu-Shen Liu
Li-Yuan Tsao
Tzu-Wen Wang
Shan Yang
Yu-Wen Chen
Huang-ru Liao
Chun-Yi Lee
35
3
0
09 Mar 2022
On-Robot Learning With Equivariant Models
On-Robot Learning With Equivariant Models
Dian Wang
Ming Jia
Xu Zhu
Robin Walters
Robert W. Platt
OffRL
SSL
33
36
0
09 Mar 2022
All You Need is LUV: Unsupervised Collection of Labeled Images using
  Invisible UV Fluorescent Indicators
All You Need is LUV: Unsupervised Collection of Labeled Images using Invisible UV Fluorescent Indicators
Brijen Thananjeyan
Justin Kerr
Huang Huang
Joseph E. Gonzalez
Ken Goldberg
34
9
0
09 Mar 2022
$\mathrm{SO}(2)$-Equivariant Reinforcement Learning
SO(2)\mathrm{SO}(2)SO(2)-Equivariant Reinforcement Learning
Dian Wang
Robin Walters
Robert W. Platt
30
80
0
08 Mar 2022
Kubric: A scalable dataset generator
Kubric: A scalable dataset generator
Klaus Greff
Francois Belletti
Lucas Beyer
Carl Doersch
Yilun Du
...
Ziyu Wang
Tianhao Wu
K. M. Yi
Fangcheng Zhong
Andrea Tagliasacchi
50
250
0
07 Mar 2022
Cloud-Edge Training Architecture for Sim-to-Real Deep Reinforcement
  Learning
Cloud-Edge Training Architecture for Sim-to-Real Deep Reinforcement Learning
Hongpeng Cao
Mirco Theile
Federico G. Wyrwal
Marco Caccamo
43
6
0
04 Mar 2022
GraspARL: Dynamic Grasping via Adversarial Reinforcement Learning
GraspARL: Dynamic Grasping via Adversarial Reinforcement Learning
Tianhao Wu
Fangwei Zhong
Yiran Geng
Hongchen Wang
Yongjian Zhu
Yizhou Wang
Hao Dong
27
8
0
04 Mar 2022
Task-grasping from human demonstration
Task-grasping from human demonstration
Daichi Saito
Kazuhiro Sasabuchi
Naoki Wake
Jun Takamatsu
Hideki Koike
Katsushi Ikeuchi
36
8
0
01 Mar 2022
Learning Transferable Reward for Query Object Localization with Policy
  Adaptation
Learning Transferable Reward for Query Object Localization with Policy Adaptation
Tingfeng Li
Shaobo Han
Martin Renqiang Min
Dimitris N. Metaxas
35
1
0
24 Feb 2022
ReorientBot: Learning Object Reorientation for Specific-Posed Placement
ReorientBot: Learning Object Reorientation for Specific-Posed Placement
Kentaro Wada
Stephen James
Andrew J. Davison
34
29
0
22 Feb 2022
Don't Touch What Matters: Task-Aware Lipschitz Data Augmentation for
  Visual Reinforcement Learning
Don't Touch What Matters: Task-Aware Lipschitz Data Augmentation for Visual Reinforcement Learning
Zhecheng Yuan
Guozheng Ma
Yao Mu
Bo Xia
Bo Yuan
Xueqian Wang
Ping Luo
Huazhe Xu
33
29
0
21 Feb 2022
Supported Policy Optimization for Offline Reinforcement Learning
Supported Policy Optimization for Offline Reinforcement Learning
Jialong Wu
Haixu Wu
Zihan Qiu
Jianmin Wang
Mingsheng Long
OffRL
35
65
0
13 Feb 2022
Uncovering Instabilities in Variational-Quantum Deep Q-Networks
Uncovering Instabilities in Variational-Quantum Deep Q-Networks
Maja Franz
Lucas Wolf
Maniraman Periyasamy
Christian Ufrecht
Daniel D. Scherer
Axel Plinge
Christopher Mutschler
Wolfgang Mauerer
36
29
0
10 Feb 2022
Malleable Agents for Re-Configurable Robotic Manipulators
Malleable Agents for Re-Configurable Robotic Manipulators
Athindran Ramesh Kumar
Gurudutt Hosangadi
32
0
0
04 Feb 2022
BC-Z: Zero-Shot Task Generalization with Robotic Imitation Learning
BC-Z: Zero-Shot Task Generalization with Robotic Imitation Learning
Eric Jang
A. Irpan
Mohi Khansari
Daniel Kappler
F. Ebert
Corey Lynch
Sergey Levine
Chelsea Finn
LM&Ro
72
520
0
04 Feb 2022
DexVIP: Learning Dexterous Grasping with Human Hand Pose Priors from
  Video
DexVIP: Learning Dexterous Grasping with Human Hand Pose Priors from Video
Priyanka Mandikal
Kristen Grauman
143
93
0
01 Feb 2022
Efficient Embedding of Semantic Similarity in Control Policies via
  Entangled Bisimulation
Efficient Embedding of Semantic Similarity in Control Policies via Entangled Bisimulation
Martín Bertrán
Walter A. Talbott
Nitish Srivastava
J. Susskind
45
3
0
28 Jan 2022
Can Wikipedia Help Offline Reinforcement Learning?
Can Wikipedia Help Offline Reinforcement Learning?
Machel Reid
Yutaro Yamada
S. Gu
3DV
RALM
OffRL
140
95
0
28 Jan 2022
Accelerating Representation Learning with View-Consistent Dynamics in
  Data-Efficient Reinforcement Learning
Accelerating Representation Learning with View-Consistent Dynamics in Data-Efficient Reinforcement Learning
Tao Huang
Jiacheng Wang
Xiao Chen
39
4
0
18 Jan 2022
ValueNetQP: Learned one-step optimal control for legged locomotion
ValueNetQP: Learned one-step optimal control for legged locomotion
Julian Viereck
Avadesh Meduri
Ludovic Righetti
19
8
0
11 Jan 2022
Nearly Optimal Policy Optimization with Stable at Any Time Guarantee
Nearly Optimal Policy Optimization with Stable at Any Time Guarantee
Tianhao Wu
Yunchang Yang
Han Zhong
Liwei Wang
S. Du
Jiantao Jiao
55
14
0
21 Dec 2021
Towards Disturbance-Free Visual Mobile Manipulation
Towards Disturbance-Free Visual Mobile Manipulation
Tianwei Ni
Kiana Ehsani
Luca Weihs
Jordi Salvador
28
9
0
17 Dec 2021
Learning over All Stabilizing Nonlinear Controllers for a
  Partially-Observed Linear System
Learning over All Stabilizing Nonlinear Controllers for a Partially-Observed Linear System
Ruigang Wang
Nicholas H. Barbara
Max Revay
I. Manchester
19
16
0
08 Dec 2021
DemoGrasp: Few-Shot Learning for Robotic Grasping with Human
  Demonstration
DemoGrasp: Few-Shot Learning for Robotic Grasping with Human Demonstration
Pengyuan Wang
Fabian Manhardt
Luca Minciullo
Lorenzo Garattoni
Sven Meie
Nassir Navab
Benjamin Busam
37
34
0
06 Dec 2021
Tool as Embodiment for Recursive Manipulation
Tool as Embodiment for Recursive Manipulation
Yukiyasu Noguchi
T. Matsushima
Y. Matsuo
S. Gu
38
7
0
01 Dec 2021
Offline Reinforcement Learning: Fundamental Barriers for Value Function
  Approximation
Offline Reinforcement Learning: Fundamental Barriers for Value Function Approximation
Dylan J. Foster
A. Krishnamurthy
D. Simchi-Levi
Yunzong Xu
OffRL
21
62
0
21 Nov 2021
Previous
12345678
Next