ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1801.07292
  4. Cited By
Convergence of Value Aggregation for Imitation Learning

Convergence of Value Aggregation for Imitation Learning

22 January 2018
Ching-An Cheng
Byron Boots
ArXivPDFHTML

Papers citing "Convergence of Value Aggregation for Imitation Learning"

9 / 9 papers shown
Title
Introduction to Online Convex Optimization
Introduction to Online Convex Optimization
Elad Hazan
OffRL
71
1,919
0
07 Sep 2019
Agile Autonomous Driving using End-to-End Deep Imitation Learning
Agile Autonomous Driving using End-to-End Deep Imitation Learning
Yunpeng Pan
Ching-An Cheng
Kamil Saigol
Keuntaek Lee
Xinyan Yan
Evangelos Theodorou
Byron Boots
53
54
0
21 Sep 2017
Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential
  Prediction
Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction
Wen Sun
Arun Venkatraman
Geoffrey J. Gordon
Byron Boots
J. Andrew Bagnell
95
235
0
03 Mar 2017
Comparing Human-Centric and Robot-Centric Sampling for Robot Deep
  Learning from Demonstrations
Comparing Human-Centric and Robot-Centric Sampling for Robot Deep Learning from Demonstrations
Michael Laskey
Caleb Chuck
Jonathan Lee
Jeffrey Mahler
S. Krishnan
Kevin Jamieson
Anca Dragan
Ken Goldberg
27
74
0
04 Oct 2016
Reinforcement and Imitation Learning via Interactive No-Regret Learning
Reinforcement and Imitation Learning via Interactive No-Regret Learning
Stéphane Ross
J. Andrew Bagnell
OffRL
63
262
0
23 Jun 2014
Playing Atari with Deep Reinforcement Learning
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
73
12,163
0
19 Dec 2013
Learning Monocular Reactive UAV Control in Cluttered Natural
  Environments
Learning Monocular Reactive UAV Control in Cluttered Natural Environments
Stéphane Ross
Narek Melik-Barkhudarov
Kumar Shaurya Shankar
Andreas Wendel
Debadeepta Dey
J. Andrew Bagnell
M. Hebert
57
437
0
07 Nov 2012
Online Learning with Predictable Sequences
Online Learning with Predictable Sequences
Alexander Rakhlin
Karthik Sridharan
89
355
0
18 Aug 2012
A Reduction of Imitation Learning and Structured Prediction to No-Regret
  Online Learning
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
Stéphane Ross
Geoffrey J. Gordon
J. Andrew Bagnell
OffRL
119
3,196
0
02 Nov 2010
1