Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2501.14513
Cited By
v1
v2 (latest)
ABPT: Amended Backpropagation through Time with Partially Differentiable Rewards
24 January 2025
Fanxing Li
Fangyu Sun
Tianbao Zhang
Danping Zou
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"ABPT: Amended Backpropagation through Time with Partially Differentiable Rewards"
27 / 27 papers shown
Title
Seeing Through Pixel Motion: Learning Obstacle Avoidance from Optical Flow with One Camera
Yu Hu
Yuang Zhang
Yunlong Song
Yang Deng
Feng Yu
Linzuo Zhang
Weiyao Lin
Danping Zou
Wenxian Yu
107
5
0
07 Nov 2024
VisFly: An Efficient and Versatile Simulator for Training Vision-based Flight
Fanxing Li
Fangyu Sun
Tianbao Zhang
Danping Zou
106
3
0
20 Jul 2024
Back to Newton's Laws: Learning Vision-based Agile Flight via Differentiable Physics
Yuang Zhang
Yu Hu
Yunlong Song
Danping Zou
Weiyao Lin
98
20
0
15 Jul 2024
Learning Quadruped Locomotion Using Differentiable Simulation
Yunlong Song
Sangbae Kim
Davide Scaramuzza
110
16
0
21 Mar 2024
Mastering Diverse Domains through World Models
Danijar Hafner
J. Pašukonis
Jimmy Ba
Timothy Lillicrap
85
616
0
10 Jan 2023
SAM-RL: Sensing-Aware Model-Based Reinforcement Learning via Differentiable Physics-Based Simulation and Rendering
Jun Lv
Yunhai Feng
Cheng Zhang
Shu Zhao
Lin Shao
Cewu Lu
69
26
0
27 Oct 2022
Training Efficient Controllers via Analytic Policy Gradient
Nina Wiedemann
Valentin Wüest
Antonio Loquercio
M. Müller
Dario Floreano
Davide Scaramuzza
OffRL
74
20
0
26 Sep 2022
Accelerated Policy Learning with Parallel Differentiable Simulation
Jie Xu
Viktor Makoviychuk
Yashraj S. Narang
Fabio Ramos
Wojciech Matusik
Animesh Garg
Miles Macklin
72
95
0
14 Apr 2022
Dojo: A Differentiable Physics Engine for Robotics
Taylor A. Howell
Simon Le Cleac'h
Jan Brüdigam
J. Zico Kolter
Mac Schwager
Zachary Manchester
85
36
0
02 Mar 2022
Do Differentiable Simulators Give Better Policy Gradients?
H.J. Terry Suh
Max Simchowitz
Kai Zhang
Russ Tedrake
70
101
0
02 Feb 2022
Learning High-Speed Flight in the Wild
Antonio Loquercio
Elia Kaufmann
René Ranftl
Matthias Muller
V. Koltun
Davide Scaramuzza
106
302
0
11 Oct 2021
Brax -- A Differentiable Physics Engine for Large Scale Rigid Body Simulation
C. Freeman
Erik Frey
Anton Raichuk
Sertan Girgin
Igor Mordatch
Olivier Bachem
111
380
0
24 Jun 2021
NeuralSim: Augmenting Differentiable Simulators with Neural Networks
Eric Heiden
David Millard
Erwin Coumans
Yizhou Sheng
Gaurav Sukhatme
67
140
0
09 Nov 2020
Decision-Making with Auto-Encoding Variational Bayes
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
550
10,591
0
17 Feb 2020
Dream to Control: Learning Behaviors by Latent Imagination
Danijar Hafner
Timothy Lillicrap
Jimmy Ba
Mohammad Norouzi
VLM
140
1,374
0
03 Dec 2019
DiffTaichi: Differentiable Programming for Physical Simulation
Yuanming Hu
Luke Anderson
Tzu-Mao Li
Qi Sun
N. Carr
Jonathan Ragan-Kelley
F. Durand
75
388
0
01 Oct 2019
Deep Drone Racing: From Simulation to Reality with Domain Randomization
Antonio Loquercio
Elia Kaufmann
René Ranftl
Alexey Dosovitskiy
V. Koltun
Davide Scaramuzza
75
212
0
21 May 2019
Habitat: A Platform for Embodied AI Research
Manolis Savva
Abhishek Kadian
Oleksandr Maksymets
Yili Zhao
Erik Wijmans
...
Jia-Wei Liu
V. Koltun
Jitendra Malik
Devi Parikh
Dhruv Batra
LM&Ro
129
1,423
0
02 Apr 2019
Deep Drone Racing: Learning Agile Flight in Dynamic Environments
Elia Kaufmann
Antonio Loquercio
René Ranftl
Alexey Dosovitskiy
V. Koltun
Davide Scaramuzza
61
135
0
22 Jun 2018
Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models
Kurtland Chua
Roberto Calandra
R. McAllister
Sergey Levine
BDL
230
1,284
0
30 May 2018
Addressing Function Approximation Error in Actor-Critic Methods
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
198
5,226
0
26 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
317
8,432
0
04 Jan 2018
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
583
19,315
0
20 Jul 2017
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
332
13,295
0
09 Sep 2015
Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images
Manuel Watter
Jost Tobias Springenberg
Joschka Boedecker
Martin Riedmiller
BDL
103
847
0
24 Jun 2015
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
283
6,807
0
19 Feb 2015
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
132
12,272
0
19 Dec 2013
1