ABPT: Amended Backpropagation through Time with Partially Differentiable Rewards

24 January 2025
Fanxing Li
Fangyu Sun
Tianbao Zhang
Danping Zou

Papers citing "ABPT: Amended Backpropagation through Time with Partially Differentiable Rewards"

27 / 27 papers shown
Seeing Through Pixel Motion: Learning Obstacle Avoidance from Optical Flow with One Camera
Yu Hu
Yuang Zhang
Yunlong Song
Yang Deng
Feng Yu
Linzuo Zhang
Weiyao Lin
Danping Zou
Wenxian Yu
107
5
0
07 Nov 2024
VisFly: An Efficient and Versatile Simulator for Training Vision-based Flight
Fanxing Li
Fangyu Sun
Tianbao Zhang
Danping Zou
106
3
0
20 Jul 2024
Back to Newton's Laws: Learning Vision-based Agile Flight via Differentiable Physics
Yuang Zhang
Yu Hu
Yunlong Song
Danping Zou
Weiyao Lin
98
20
0
15 Jul 2024
Learning Quadruped Locomotion Using Differentiable Simulation
Yunlong Song
Sangbae Kim
Davide Scaramuzza
110
16
0
21 Mar 2024
Mastering Diverse Domains through World Models
Danijar Hafner
J. Pašukonis
Jimmy Ba
Timothy Lillicrap
85
616
0
10 Jan 2023
SAM-RL: Sensing-Aware Model-Based Reinforcement Learning via Differentiable Physics-Based Simulation and Rendering
Jun Lv
Yunhai Feng
Cheng Zhang
Shu Zhao
Lin Shao
Cewu Lu
69
26
0
27 Oct 2022
Training Efficient Controllers via Analytic Policy Gradient
Nina Wiedemann
Valentin Wüest
Antonio Loquercio
M. Müller
Dario Floreano
Davide Scaramuzza
OffRL
74
20
0
26 Sep 2022
Accelerated Policy Learning with Parallel Differentiable Simulation
Jie Xu
Viktor Makoviychuk
Yashraj S. Narang
Fabio Ramos
Wojciech Matusik
Animesh Garg
Miles Macklin
72
95
0
14 Apr 2022
Dojo: A Differentiable Physics Engine for Robotics
Taylor A. Howell
Simon Le Cleac'h
Jan Brüdigam
J. Zico Kolter
Mac Schwager
Zachary Manchester
85
36
0
02 Mar 2022
Do Differentiable Simulators Give Better Policy Gradients?
H.J. Terry Suh
Max Simchowitz
Kai Zhang
Russ Tedrake
70
101
0
02 Feb 2022
Learning High-Speed Flight in the Wild
Antonio Loquercio
Elia Kaufmann
René Ranftl
Matthias Muller
V. Koltun
Davide Scaramuzza
106
302
0
11 Oct 2021
Brax -- A Differentiable Physics Engine for Large Scale Rigid Body Simulation
C. Freeman
Erik Frey
Anton Raichuk
Sertan Girgin
Igor Mordatch
Olivier Bachem
111
380
0
24 Jun 2021
NeuralSim: Augmenting Differentiable Simulators with Neural Networks
Eric Heiden
David Millard
Erwin Coumans
Yizhou Sheng
Gaurav Sukhatme
67
140
0
09 Nov 2020
Decision-Making with Auto-Encoding Variational Bayes
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
550
10,591
0
17 Feb 2020
Dream to Control: Learning Behaviors by Latent Imagination
Danijar Hafner
Timothy Lillicrap
Jimmy Ba
Mohammad Norouzi
VLM
140
1,374
0
03 Dec 2019
DiffTaichi: Differentiable Programming for Physical Simulation
Yuanming Hu
Luke Anderson
Tzu-Mao Li
Qi Sun
N. Carr
Jonathan Ragan-Kelley
F. Durand
75
388
0
01 Oct 2019
Deep Drone Racing: From Simulation to Reality with Domain Randomization
Antonio Loquercio
Elia Kaufmann
René Ranftl
Alexey Dosovitskiy
V. Koltun
Davide Scaramuzza
75
212
0
21 May 2019
Habitat: A Platform for Embodied AI Research
Manolis Savva
Abhishek Kadian
Oleksandr Maksymets
Yili Zhao
Erik Wijmans
...
Jia-Wei Liu
V. Koltun
Jitendra Malik
Devi Parikh
Dhruv Batra
LM&Ro
129
1,423
0
02 Apr 2019
Deep Drone Racing: Learning Agile Flight in Dynamic Environments
Elia Kaufmann
Antonio Loquercio
René Ranftl
Alexey Dosovitskiy
V. Koltun
Davide Scaramuzza
61
135
0
22 Jun 2018
Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models
Kurtland Chua
Roberto Calandra
R. McAllister
Sergey Levine
BDL
230
1,284
0
30 May 2018
Addressing Function Approximation Error in Actor-Critic Methods
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
198
5,226
0
26 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
317
8,432
0
04 Jan 2018
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
583
19,315
0
20 Jul 2017
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
332
13,295
0
09 Sep 2015
Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images
Manuel Watter
Jost Tobias Springenberg
Joschka Boedecker
Martin Riedmiller
BDL
103
847
0
24 Jun 2015
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
283
6,807
0
19 Feb 2015
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
132
12,272
0
19 Dec 2013