ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2502.03369
  4. Cited By
Learning from Active Human Involvement through Proxy Value Propagation

Learning from Active Human Involvement through Proxy Value Propagation

5 February 2025
Zhenghao Peng
Wenjie Mo
Chenda Duan
Quanyi Li
Bolei Zhou
ArXivPDFHTML

Papers citing "Learning from Active Human Involvement through Proxy Value Propagation"

13 / 13 papers shown
Title
HACTS: a Human-As-Copilot Teleoperation System for Robot Learning
HACTS: a Human-As-Copilot Teleoperation System for Robot Learning
Z. Xu
Yinuo Zhao
Kun Wu
Ning Liu
Junjie Ji
Zhengping Che
C. Liu
Jian Tang
47
0
0
31 Mar 2025
High-Precision Transformer-Based Visual Servoing for Humanoid Robots in Aligning Tiny Objects
Jialong Xue
Wei Gao
Yu Wang
Chao Ji
Dongdong Zhao
Shi Yan
Shiwu Zhang
45
0
0
06 Mar 2025
Reinforcement Learning From Imperfect Corrective Actions And Proxy
  Rewards
Reinforcement Learning From Imperfect Corrective Actions And Proxy Rewards
Zhaohui Jiang
Xuening Feng
Paul Weng
Yifei Zhu
Yan Song
Tianze Zhou
Yujing Hu
Tangjie Lv
Changjie Fan
41
0
0
08 Oct 2024
Shared Autonomy with IDA: Interventional Diffusion Assistance
Shared Autonomy with IDA: Interventional Diffusion Assistance
Brandon J. McMahan
Zhenghao Peng
Bolei Zhou
Jonathan C. Kao
20
1
0
05 Sep 2024
Trustworthy Human-AI Collaboration: Reinforcement Learning with Human
  Feedback and Physics Knowledge for Safe Autonomous Driving
Trustworthy Human-AI Collaboration: Reinforcement Learning with Human Feedback and Physics Knowledge for Safe Autonomous Driving
Zilin Huang
Zihao Sheng
Sikai Chen
31
4
0
01 Sep 2024
MEReQ: Max-Ent Residual-Q Inverse RL for Sample-Efficient Alignment from
  Intervention
MEReQ: Max-Ent Residual-Q Inverse RL for Sample-Efficient Alignment from Intervention
Yuxin Chen
Chen Tang
Chenran Li
Ran Tian
Peter Stone
M. Tomizuka
Wei Zhan
23
1
0
24 Jun 2024
TRANSIC: Sim-to-Real Policy Transfer by Learning from Online Correction
TRANSIC: Sim-to-Real Policy Transfer by Learning from Online Correction
Yunfan Jiang
Chen Wang
Ruohan Zhang
Jiajun Wu
Fei-Fei Li
OnRL
37
26
0
16 May 2024
DexCap: Scalable and Portable Mocap Data Collection System for Dexterous
  Manipulation
DexCap: Scalable and Portable Mocap Data Collection System for Dexterous Manipulation
Chen Wang
Haochen Shi
Weizhuo Wang
Ruohan Zhang
Fei-Fei Li
Karen Liu
52
103
0
12 Mar 2024
RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented
  In-Context Learning in Multi-Modal Large Language Model
RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language Model
Jianhao Yuan
Shuyang Sun
Daniel Omeiza
Bo-Lu Zhao
Paul Newman
Lars Kunze
Matthew Gadd
LRM
36
48
0
16 Feb 2024
HAIM-DRL: Enhanced Human-in-the-loop Reinforcement Learning for Safe and
  Efficient Autonomous Driving
HAIM-DRL: Enhanced Human-in-the-loop Reinforcement Learning for Safe and Efficient Autonomous Driving
Zilin Huang
Zihao Sheng
Chengyuan Ma
Sikai Chen
22
27
0
06 Jan 2024
Efficient Learning of Safe Driving Policy via Human-AI Copilot
  Optimization
Efficient Learning of Safe Driving Policy via Human-AI Copilot Optimization
Quanyi Li
Zhenghao Peng
Bolei Zhou
77
35
0
17 Feb 2022
DeepTake: Prediction of Driver Takeover Behavior using Multimodal Data
DeepTake: Prediction of Driver Takeover Behavior using Multimodal Data
Erfan Pakdamanian
Shili Sheng
Sonia Baee
Seongkook Heo
Sarit Kraus
Lu Feng
60
71
0
31 Dec 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on
  Open Problems
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
340
1,955
0
04 May 2020
1