ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.09286
  4. Cited By
Language-Model-Assisted Bi-Level Programming for Reward Learning from
  Internet Videos

Language-Model-Assisted Bi-Level Programming for Reward Learning from Internet Videos

11 October 2024
Harsh Mahesheka
Zhixian Xie
Ziyi Wang
Wanxin Jin
ArXivPDFHTML

Papers citing "Language-Model-Assisted Bi-Level Programming for Reward Learning from Internet Videos"

20 / 20 papers shown
Title
ExploRLLM: Guiding Exploration in Reinforcement Learning with Large Language Models
ExploRLLM: Guiding Exploration in Reinforcement Learning with Large Language Models
Runyu Ma
Jelle Luijkx
Zlatan Ajanović
Jens Kober
LM&Ro
LRM
75
9
0
14 Mar 2024
"Task Success" is not Enough: Investigating the Use of Video-Language
  Models as Behavior Critics for Catching Undesirable Agent Behaviors
"Task Success" is not Enough: Investigating the Use of Video-Language Models as Behavior Critics for Catching Undesirable Agent Behaviors
L. Guan
Yifan Zhou
Denis Liu
Yantian Zha
H. B. Amor
Subbarao Kambhampati
LM&Ro
54
16
0
06 Feb 2024
Learning Reward for Physical Skills using Large Language Model
Learning Reward for Physical Skills using Large Language Model
Yuwei Zeng
Yiqing Xu
46
6
0
21 Oct 2023
Language to Rewards for Robotic Skill Synthesis
Language to Rewards for Robotic Skill Synthesis
Wenhao Yu
Nimrod Gileadi
Chuyuan Fu
Sean Kirmani
Kuang-Huei Lee
...
N. Heess
Dorsa Sadigh
Jie Tan
Yuval Tassa
F. Xia
LM&Ro
81
279
0
14 Jun 2023
Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
Cheng Chi
Zhenjia Xu
S. Feng
Eric A. Cousineau
Yilun Du
Benjamin Burchfiel
Russ Tedrake
Shuran Song
339
1,175
0
07 Mar 2023
Grounded Decoding: Guiding Text Generation with Grounded Models for
  Embodied Agents
Grounded Decoding: Guiding Text Generation with Grounded Models for Embodied Agents
Wenlong Huang
Fei Xia
Dhruv Shah
Danny Driess
Andy Zeng
...
Pete Florence
Igor Mordatch
Sergey Levine
Karol Hausman
Brian Ichter
LM&Ro
70
48
0
01 Mar 2023
ProgPrompt: Generating Situated Robot Task Plans using Large Language
  Models
ProgPrompt: Generating Situated Robot Task Plans using Large Language Models
Ishika Singh
Valts Blukis
Arsalan Mousavian
Ankit Goyal
Danfei Xu
Jonathan Tremblay
Dieter Fox
Jesse Thomason
Animesh Garg
LM&Ro
LLMAG
158
654
0
22 Sep 2022
Human-to-Robot Imitation in the Wild
Human-to-Robot Imitation in the Wild
Shikhar Bahl
Abhi Gupta
Deepak Pathak
81
169
0
19 Jul 2022
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances
Michael Ahn
Anthony Brohan
Noah Brown
Yevgen Chebotar
Omar Cortes
...
Ted Xiao
Peng Xu
Sichun Xu
Mengyuan Yan
Andy Zeng
LM&Ro
179
1,951
0
04 Apr 2022
Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language
Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language
Andy Zeng
Maria Attarian
Brian Ichter
K. Choromanski
Adrian S. Wong
...
Michael S. Ryoo
Vikas Sindhwani
Johnny Lee
Vincent Vanhoucke
Peter R. Florence
ReLM
LRM
142
582
0
01 Apr 2022
Imitate and Repurpose: Learning Reusable Robot Movement Skills From
  Human and Animal Behaviors
Imitate and Repurpose: Learning Reusable Robot Movement Skills From Human and Animal Behaviors
Steven Bohez
S. Tunyasuvunakool
Philemon Brakel
Fereshteh Sadeghi
Leonard Hasenclever
...
Nathan Batchelor
Federico Casarini
J. Merel
R. Hadsell
N. Heess
77
51
0
31 Mar 2022
Urban Driver: Learning to Drive from Real-world Demonstrations Using
  Policy Gradients
Urban Driver: Learning to Drive from Real-world Demonstrations Using Policy Gradients
Oliver Scheel
Luca Bergamini
Maciej Wołczyk
Bla.zej Osiñski
Peter Ondruska
63
109
0
27 Sep 2021
CLIPort: What and Where Pathways for Robotic Manipulation
CLIPort: What and Where Pathways for Robotic Manipulation
Mohit Shridhar
Lucas Manuelli
Dieter Fox
LM&Ro
106
650
0
24 Sep 2021
Isaac Gym: High Performance GPU-Based Physics Simulation For Robot
  Learning
Isaac Gym: High Performance GPU-Based Physics Simulation For Robot Learning
Viktor Makoviychuk
Lukasz Wawrzyniak
Yunrong Guo
Michelle Lu
Kier Storey
...
David Hoeller
Nikita Rudin
Arthur Allshire
Ankur Handa
Gavriel State
165
1,072
0
24 Aug 2021
f-IRL: Inverse Reinforcement Learning via State Marginal Matching
f-IRL: Inverse Reinforcement Learning via State Marginal Matching
Tianwei Ni
Harshit S. Sikchi
Yufei Wang
Tejus Gupta
Lisa Lee
Benjamin Eysenbach
75
73
0
09 Nov 2020
Model-Based Inverse Reinforcement Learning from Visual Demonstrations
Model-Based Inverse Reinforcement Learning from Visual Demonstrations
Neha Das
Sarah Bechtle
Todor Davchev
Dinesh Jayaraman
Akshara Rai
Franziska Meier
166
84
0
18 Oct 2020
Learning Agile Robotic Locomotion Skills by Imitating Animals
Learning Agile Robotic Locomotion Skills by Imitating Animals
Xue Bin Peng
Erwin Coumans
Tingnan Zhang
T. Lee
Jie Tan
Sergey Levine
115
504
0
02 Apr 2020
Markerless tracking of user-defined features with deep learning
Markerless tracking of user-defined features with deep learning
Alexander Mathis
Pranav Mamidanna
Taiga Abe
Kevin M. Cury
V. Murthy
Mackenzie W. Mathis
Matthias Bethge
42
3,352
0
09 Apr 2018
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
478
19,019
0
20 Jul 2017
Imitation from Observation: Learning to Imitate Behaviors from Raw Video
  via Context Translation
Imitation from Observation: Learning to Imitate Behaviors from Raw Video via Context Translation
YuXuan Liu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
106
380
0
11 Jul 2017
1