2410.09286
Language-Model-Assisted Bi-Level Programming for Reward Learning from Internet Videos
11 October 2024
Harsh Mahesheka
Zhixian Xie
Ziyi Wang
Wanxin Jin
Papers citing
"Language-Model-Assisted Bi-Level Programming for Reward Learning from Internet Videos"
20 / 20 papers shown
Title
ExploRLLM: Guiding Exploration in Reinforcement Learning with Large Language Models
Runyu Ma
Jelle Luijkx
Zlatan Ajanović
Jens Kober
LM&Ro
LRM
75
9
0
14 Mar 2024
"Task Success" is not Enough: Investigating the Use of Video-Language Models as Behavior Critics for Catching Undesirable Agent Behaviors
L. Guan
Yifan Zhou
Denis Liu
Yantian Zha
H. B. Amor
Subbarao Kambhampati
LM&Ro
54
16
0
06 Feb 2024
Learning Reward for Physical Skills using Large Language Model
Yuwei Zeng
Yiqing Xu
46
6
0
21 Oct 2023
Language to Rewards for Robotic Skill Synthesis
Wenhao Yu
Nimrod Gileadi
Chuyuan Fu
Sean Kirmani
Kuang-Huei Lee
...
N. Heess
Dorsa Sadigh
Jie Tan
Yuval Tassa
F. Xia
LM&Ro
81
279
0
14 Jun 2023
Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
Cheng Chi
Zhenjia Xu
S. Feng
Eric A. Cousineau
Yilun Du
Benjamin Burchfiel
Russ Tedrake
Shuran Song
339
1,175
0
07 Mar 2023
Grounded Decoding: Guiding Text Generation with Grounded Models for Embodied Agents
Wenlong Huang
Fei Xia
Dhruv Shah
Danny Driess
Andy Zeng
...
Pete Florence
Igor Mordatch
Sergey Levine
Karol Hausman
Brian Ichter
LM&Ro
70
48
0
01 Mar 2023
ProgPrompt: Generating Situated Robot Task Plans using Large Language Models
Ishika Singh
Valts Blukis
Arsalan Mousavian
Ankit Goyal
Danfei Xu
Jonathan Tremblay
Dieter Fox
Jesse Thomason
Animesh Garg
LM&Ro
LLMAG
158
654
0
22 Sep 2022
Human-to-Robot Imitation in the Wild
Shikhar Bahl
Abhi Gupta
Deepak Pathak
81
169
0
19 Jul 2022
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances
Michael Ahn
Anthony Brohan
Noah Brown
Yevgen Chebotar
Omar Cortes
...
Ted Xiao
Peng Xu
Sichun Xu
Mengyuan Yan
Andy Zeng
LM&Ro
179
1,951
0
04 Apr 2022
Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language
Andy Zeng
Maria Attarian
Brian Ichter
K. Choromanski
Adrian S. Wong
...
Michael S. Ryoo
Vikas Sindhwani
Johnny Lee
Vincent Vanhoucke
Peter R. Florence
ReLM
LRM
142
582
0
01 Apr 2022
Imitate and Repurpose: Learning Reusable Robot Movement Skills From Human and Animal Behaviors
Steven Bohez
S. Tunyasuvunakool
Philemon Brakel
Fereshteh Sadeghi
Leonard Hasenclever
...
Nathan Batchelor
Federico Casarini
J. Merel
R. Hadsell
N. Heess
77
51
0
31 Mar 2022
Urban Driver: Learning to Drive from Real-world Demonstrations Using Policy Gradients
Oliver Scheel
Luca Bergamini
Maciej Wołczyk
Błażej Osiński
Peter Ondruska
63
109
0
27 Sep 2021
CLIPort: What and Where Pathways for Robotic Manipulation
Mohit Shridhar
Lucas Manuelli
Dieter Fox
LM&Ro
106
650
0
24 Sep 2021
Isaac Gym: High Performance GPU-Based Physics Simulation For Robot Learning
Viktor Makoviychuk
Lukasz Wawrzyniak
Yunrong Guo
Michelle Lu
Kier Storey
...
David Hoeller
Nikita Rudin
Arthur Allshire
Ankur Handa
Gavriel State
165
1,072
0
24 Aug 2021
f-IRL: Inverse Reinforcement Learning via State Marginal Matching
Tianwei Ni
Harshit S. Sikchi
Yufei Wang
Tejus Gupta
Lisa Lee
Benjamin Eysenbach
75
73
0
09 Nov 2020
Model-Based Inverse Reinforcement Learning from Visual Demonstrations
Neha Das
Sarah Bechtle
Todor Davchev
Dinesh Jayaraman
Akshara Rai
Franziska Meier
166
84
0
18 Oct 2020
Learning Agile Robotic Locomotion Skills by Imitating Animals
Xue Bin Peng
Erwin Coumans
Tingnan Zhang
T. Lee
Jie Tan
Sergey Levine
115
504
0
02 Apr 2020
Markerless tracking of user-defined features with deep learning
Alexander Mathis
Pranav Mamidanna
Taiga Abe
Kevin M. Cury
V. Murthy
Mackenzie W. Mathis
Matthias Bethge
42
3,352
0
09 Apr 2018
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
478
19,019
0
20 Jul 2017
Imitation from Observation: Learning to Imitate Behaviors from Raw Video via Context Translation
YuXuan Liu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
106
380
0
11 Jul 2017