ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2107.05612
  4. Cited By
A Persistent Spatial Semantic Representation for High-level Natural
  Language Instruction Execution
v1v2v3 (latest)

A Persistent Spatial Semantic Representation for High-level Natural Language Instruction Execution

12 July 2021
Valts Blukis
Chris Paxton
Dieter Fox
Animesh Garg
Yoav Artzi
    LM&Ro
ArXiv (abs)PDFHTML

Papers citing "A Persistent Spatial Semantic Representation for High-level Natural Language Instruction Execution"

45 / 45 papers shown
Title
Evaluations at Work: Measuring the Capabilities of GenAI in Use
Evaluations at Work: Measuring the Capabilities of GenAI in Use
Brandon Lepine
Gawesha Weerantunga
Juho Kim
Pamela Mishkin
Matthew Beane
78
0
0
15 May 2025
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models
Yue Zhang
Ziqiao Ma
Jialu Li
Yanyuan Qiao
Zun Wang
J. Chai
Qi Wu
Joey Tianyi Zhou
Parisa Kordjamshidi
LRM
167
24
0
31 Dec 2024
ET tu, CLIP? Addressing Common Object Errors for Unseen Environments
ET tu, CLIP? Addressing Common Object Errors for Unseen Environments
Ye Won Byun
Cathy Jiao
Shahriar Noroozizadeh
Jimin Sun
Rosa Vitiello
VLM
120
1
0
25 Jun 2024
Embodied Instruction Following in Unknown Environments
Embodied Instruction Following in Unknown Environments
Zhenyu Wu
Ziwei Wang
Xiuwei Xu
Hang Yin
Yinan Liang
Angyuan Ma
Jiwen Lu
Haibin Yan
LM&Ro
106
4
0
17 Jun 2024
A Survey on Vision-Language-Action Models for Embodied AI
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
335
54
0
23 May 2024
"Set It Up!": Functional Object Arrangement with Compositional Generative Models
"Set It Up!": Functional Object Arrangement with Compositional Generative Models
Yiqing Xu
Jiayuan Mao
Yilun Du
Tomás Lozano-Pérez
L. Kaelbling
David Hsu
LM&Ro
175
6
0
20 May 2024
ADaPT: As-Needed Decomposition and Planning with Language Models
ADaPT: As-Needed Decomposition and Planning with Language Models
Archiki Prasad
Alexander Koller
Mareike Hartmann
Peter Clark
Ashish Sabharwal
Mohit Bansal
Tushar Khot
LM&Ro
93
93
0
08 Nov 2023
Embodied Task Planning with Large Language Models
Embodied Task Planning with Large Language Models
Zhenyu Wu
Ziwei Wang
Xiuwei Xu
Jiwen Lu
Haibin Yan
LM&RoLLMAG
81
76
0
04 Jul 2023
Plan, Eliminate, and Track -- Language Models are Good Teachers for
  Embodied Agents
Plan, Eliminate, and Track -- Language Models are Good Teachers for Embodied Agents
Yu-Chih Chen
So Yeon Min
Chase Davis
Ruslan Salakhutdinov
A. Azaria
Yuan-Fang Li
Tom Michael Mitchell
A. Bovik
LM&RoLLMAG
157
34
0
03 May 2023
Spatial-Language Attention Policies for Efficient Robot Learning
Spatial-Language Attention Policies for Efficient Robot Learning
Priyam Parashar
Vidhi Jain
Xiaohan Zhang
Jay Vakil
Sam Powers
Yonatan Bisk
Chris Paxton
LM&Ro
80
5
0
21 Apr 2023
Embodied Concept Learner: Self-supervised Learning of Concepts and
  Mapping through Instruction Following
Embodied Concept Learner: Self-supervised Learning of Concepts and Mapping through Instruction Following
Mingyu Ding
Yan Xu
Zhenfang Chen
David D. Cox
Ping Luo
J. Tenenbaum
Chuang Gan
LM&Ro
93
21
0
07 Apr 2023
Natural Language-conditioned Reinforcement Learning with Inside-out Task
  Language Development and Translation
Natural Language-conditioned Reinforcement Learning with Inside-out Task Language Development and Translation
Jing-Cheng Pang
Xinyi Yang
Sibei Yang
Yang Yu
73
9
0
18 Feb 2023
ESC: Exploration with Soft Commonsense Constraints for Zero-shot Object
  Navigation
ESC: Exploration with Soft Commonsense Constraints for Zero-shot Object Navigation
KAI-QING Zhou
Kai Zheng
Connor Pryor
Yilin Shen
Hongxia Jin
Lise Getoor
Xinze Wang
130
118
0
30 Jan 2023
Do Embodied Agents Dream of Pixelated Sheep: Embodied Decision Making
  using Language Guided World Modelling
Do Embodied Agents Dream of Pixelated Sheep: Embodied Decision Making using Language Guided World Modelling
Kolby Nottingham
Prithviraj Ammanabrolu
Alane Suhr
Yejin Choi
Hannaneh Hajishirzi
Sameer Singh
Roy Fox
LLMAGLM&Ro
90
82
0
28 Jan 2023
Don't Generate, Discriminate: A Proposal for Grounding Language Models
  to Real-World Environments
Don't Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments
Yu Gu
Xiang Deng
Yu-Chuan Su
LLMAG
125
58
0
19 Dec 2022
LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large
  Language Models
LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models
Chan Hee Song
Jiaman Wu
Clay Washington
Brian M Sadler
Wei-Lun Chao
Yu-Chuan Su
LLMAGLM&Ro
173
425
0
08 Dec 2022
Sequence-Based Plan Feasibility Prediction for Efficient Task and Motion
  Planning
Sequence-Based Plan Feasibility Prediction for Efficient Task and Motion Planning
Zhutian Yang
Caelan Reed Garrett
Tomás Lozano-Pérez
Leslie Kaelbling
Dieter Fox
122
27
0
03 Nov 2022
DANLI: Deliberative Agent for Following Natural Language Instructions
DANLI: Deliberative Agent for Following Natural Language Instructions
Yichi Zhang
Jianing Yang
Jiayi Pan
Shane Storks
N. Devraj
Ziqiao Ma
Keunwoo Peter Yu
Yuwei Bao
J. Chai
LM&Ro
143
16
0
22 Oct 2022
Retrospectives on the Embodied AI Workshop
Retrospectives on the Embodied AI Workshop
Matt Deitke
Dhruv Batra
Yonatan Bisk
Tommaso Campari
Angel X. Chang
...
Jesse Thomason
Alexander Toshev
Joanne Truong
Luca Weihs
Jiajun Wu
LM&Ro
122
51
0
13 Oct 2022
CLIP-Fields: Weakly Supervised Semantic Fields for Robotic Memory
CLIP-Fields: Weakly Supervised Semantic Fields for Robotic Memory
Nur Muhammad (Mahi) Shafiullah
Chris Paxton
Lerrel Pinto
Soumith Chintala
Arthur Szlam
VLMLM&RoCLIP
176
166
0
11 Oct 2022
See, Plan, Predict: Language-guided Cognitive Planning with Video
  Prediction
See, Plan, Predict: Language-guided Cognitive Planning with Video Prediction
Maria Attarian
Advaya Gupta
Ziyi Zhou
Wei Yu
Igor Gilitschenski
Animesh Garg
LM&Ro
76
8
0
07 Oct 2022
Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation
Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation
Mohit Shridhar
Lucas Manuelli
Dieter Fox
LM&Ro
304
501
0
12 Sep 2022
JARVIS: A Neuro-Symbolic Commonsense Reasoning Framework for
  Conversational Embodied Agents
JARVIS: A Neuro-Symbolic Commonsense Reasoning Framework for Conversational Embodied Agents
Kai Zheng
KAI-QING Zhou
Jing Gu
Yue Fan
Jialu Wang
Zong-xiao Li
Xuehai He
Xinze Wang
LM&Ro
106
40
0
28 Aug 2022
TIDEE: Tidying Up Novel Rooms using Visuo-Semantic Commonsense Priors
TIDEE: Tidying Up Novel Rooms using Visuo-Semantic Commonsense Priors
Gabriel H. Sarch
Zhaoyuan Fang
Adam W. Harley
Paul Schydlo
Michael J. Tarr
Saurabh Gupta
Katerina Fragkiadaki
LM&Ro
91
45
0
21 Jul 2022
Reasoning about Actions over Visual and Linguistic Modalities: A Survey
Reasoning about Actions over Visual and Linguistic Modalities: A Survey
Shailaja Keyur Sampat
Maitreya Patel
Subhasish Das
Yezhou Yang
Chitta Baral
ReLMLM&RoLRM
85
12
0
15 Jul 2022
A Simple Approach for Visual Rearrangement: 3D Mapping and Semantic
  Search
A Simple Approach for Visual Rearrangement: 3D Mapping and Semantic Search
Brandon Trabucco
Gunnar Sigurdsson
Robinson Piramuthu
Gaurav Sukhatme
Ruslan Salakhutdinov
OCL
84
7
0
21 Jun 2022
Neuro-Symbolic Procedural Planning with Commonsense Prompting
Neuro-Symbolic Procedural Planning with Commonsense Prompting
Yujie Lu
Weixi Feng
Wanrong Zhu
Wenda Xu
Xinze Wang
Miguel P. Eckstein
William Yang Wang
LM&Ro
81
37
0
06 Jun 2022
Few-shot Subgoal Planning with Language Models
Few-shot Subgoal Planning with Language Models
Lajanugen Logeswaran
Yao Fu
Moontae Lee
Honglak Lee
LRM
76
26
0
28 May 2022
Voxel-informed Language Grounding
Voxel-informed Language Grounding
Rodolfo Corona
Shizhan Zhu
Dan Klein
Trevor Darrell
178
12
0
19 May 2022
On the Limits of Evaluating Embodied Agent Model Generalization Using
  Validation Sets
On the Limits of Evaluating Embodied Agent Model Generalization Using Validation Sets
Hyounghun Kim
Aishwarya Padmakumar
Di Jin
Joey Tianyi Zhou
Dilek Z. Hakkani-Tür
16
0
0
18 May 2022
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances
Michael Ahn
Anthony Brohan
Noah Brown
Yevgen Chebotar
Omar Cortes
...
Ted Xiao
Peng Xu
Sichun Xu
Mengyuan Yan
Andy Zeng
LM&Ro
220
1,992
0
04 Apr 2022
Moment-based Adversarial Training for Embodied Language Comprehension
Moment-based Adversarial Training for Embodied Language Comprehension
Shintaro Ishikawa
K. Sugiura
LM&Ro
97
8
0
02 Apr 2022
LEBP -- Language Expectation & Binding Policy: A Two-Stream Framework
  for Embodied Vision-and-Language Interaction Task Learning Agents
LEBP -- Language Expectation & Binding Policy: A Two-Stream Framework for Embodied Vision-and-Language Interaction Task Learning Agents
Hao Liu
Yang Liu
Hong He
Hang Yang
LM&Ro
83
21
0
09 Mar 2022
DialFRED: Dialogue-Enabled Agents for Embodied Instruction Following
DialFRED: Dialogue-Enabled Agents for Embodied Instruction Following
Xiaofeng Gao
Qiaozi Gao
Ran Gong
Kaixiang Lin
Govind Thattai
Gaurav Sukhatme
LM&Ro
148
71
0
27 Feb 2022
One Step at a Time: Long-Horizon Vision-and-Language Navigation with
  Milestones
One Step at a Time: Long-Horizon Vision-and-Language Navigation with Milestones
Chan Hee Song
Jihyung Kil
Tai-Yu Pan
Brian M. Sadler
Wei-Lun Chao
Yu-Chuan Su
LRM
80
33
0
14 Feb 2022
A Dataset for Interactive Vision-Language Navigation with Unknown
  Command Feasibility
A Dataset for Interactive Vision-Language Navigation with Unknown Command Feasibility
Andrea Burns
Deniz Arsan
Sanjna Agrawal
Ranjitha Kumar
Kate Saenko
Bryan A. Plummer
131
65
0
04 Feb 2022
Learning to Act with Affordance-Aware Multimodal Neural SLAM
Learning to Act with Affordance-Aware Multimodal Neural SLAM
Zhiwei Jia
Kaixiang Lin
Yizhou Zhao
Qiaozi Gao
Govind Thattai
Gaurav Sukhatme
LM&Ro
93
15
0
24 Jan 2022
LUMINOUS: Indoor Scene Generation for Embodied AI Challenges
LUMINOUS: Indoor Scene Generation for Embodied AI Challenges
Yizhou Zhao
Kaixiang Lin
Zhiwei Jia
Qiaozi Gao
Govind Thattai
Jesse Thomason
Gaurav Sukhatme
3DVLM&Ro
56
16
0
10 Nov 2021
FILM: Following Instructions in Language with Modular Methods
FILM: Following Instructions in Language with Modular Methods
So Yeon Min
Devendra Singh Chaplot
Pradeep Ravikumar
Yonatan Bisk
Ruslan Salakhutdinov
LM&Ro
279
163
0
12 Oct 2021
Are you doing what I say? On modalities alignment in ALFRED
Are you doing what I say? On modalities alignment in ALFRED
Ting-Rui Chiang
Yi-Ting Yeh
Ta-Chung Chi
Yau-Shian Wang
75
1
0
12 Oct 2021
Skill Induction and Planning with Latent Language
Skill Induction and Planning with Latent Language
Pratyusha Sharma
Antonio Torralba
Jacob Andreas
LM&Ro
265
112
0
04 Oct 2021
TEACh: Task-driven Embodied Agents that Chat
TEACh: Task-driven Embodied Agents that Chat
Aishwarya Padmakumar
Jesse Thomason
Ayush Shrivastava
P. Lange
Anjali Narayan-Chen
Spandana Gella
Robinson Piramithu
Gokhan Tur
Dilek Z. Hakkani-Tür
LM&Ro
260
188
0
01 Oct 2021
PlaTe: Visually-Grounded Planning with Transformers in Procedural Tasks
PlaTe: Visually-Grounded Planning with Transformers in Procedural Tasks
Jiankai Sun
De-An Huang
Bo Lu
Yunhui Liu
Bolei Zhou
Animesh Garg
76
56
0
10 Sep 2021
Embodied BERT: A Transformer Model for Embodied, Language-guided Visual
  Task Completion
Embodied BERT: A Transformer Model for Embodied, Language-guided Visual Task Completion
Alessandro Suglia
Qiaozi Gao
Jesse Thomason
Govind Thattai
Gaurav Sukhatme
LM&Ro
130
78
0
10 Aug 2021
A modular vision language navigation and manipulation framework for long
  horizon compositional tasks in indoor environment
A modular vision language navigation and manipulation framework for long horizon compositional tasks in indoor environment
Homagni Saha
Fateme Fotouhif
Qisai Liu
Soumik Sarkar
LM&Ro
64
8
0
19 Jan 2021
1