ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2210.03825
  4. Cited By
See, Plan, Predict: Language-guided Cognitive Planning with Video
  Prediction

See, Plan, Predict: Language-guided Cognitive Planning with Video Prediction

7 October 2022
Maria Attarian
Advaya Gupta
Ziyi Zhou
Wei Yu
Igor Gilitschenski
Animesh Garg
    LM&Ro
ArXivPDFHTML

Papers citing "See, Plan, Predict: Language-guided Cognitive Planning with Video Prediction"

10 / 10 papers shown
Title
Symbolically-Guided Visual Plan Inference from Uncurated Video Data
Symbolically-Guided Visual Plan Inference from Uncurated Video Data
Wenyan Yang
Ahmet Tikna
Yi Zhao
Yuying Zhang
Luigi Palopoli
Marco Roveri
Joni Pajarinen
VGen
26
0
0
13 May 2025
Temporal Representation Alignment: Successor Features Enable Emergent Compositionality in Robot Instruction Following
Temporal Representation Alignment: Successor Features Enable Emergent Compositionality in Robot Instruction Following
Vivek Myers
Bill Chunyuan Zheng
Anca Dragan
Kuan Fang
Sergey Levine
65
0
0
08 Feb 2025
Can Pre-Trained Text-to-Image Models Generate Visual Goals for
  Reinforcement Learning?
Can Pre-Trained Text-to-Image Models Generate Visual Goals for Reinforcement Learning?
Jialu Gao
Kaizhe Hu
Guowei Xu
Huazhe Xu
LM&Ro
23
15
0
15 Jul 2023
Goal Representations for Instruction Following: A Semi-Supervised
  Language Interface to Control
Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control
Vivek Myers
Andre Wang He
Kuan Fang
Homer Walke
Philippe Hansen-Estruch
Ching-An Cheng
Mihai Jalobeanu
Andrey Kolobov
Anca Dragan
Sergey Levine
LM&Ro
27
29
0
30 Jun 2023
Toward Grounded Commonsense Reasoning
Toward Grounded Commonsense Reasoning
Minae Kwon
Hengyuan Hu
Vivek Myers
Siddharth Karamcheti
Anca Dragan
Dorsa Sadigh
LM&Ro
ReLM
LRM
42
9
0
14 Jun 2023
Procedure Planning in Instructional Videos via Contextual Modeling and
  Model-based Policy Learning
Procedure Planning in Instructional Videos via Contextual Modeling and Model-based Policy Learning
Jing Bi
Jiebo Luo
Chenliang Xu
76
48
0
05 Oct 2021
Grounding Predicates through Actions
Grounding Predicates through Actions
Toki Migimatsu
Jeannette Bohg
150
32
0
29 Sep 2021
A Persistent Spatial Semantic Representation for High-level Natural
  Language Instruction Execution
A Persistent Spatial Semantic Representation for High-level Natural Language Instruction Execution
Valts Blukis
Chris Paxton
Dieter Fox
Animesh Garg
Yoav Artzi
LM&Ro
214
135
0
12 Jul 2021
VideoGPT: Video Generation using VQ-VAE and Transformers
VideoGPT: Video Generation using VQ-VAE and Transformers
Wilson Yan
Yunzhi Zhang
Pieter Abbeel
A. Srinivas
ViT
VGen
245
484
0
20 Apr 2021
Learning by Watching: Physical Imitation of Manipulation Skills from
  Human Videos
Learning by Watching: Physical Imitation of Manipulation Skills from Human Videos
Haoyu Xiong
Quanzhou Li
Yun-Chun Chen
Homanga Bharadhwaj
Samarth Sinha
Animesh Garg
SSL
128
93
0
18 Jan 2021
1