CLIPort: What and Where Pathways for Robotic Manipulation

24 September 2021
Mohit Shridhar
Lucas Manuelli
D. Fox
    LM&Ro

Papers citing "CLIPort: What and Where Pathways for Robotic Manipulation"

50 / 477 papers shown
ORACLE-Grasp: Zero-Shot Task-Oriented Robotic Grasping using Large Multimodal Models
Avihai Giuili
Rotem Atari
A. Sintov
VLM
27
0
0
13 May 2025
Augmented Reality for RObots (ARRO): Pointing Visuomotor Policies Towards Visual Robustness
Reihaneh Mirjalili
Tobias Jülg
Florian Walter
Wolfram Burgard
27
0
0
13 May 2025
3D CAVLA: Leveraging Depth and 3D Context to Generalize Vision Language Action Models for Unseen Tasks
V. Bhat
Yu-Hsiang Lan
P. Krishnamurthy
Ramesh Karri
Farshad Khorrami
52
0
0
09 May 2025
Vision-Language-Action Models: Concepts, Progress, Applications and Challenges
Ranjan Sapkota
Yang Cao
Konstantinos I Roumeliotis
Manoj Karkee
LM&Ro
157
1
0
07 May 2025
Latent Adaptive Planner for Dynamic Manipulation
Donghun Noh
Deqian Kong
Minglu Zhao
Andrew Lizarraga
Jianwen Xie
Ying Nian Wu
Dennis W. Hong
125
0
0
06 May 2025
CrayonRobo: Object-Centric Prompt-Driven Vision-Language-Action Model for Robotic Manipulation
Xiaoqi Li
Lingyun Xu
M. Zhang
Jiaming Liu
Yan Shen
...
Jiahui Xu
Liang Heng
Siyuan Huang
S. Zhang
Hao Dong
LM&Ro
51
0
0
04 May 2025
DeCo: Task Decomposition and Skill Composition for Zero-Shot Generalization in Long-Horizon 3D Manipulation
Zixuan Chen
Junhui Yin
Yangtao Chen
Jing Huo
Pinzhuo Tian
Jieqi Shi
Yiwen Hou
Y. Li
Yang Gao
33
0
0
01 May 2025
RoboGround: Robotic Manipulation with Grounded Vision-Language Priors
Haifeng Huang
Xinyi Chen
Y. Chen
H. Li
Xiaoshen Han
Z. Wang
Tai Wang
Jiangmiao Pang
Zhou Zhao
LM&Ro
80
0
0
30 Apr 2025
A Survey of Interactive Generative Video
Jiwen Yu
Yiran Qin
Haoxuan Che
Quande Liu
X. Wang
Pengfei Wan
Di Zhang
Kun Gai
Hao Chen
Xihui Liu
VGen
65
0
0
30 Apr 2025
RL-Driven Data Generation for Robust Vision-Based Dexterous Grasping
Atsushi Kanehira
Naoki Wake
Kazuhiro Sasabuchi
Jun Takamatsu
Katsushi Ikeuchi
42
0
0
25 Apr 2025
Chain-of-Modality: Learning Manipulation Programs from Multimodal Human Videos with Vision-Language-Models
Chen Wang
Fei Xia
Wenhao Yu
Tingnan Zhang
Ruohan Zhang
Ce Liu
Li Fei-Fei
Jie Tan
Jacky Liang
33
0
0
17 Apr 2025
KeyMPs: One-Shot Vision-Language Guided Motion Generation by Sequencing DMPs for Occlusion-Rich Tasks
Edgar Anarossi
Yuhwan Kwon
Hirotaka Tahara
Shohei Tanaka
Keisuke Shirai
Masashi Hamaya
C. C. Beltran-Hernandez
Atsushi Hashimoto
Takamitsu Matsubara
32
0
0
14 Apr 2025
LangPert: Detecting and Handling Task-level Perturbations for Robust Object Rearrangement
Xu Yin
Min-Sung Yoon
Yuchi Huo
Kang Zhang
Sung-eui Yoon
26
0
0
14 Apr 2025
Slot-Level Robotic Placement via Visual Imitation from Single Human Video
Dandan Shan
Kaichun Mo
Wei Yang
Yu-Wei Chao
David Fouhey
Dieter Fox
Arsalan Mousavian
36
0
0
02 Apr 2025
Data-Agnostic Robotic Long-Horizon Manipulation with Vision-Language-Guided Closed-Loop Feedback
Y. Meng
Xiangtong Yao
Haihui Ye
Yirui Zhou
Shengqiang Zhang
Zhenshan Bing
Alois C. Knoll
LM&Ro
VLM
55
0
0
27 Mar 2025
CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models
Qingqing Zhao
Yao Lu
Moo Jin Kim
Zipeng Fu
Zhuoyang Zhang
...
Ankur Handa
Ming-Yu Liu
Donglai Xiang
Gordon Wetzstein
Tsung-Yi Lin
LM&Ro
LRM
43
11
0
27 Mar 2025
Decompositional Neural Scene Reconstruction with Generative Diffusion Prior
Junfeng Ni
Yu Liu
Ruijie Lu
Zirui Zhou
Song-Chun Zhu
Yixin Chen
Siyuan Huang
DiffM
67
4
0
19 Mar 2025
Adversarial Data Collection: Human-Collaborative Perturbations for Efficient and Robust Robotic Imitation Learning
Siyuan Huang
Yue Liao
Siyuan Feng
Shu Jiang
Si Liu
Hongsheng Li
Maoqing Yao
Guanghui Ren
AAML
58
1
0
14 Mar 2025
Towards Fast, Memory-based and Data-Efficient Vision-Language Policy
Haoxuan Li
Sixu Yan
Y. Li
Xinggang Wang
LM&Ro
64
0
0
13 Mar 2025
KUDA: Keypoints to Unify Dynamics Learning and Visual Prompting for Open-Vocabulary Robotic Manipulation
Zixian Liu
Mingtong Zhang
Yunzhu Li
54
0
0
13 Mar 2025
Efficient Alignment of Unconditioned Action Prior for Language-conditioned Pick and Place in Clutter
Kechun Xu
Xunlong Xia
Kaixuan Wang
Yifei Yang
Yunxuan Mao
Bing Deng
R. Xiong
Y. Wang
OffRL
69
0
0
12 Mar 2025
SE(3)-Equivariant Robot Learning and Control: A Tutorial Survey
Joohwan Seo
Soochul Yoo
Junwoo Chang
Hyunseok An
Hyunwoo Ryu
Soomi Lee
Arvind Kruthiventy
Jongeun Choi
R. Horowitz
71
2
0
12 Mar 2025
MetaFold: Language-Guided Multi-Category Garment Folding Framework via Trajectory Generation and Foundation Model
Haonan Chen
Junxiao Li
Ruihai Wu
Yiwei Liu
Yiwen Hou
...
Chongkai Gao
Zhenyu Wei
Shensi Xu
Jiaqi Huang
Lin Shao
AI4CE
49
1
0
11 Mar 2025
iManip: Skill-Incremental Learning for Robotic Manipulation
Zexin Zheng
Jia-Feng Cai
Xiao-Ming Wu
Yi-Lin Wei
Yu-Ming Tang
Wei-Shi Zheng
CLL
54
0
0
10 Mar 2025
Look Before You Leap: Using Serialized State Machine for Language Conditioned Robotic Manipulation
Tong Mu
Yihao Liu
Mehran Armand
68
0
0
07 Mar 2025
SRSA: Skill Retrieval and Adaptation for Robotic Assembly Tasks
Yijie Guo
Bingjie Tang
Iretiayo Akinola
Dieter Fox
Abhishek Gupta
Yashraj S. Narang
44
0
0
06 Mar 2025
Generative Artificial Intelligence in Robotic Manipulation: A Survey
Kun Zhang
Peng Yun
Jun Cen
Junhao Cai
DiDi Zhu
...
Qifeng Chen
Jia Pan
Wei K. Zhang
Bo Yang
Hua Chen
59
1
0
05 Mar 2025
RoboDexVLM: Visual Language Model-Enabled Task Planning and Motion Control for Dexterous Robot Manipulation
Haichao Liu
Sikai Guo
Pengfei Mai
Jiahang Cao
Haoang Li
Jun Ma
39
1
0
03 Mar 2025
Point Policy: Unifying Observations and Actions with Key Points for Robot Manipulation
Siddhant Haldar
Lerrel Pinto
3DPC
66
2
0
27 Feb 2025
Tidiness Score-Guided Monte Carlo Tree Search for Visual Tabletop Rearrangement
Hogun Kee
Wooseok Oh
Minjae Kang
Hyemin Ahn
Songhwai Oh
62
0
0
24 Feb 2025
X-IL: Exploring the Design Space of Imitation Learning Policies
Xiaogang Jia
Atalay Donat
Xi Huang
Xuan Zhao
Denis Blessing
...
Han A. Wang
Hanyi Zhang
Qian Wang
Rudolf Lioutikov
Gerhard Neumann
88
1
0
20 Feb 2025
Object-Centric Image to Video Generation with Language Guidance
Angel Villar-Corrales
Gjergj Plepi
Sven Behnke
DiffM
VGen
OCL
76
0
0
17 Feb 2025
Memory, Benchmark & Robots: A Benchmark for Solving Complex Tasks with Reinforcement Learning
Egor Cherepanov
Nikita Kachaev
A. Kovalev
Aleksandr I. Panov
OffRL
41
0
0
14 Feb 2025
A Real-to-Sim-to-Real Approach to Robotic Manipulation with VLM-Generated Iterative Keypoint Rewards
Shivansh Patel
Xinchen Yin
Wenlong Huang
Shubham Garg
H. Nayyeri
Li Fei-Fei
Svetlana Lazebnik
Y. Li
92
0
0
12 Feb 2025
Temporal Representation Alignment: Successor Features Enable Emergent Compositionality in Robot Instruction Following
Vivek Myers
Bill Chunyuan Zheng
Anca Dragan
Kuan Fang
Sergey Levine
65
0
0
08 Feb 2025
Compositional Instruction Following with Language Models and Reinforcement Learning
Vanya Cohen
Geraud Nangue Tasse
N. Gopalan
Steven D. James
Matthew C. Gombolay
Ray Mooney
Benjamin Rosman
71
0
0
21 Jan 2025
Shake-VLA: Vision-Language-Action Model-Based System for Bimanual Robotic Manipulations and Liquid Mixing
Muhamamd Haris Khan
Selamawit Asfaw
Dmitrii Iarchuk
Miguel Altamirano Cabrera
Luis Moreno
Issatay Tokmurziyev
Dzmitry Tsetserukou
47
2
0
12 Jan 2025
FlowBotHD: History-Aware Diffuser Handling Ambiguities in Articulated Objects Manipulation
Yishu Li
Wen Hui Leng
Yiming Fang
Ben Eisner
David Held
AI4CE
42
1
0
31 Dec 2024
TalkWithMachines: Enhancing Human-Robot Interaction for Interpretable Industrial Robotics Through Large/Vision Language Models
Ammar N. Abbas
Csaba Beleznai
LM&Ro
78
2
0
19 Dec 2024
SparseGrasp: Robotic Grasping via 3D Semantic Gaussian Splatting from Sparse Multi-View RGB Images
Junqiu Yu
Xinlin Ren
Yongchong Gu
Haitao Lin
Tianyu Wang
Y. X. Zhu
Hang Xu
Yu-Gang Jiang
Xiangyang Xue
Yanwei Fu
3DGS
81
0
0
03 Dec 2024
Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation
Yueru Jia
Jiaming Liu
Sixiang Chen
Chenyang Gu
Z. Wang
...
Lily Lee
Pengwei Wang
Zhongyuan Wang
Renrui Zhang
Shanghang Zhang
89
11
0
27 Nov 2024
Learning for Long-Horizon Planning via Neuro-Symbolic Abductive Imitation
Jie-Jing Shao
Hao-Ran Hao
Xiao-Wen Yang
Yu-Feng Li
74
2
0
27 Nov 2024
Rethinking the Intermediate Features in Adversarial Attacks: Misleading Robotic Models via Adversarial Distillation
Ke Zhao
Huayang Huang
Miao Li
Yu Wu
AAML
71
0
0
21 Nov 2024
Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning
Jiange Yang
Haoyi Zhu
Y. Wang
Gangshan Wu
Tong He
Limin Wang
100
2
0
21 Nov 2024
TAPT: Test-Time Adversarial Prompt Tuning for Robust Inference in Vision-Language Models
Xin Wang
Kai-xiang Chen
Jiaming Zhang
Jingjing Chen
Xingjun Ma
AAML
VPVLM
VLM
83
1
0
20 Nov 2024
Bridging the Resource Gap: Deploying Advanced Imitation Learning Models onto Affordable Embedded Platforms
Haizhou Ge
Ruixiang Wang
Zhu-ang Xu
Hongrui Zhu
Ruichen Deng
Yuhang Dong
Zeyu Pang
Guyue Zhou
Junyu Zhang
Lu Shi
78
1
0
18 Nov 2024
Learning Generalizable 3D Manipulation With 10 Demonstrations
Yu Ren
Yang Cong
Ronghan Chen
Jiahao Long
SSL
56
1
0
15 Nov 2024
ClevrSkills: Compositional Language and Visual Reasoning in Robotics
Sanjay Haresh
Daniel Dijkman
Apratim Bhattacharyya
Roland Memisevic
CoGe
LRM
39
1
0
13 Nov 2024
RT-Grasp: Reasoning Tuning Robotic Grasping via Multi-modal Large Language Model
Jinxuan Xu
Shiyu Jin
Yutian Lei
Yuqian Zhang
Liangjun Zhang
LRM
25
7
0
07 Nov 2024
Vocal Sandbox: Continual Learning and Adaptation for Situated Human-Robot Collaboration
J. Grannen
Siddharth Karamcheti
Suvir Mirchandani
Percy Liang
Dorsa Sadigh
39
0
0
04 Nov 2024