ResearchTrend.AI
CLIPort: What and Where Pathways for Robotic Manipulation

24 September 2021
Mohit Shridhar
Lucas Manuelli
D. Fox
    LM&Ro

Papers citing "CLIPort: What and Where Pathways for Robotic Manipulation"

50 / 477 papers shown
Title
Empowering Large Language Models on Robotic Manipulation with Affordance Prompting
Guangran Cheng
Chuheng Zhang
Wenzhe Cai
Li Zhao
Changyin Sun
Jiang Bian
LM&Ro
LLMAG
189
9
0
17 Apr 2024
PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI
Yandan Yang
Baoxiong Jia
Peiyuan Zhi
Siyuan Huang
LM&Ro
VGen
46
42
0
15 Apr 2024
Reflectance Estimation for Proximity Sensing by Vision-Language Models: Utilizing Distributional Semantics for Low-Level Cognition in Robotics
Masashi Osada
G. A. G. Ricardez
Yosuke Suzuki
Tadahiro Taniguchi
26
2
0
11 Apr 2024
GenCHiP: Generating Robot Policy Code for High-Precision and Contact-Rich Manipulation Tasks
Kaylee Burns
Ajinkya Jain
Keegan Go
Fei Xia
Michael Stark
S. Schaal
Karol Hausman
29
4
0
09 Apr 2024
Can only LLMs do Reasoning?: Potential of Small Language Models in Task Planning
Gawon Choi
Hyemin Ahn
LM&Ro
LRM
34
1
0
05 Apr 2024
SUGAR: Pre-training 3D Visual Representations for Robotics
Shizhe Chen
Ricardo Garcia Pinel
Ivan Laptev
Cordelia Schmid
47
14
0
01 Apr 2024
RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents
Zeren Chen
Zhelun Shi
Xiaoya Lu
Lehan He
Sucheng Qian
...
Zhen-fei Yin
Jing Shao
Cewu Lu
38
5
0
28 Mar 2024
Uncertainty-Aware Deployment of Pre-trained Language-Conditioned Imitation Learning Policies
Bo Wu
Bruce D. Lee
Kostas Daniilidis
Bernadette Bucher
Nikolai Matni
LM&Ro
AI4CE
30
2
0
27 Mar 2024
Temporal and Semantic Evaluation Metrics for Foundation Models in Post-Hoc Analysis of Robotic Sub-tasks
Jonathan Salfity
Selma Wanna
Minkyu Choi
Mitch Pryor
43
1
0
25 Mar 2024
Selectively Informative Description can Reduce Undesired Embedding Entanglements in Text-to-Image Personalization
Jimyeong Kim
Jungwon Park
Wonjong Rhee
DiffM
30
5
0
22 Mar 2024
Click to Grasp: Zero-Shot Precise Manipulation via Visual Diffusion Descriptors
Nikolaos Tsagkas
Jack Rome
S. Ramamoorthy
Oisin Mac Aodha
Chris Xiaoxuan Lu
26
6
0
21 Mar 2024
GeRM: A Generalist Robotic Model with Mixture-of-experts for Quadruped Robot
Wenxuan Song
Han Zhao
Pengxiang Ding
Can Cui
Shangke Lyu
Yaning Fan
Donglin Wang
OffRL
27
11
0
20 Mar 2024
Adaptive Visual Imitation Learning for Robotic Assisted Feeding Across Varied Bowl Configurations and Food Types
Rui Liu
Amisha Bhaskar
Pratap Tokekar
32
3
0
19 Mar 2024
VIHE: Virtual In-Hand Eye Transformer for 3D Robotic Manipulation
Weiyao Wang
Yutian Lei
Shiyu Jin
Gregory D. Hager
Liangjun Zhang
31
2
0
18 Mar 2024
ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
Siyuan Huang
Iaroslav Ponomarenko
Zhengkai Jiang
Xiaoqi Li
Xiaobin Hu
Peng Gao
Hongsheng Li
Hao Dong
LM&Ro
32
16
0
17 Mar 2024
GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping
Yuhang Zheng
Xiangyu Chen
Yupeng Zheng
Songen Gu
Runyi Yang
...
Chao Yang
Dawei Wang
Zhen Chen
Xiaoxiao Long
Meiqing Wang
55
43
0
14 Mar 2024
InfoCon: Concept Discovery with Generative and Discriminative Informativeness
Ruizhe Liu
Qian Luo
Yanchao Yang
32
2
0
14 Mar 2024
ExploRLLM: Guiding Exploration in Reinforcement Learning with Large Language Models
Runyu Ma
Jelle Luijkx
Zlatan Ajanović
Jens Kober
LM&Ro
LRM
38
7
0
14 Mar 2024
NaturalVLM: Leveraging Fine-grained Natural Language for Affordance-Guided Visual Manipulation
Ran Xu
Yan Shen
Xiaoqi Li
Ruihai Wu
Hao Dong
LM&Ro
30
9
0
13 Mar 2024
ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation
Guanxing Lu
Shiyi Zhang
Ziwei Wang
Changliu Liu
Jiwen Lu
Yansong Tang
46
50
0
13 Mar 2024
CoPa: General Robotic Manipulation through Spatial Constraints of Parts with Foundation Models
Haoxu Huang
Fanqi Lin
Yingdong Hu
Shengjie Wang
Yang Gao
38
49
0
13 Mar 2024
VANP: Learning Where to See for Navigation with Self-Supervised Vision-Action Pre-Training
Mohammad Nazeri
Junzhe Wang
Amirreza Payandeh
Xuesu Xiao
SSL
ViT
41
5
0
12 Mar 2024
DNAct: Diffusion Guided Multi-Task 3D Policy Learning
Ge Yan
Yueh-hua Wu
Xiaolong Wang
VGen
37
20
0
07 Mar 2024
RT-H: Action Hierarchies Using Language
Suneel Belkhale
Tianli Ding
Ted Xiao
P. Sermanet
Quan Vuong
Jonathan Tompson
Yevgen Chebotar
Debidatta Dwibedi
Dorsa Sadigh
LM&Ro
34
76
0
04 Mar 2024
Never-Ending Behavior-Cloning Agent for Robotic Manipulation
Wenqi Liang
Gan Sun
Qian He
Yu Ren
Jiahua Dong
Yang Cong
LM&Ro
27
1
0
01 Mar 2024
Mirage: Cross-Embodiment Zero-Shot Policy Transfer with Cross-Painting
L. Chen
Kush Hari
K. Dharmarajan
Chenfeng Xu
Quan Vuong
Ken Goldberg
49
20
0
29 Feb 2024
MOSAIC: A Modular System for Assistive and Interactive Cooking
Huaxiaoyue Wang
K. Kedia
Juntao Ren
Rahma Abdullah
Atiksh Bhardwaj
...
Maximus Adrian Pace
Yash Sharma
Xiangwan Sun
Neha Sunkara
Sanjiban Choudhury
35
12
0
29 Feb 2024
Learning with Language-Guided State Abstractions
Andi Peng
Ilia Sucholutsky
Belinda Z. Li
T. Sumers
Thomas L. Griffiths
Jacob Andreas
Julie A. Shah
LM&Ro
49
13
0
28 Feb 2024
Rethinking Mutual Information for Language Conditioned Skill Discovery on Imitation Learning
Zhaoxun Ju
Chao Yang
Hongbo Wang
Yu Qiao
Gang Hua
LM&Ro
28
3
0
27 Feb 2024
NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation
Jiazhao Zhang
Kunyu Wang
Rongtao Xu
Gengze Zhou
Yicong Hong
Xiaomeng Fang
Qi Wu
Zhizheng Zhang
Wang He
LM&Ro
37
45
0
24 Feb 2024
Grasp, See, and Place: Efficient Unknown Object Rearrangement with Policy Structure Prior
Kechun Xu
Zhongxiang Zhou
Jun Wu
Haojian Lu
Rong Xiong
Yue Wang
40
2
0
23 Feb 2024
CyberDemo: Augmenting Simulated Human Demonstration for Real-World Dexterous Manipulation
Jun Wang
Yuzhe Qin
Kaiming Kuang
Yigit Korkmaz
Akhilan Gurumoorthy
Hao Su
Xiaolong Wang
43
19
0
22 Feb 2024
RoboScript: Code Generation for Free-Form Manipulation Tasks across Real and Simulation
Junting Chen
Yao Mu
Qiaojun Yu
Tianming Wei
Silang Wu
...
Wenqi Shao
Yu Qiao
Huazhe Xu
Mingyu Ding
Ping Luo
LM&Ro
34
11
0
22 Feb 2024
DINOBot: Robot Manipulation via Retrieval and Alignment with Vision Foundation Models
Norman Di Palo
Edward Johns
LM&Ro
40
25
0
20 Feb 2024
Learning to Learn Faster from Human Feedback with Language Model Predictive Control
Jacky Liang
Fei Xia
Wenhao Yu
Andy Zeng
Montse Gonzalez Arenas
...
N. Heess
Kanishka Rao
Nik Stewart
Jie Tan
Carolina Parada
LM&Ro
61
34
0
18 Feb 2024
3D Diffuser Actor: Policy Diffusion with 3D Scene Representations
Tsung-Wei Ke
N. Gkanatsios
Katerina Fragkiadaki
VGen
39
108
0
16 Feb 2024
One-shot Imitation in a Non-Stationary Environment via Multi-Modal Skill
Sangwoo Shin
Daehee Lee
Minjong Yoo
Woo Kyung Kim
Honguk Woo
32
9
0
13 Feb 2024
BBSEA: An Exploration of Brain-Body Synchronization for Embodied Agents
Sizhe Yang
Qian Luo
Anumpam Pani
Yanchao Yang
29
2
0
13 Feb 2024
THE COLOSSEUM: A Benchmark for Evaluating Generalization for Robotic Manipulation
Wilbert Pumacay
Ishika Singh
Jiafei Duan
Ranjay Krishna
Jesse Thomason
Dieter Fox
27
39
0
13 Feb 2024
Policy Improvement using Language Feedback Models
Victor Zhong
Dipendra Kumar Misra
Xingdi Yuan
Marc-Alexandre Côté
16
9
0
12 Feb 2024
PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
Soroush Nasiriany
Fei Xia
Wenhao Yu
Ted Xiao
Jacky Liang
...
Karol Hausman
N. Heess
Chelsea Finn
Sergey Levine
Brian Ichter
LM&Ro
LRM
25
92
0
12 Feb 2024
SemTra: A Semantic Skill Translator for Cross-Domain Zero-Shot Policy Adaptation
Sangwoo Shin
Minjong Yoo
Jeongwoo Lee
Honguk Woo
43
4
0
12 Feb 2024
Learning by Watching: A Review of Video-based Learning Approaches for Robot Manipulation
Chrisantus Eze
Christopher Crick
SSL
82
12
0
11 Feb 2024
Real-World Robot Applications of Foundation Models: A Review
Kento Kawaharazuka
T. Matsushima
Andrew Gambardella
Jiaxian Guo
Chris Paxton
Andy Zeng
OffRL
VLM
LM&Ro
48
45
0
08 Feb 2024
Code as Reward: Empowering Reinforcement Learning with VLMs
David Venuto
Sami Nur Islam
Martin Klissarov
Doina Precup
Sherry Yang
Ankit Anand
VLM
25
9
0
07 Feb 2024
The Essential Role of Causality in Foundation World Models for Embodied AI
Tarun Gupta
Wenbo Gong
Chao Ma
Nick Pawlowski
Agrin Hilmkil
...
Jianfeng Gao
Stefan Bauer
Danica Kragic
Bernhard Schölkopf
Cheng Zhang
30
15
0
06 Feb 2024
Vision-Language Models Provide Promptable Representations for Reinforcement Learning
William Chen
Oier Mees
Aviral Kumar
Sergey Levine
VLM
LM&Ro
42
23
0
05 Feb 2024
Fast Explicit-Input Assistance for Teleoperation in Clutter
Nick Walker
Xuning Yang
Animesh Garg
Maya Cakmak
Dieter Fox
Claudia Pérez-D'Arpino
16
0
0
04 Feb 2024
Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning
Haoyi Zhu
Yating Wang
Di Huang
Weicai Ye
Wanli Ouyang
Tong He
SSL
3DPC
44
20
0
04 Feb 2024
TartanDrive 2.0: More Modalities and Better Infrastructure to Further Self-Supervised Learning Research in Off-Road Driving Tasks
Matthew Sivaprakasam
Parv Maheshwari
Mateo Guaman Castro
S. Triest
Micah Nye
Steven Willits
Andrew Saba
Wenshan Wang
Sebastian A. Scherer
43
13
0
02 Feb 2024