ResearchTrend.AI

NVP-HRI: Zero Shot Natural Voice and Posture-based Human-Robot Interaction via Large Language Model

12 March 2025
Yuzhi Lai
Shenghai Yuan
Youssef Nassar
Mingyu Fan
T. Weber
Matthias Rätsch
    LM&Ro
arXiv:2503.09335

Papers citing "NVP-HRI: Zero Shot Natural Voice and Posture-based Human-Robot Interaction via Large Language Model"

46 / 46 papers shown
MM-LINS: a Multi-Map LiDAR-Inertial System for Over-Degenerate Environments
Yongxin Ma
Jie Xu
Shenghai Yuan
Tian Zhi
Wenlu Yu
Jun Zhou
Lihua Xie
107
6
0
25 Mar 2025
FAM-HRI: Foundation-Model Assisted Multi-Modal Human-Robot Interaction Combining Gaze and Speech
Yuzhi Lai
Shenghai Yuan
Boya Zhang
Benjamin Kiefer
Peizheng Li
Andreas Zell
68
1
0
11 Mar 2025
Handle Object Navigation as Weighted Traveling Repairman Problem
Ruimeng Liu
Xinhang Xu
Shenghai Yuan
Lihua Xie
132
2
0
10 Mar 2025
HelmetPoser: A Helmet-Mounted IMU Dataset for Data-Driven Estimation of Human Head Motion in Diverse Conditions
Jianping Li
Qiutong Leng
Jinxing Liu
Xinhang Xu
Tongxin Jin
Muqing Cao
T. Nguyen
Shenghai Yuan
Kun Cao
Lihua Xie
139
4
0
17 Feb 2025
Swept Volume-Aware Trajectory Planning and MPC Tracking for Multi-Axle Swerve-Drive AMRs
Tianxin Hu
Shenghai Yuan
Ruofei Bai
Xinhang Xu
Yuwen Liao
Fen Liu
Lihua Xie
101
2
0
22 Dec 2024
AV-PedAware: Self-Supervised Audio-Visual Fusion for Dynamic Pedestrian Awareness
Yizhuo Yang
Shenghai Yuan
Muqing Cao
Jianfei Yang
Lihua Xie
236
8
0
11 Nov 2024
Robust Loop Closure by Textual Cues in Challenging Environments
Tongxing Jin
T. Nguyen
Xinhang Xu
Yizhuo Yang
Shenghai Yuan
Jianping Li
Lihua Xie
79
6
0
21 Oct 2024
SGBA: Semantic Gaussian Mixture Model-Based LiDAR Bundle Adjustment
Xingyu Ji
Shenghai Yuan
Jianping Li
Pengyu Yin
Haozhi Cao
Lihua Xie
83
6
0
02 Oct 2024
GERA: Geometric Embedding for Efficient Point Registration Analysis
Geng Li
Haozhi Cao
Mingyang Liu
Shenghai Yuan
Jianfei Yang
3DPC
51
2
0
01 Oct 2024
AIR-Embodied: An Efficient Active 3DGS-based Interaction and Reconstruction Framework with Embodied Large Language Model
Zhenghao Qi
Shenghai Yuan
Fen Liu
Haozhi Cao
Tianchen Deng
Jianfei Yang
Lihua Xie
LM&Ro, DiffM
61
3
0
24 Sep 2024
ULOC: Learning to Localize in Complex Large-Scale Environments with Ultra-Wideband Ranges
Thien-Minh Nguyen
Yizhuo Yang
T. Nguyen
Shenghai Yuan
Lihua Xie
74
5
0
17 Sep 2024
Salient Sparse Visual Odometry With Pose-Only Supervision
Siyu Chen
Kangcheng Liu
Chen Wang
Shenghai Yuan
Jianfei Yang
Lihua Xie
94
8
0
06 Apr 2024
HCTO: Optimality-Aware LiDAR Inertial Odometry with Hybrid Continuous Time Optimization for Compact Wearable Mapping System
Jianping Li
Shenghai Yuan
Muqing Cao
Thien-Minh Nguyen
Kun Cao
Lihua Xie
82
27
0
21 Mar 2024
Compact 3D Gaussian Splatting For Dense Visual SLAM
Tianchen Deng
Yaohui Chen
Leyan Zhang
Jianfei Yang
Shenghai Yuan
Danwei W. Wang
Weidong Chen
3DGS
139
34
0
17 Mar 2024
PSS-BA: LiDAR Bundle Adjustment with Progressive Spatial Smoothing
Jianping Li
Thien-Minh Nguyen
Shenghai Yuan
Lihua Xie
59
10
0
10 Mar 2024
A Cost-Effective Cooperative Exploration and Inspection Strategy for Heterogeneous Aerial System
Xinhang Xu
Muqing Cao
Shenghai Yuan
T. Nguyen
Thien-Minh Nguyen
Lihua Xie
60
9
0
02 Mar 2024
Jacquard V2: Refining Datasets using the Human In the Loop Data Correction Method
Qiuhao Li
Shenghai Yuan
67
5
0
08 Feb 2024
MMAUD: A Comprehensive Multi-Modal Anti-UAV Dataset for Modern Miniature Drone Threats
Shenghai Yuan
Yizhuo Yang
T. Nguyen
Thien-Minh Nguyen
Jianfei Yang
Fen Liu
Jianping Li
Han Wang
Lihua Xie
55
19
0
06 Feb 2024
MoPA: Multi-Modal Prior Aided Domain Adaptation for 3D Semantic Segmentation
Haozhi Cao
Yuecong Xu
Jianfei Yang
Peng Yin
Shenghai Yuan
Lihua Xie
83
14
0
21 Sep 2023
Outram: One-shot Global Localization via Triangulated Scene Graph and Global Outlier Pruning
Peng Yin
Haozhi Cao
Thien-Minh Nguyen
Shenghai Yuan
Shuyang Zhang
Kangcheng Liu
Lihua Xie
69
17
0
16 Sep 2023
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models
Wenlong Huang
Chen Wang
Ruohan Zhang
Yunzhu Li
Jiajun Wu
Li Fei-Fei
LM&Ro
120
517
0
12 Jul 2023
LIO-GVM: an Accurate, Tightly-Coupled Lidar-Inertial Odometry with Gaussian Voxel Map
Xingyu Ji
Shenghai Yuan
Peng Yin
Lihua Xie
62
14
0
30 Jun 2023
MM-Fi: Multi-Modal Non-Intrusive 4D Human Dataset for Versatile Wireless Sensing
Jianfei Yang
He Huang
Yunjiao Zhou
Xinyan Chen
Yuecong Xu
Shenghai Yuan
Han Zou
Chris Xiaoxuan Lu
Lihua Xie
93
58
0
12 May 2023
Segment Anything
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
...
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
MLLM, VLM
371
7,405
0
05 Apr 2023
VR-SLAM: A Visual-Range Simultaneous Localization and Mapping System using Monocular Camera and Ultra-wideband Sensors
T. Nguyen
Shenghai Yuan
Lihua Xie
55
8
0
20 Mar 2023
Multi-Modal Continual Test-Time Adaptation for 3D Semantic Segmentation
Haozhi Cao
Yuecong Xu
Jianfei Yang
Pengyu Yin
Shenghai Yuan
Lihua Xie
TTA
52
18
0
18 Mar 2023
Chat with the Environment: Interactive Multimodal Perception Using Large Language Models
Xufeng Zhao
Mengdi Li
C. Weber
Muhammad Burhan Hafez
S. Wermter
LLMAG, LM&Ro, LRM
156
49
0
14 Mar 2023
DoubleBee: A Hybrid Aerial-Ground Robot with Two Active Wheels
Muqing Cao
Xinhang Xu
Shenghai Yuan
Kun Cao
Kangcheng Liu
Lihua Xie
40
12
0
09 Mar 2023
Communicating human intent to a robotic companion by multi-type gesture sentences
Petr Vanc
Jan Kristof Behrens
Karla Stepanova
Václav Hlaváč
59
8
0
08 Mar 2023
Co-Speech Gesture Synthesis using Discrete Gesture Token Learning
Shuhong Lu
Youngwoo Yoon
Andrew W. Feng
SLR
86
12
0
04 Mar 2023
ChatGPT for Robotics: Design Principles and Model Abilities
Sai H. Vemprala
Rogerio Bonatti
A. Bucker
Ashish Kapoor
LM&Ro
83
474
0
20 Feb 2023
Segregator: Global Point Cloud Registration with Semantic and Geometric Cues
Peng Yin
Shenghai Yuan
Haozhi Cao
Xingyu Ji
Shuyang Zhang
Lihua Xie
72
19
0
18 Jan 2023
Robust Speech Recognition via Large-Scale Weak Supervision
Alec Radford
Jong Wook Kim
Tao Xu
Greg Brockman
C. McLeavey
Ilya Sutskever
OffRL
209
3,750
0
06 Dec 2022
ProgPrompt: Generating Situated Robot Task Plans using Large Language Models
Ishika Singh
Valts Blukis
Arsalan Mousavian
Ankit Goyal
Danfei Xu
Jonathan Tremblay
Dieter Fox
Jesse Thomason
Animesh Garg
LM&Ro, LLMAG
175
657
0
22 Sep 2022
GaitFi: Robust Device-Free Human Identification via WiFi and Vision Multimodal Learning
Lang Deng
Jianfei Yang
Shenghai Yuan
Han Zou
Chris Xiaoxuan Lu
Lihua Xie
CVBM
71
43
0
30 Aug 2022
DIRECT: A Differential Dynamic Programming Based Framework for Trajectory Generation
Kun Cao
Muqing Cao
Shenghai Yuan
Lihua Xie
60
17
0
10 Sep 2021
Open-vocabulary Object Detection via Vision and Language Knowledge Distillation
Xiuye Gu
Tsung-Yi Lin
Weicheng Kuo
Yin Cui
VLM, ObjD
293
920
0
28 Apr 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP, VLM
981
29,871
0
26 Feb 2021
Language-Conditioned Imitation Learning for Robot Manipulation Tasks
Simon Stepputtis
Joseph Campbell
Mariano Phielipp
Stefan Lee
Chitta Baral
H. B. Amor
LM&Ro
200
205
0
22 Oct 2020
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
882
42,463
0
28 May 2020
Jacquard: A Large Scale Dataset for Robotic Grasp Detection
Amaury Depierre
Emmanuel Dellandrea
Liming Chen
101
319
0
30 Mar 2018
PDDLStream: Integrating Symbolic Planners and Blackbox Samplers via Optimistic Adaptive Planning
Caelan Reed Garrett
Tomás Lozano-Pérez
L. Kaelbling
72
260
0
23 Feb 2018
Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields
Zhe Cao
Tomas Simon
S. Wei
Yaser Sheikh
3DH
156
6,551
0
24 Nov 2016
You Only Look Once: Unified, Real-Time Object Detection
Joseph Redmon
S. Divvala
Ross B. Girshick
Ali Farhadi
ObjD
724
37,033
0
08 Jun 2015
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat, ObjD
531
62,409
0
04 Jun 2015
Microsoft COCO: Common Objects in Context
Tsung-Yi Lin
Michael Maire
Serge J. Belongie
Lubomir Bourdev
Ross B. Girshick
James Hays
Pietro Perona
Deva Ramanan
C. L. Zitnick
Piotr Dollár
ObjD
434
43,832
0
01 May 2014