ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.00673
  4. Cited By
ADAPT: Action-aware Driving Caption Transformer

ADAPT: Action-aware Driving Caption Transformer

1 February 2023
Bu Jin
Xinyi Liu
Yupeng Zheng
Pengfei Li
Hao Zhao
Tong Zhang
Yuhang Zheng
Guyue Zhou
Jingjing Liu
ArXivPDFHTML

Papers citing "ADAPT: Action-aware Driving Caption Transformer"

50 / 57 papers shown
Title
PADriver: Towards Personalized Autonomous Driving
PADriver: Towards Personalized Autonomous Driving
Genghua Kou
Fan Jia
Weixin Mao
Yong-Jin Liu
Yucheng Zhao
Ziheng Zhang
Osamu Yoshie
Tiancai Wang
Yongbin Li
Xinming Zhang
49
0
0
08 May 2025
UncAD: Towards Safe End-to-end Autonomous Driving via Online Map Uncertainty
UncAD: Towards Safe End-to-end Autonomous Driving via Online Map Uncertainty
Pengxuan Yang
Yupeng Zheng
Qichao Zhang
Kefei Zhu
Zebin Xing
Qiao Lin
Yun-Fu Liu
Zhiguo Su
Dongbin Zhao
32
0
0
17 Apr 2025
Chameleon: Fast-slow Neuro-symbolic Lane Topology Extraction
Zongzheng Zhang
Xinrun Li
Sizhe Zou
Guoxuan Chi
Siqi Li
...
Guoliang Wang
Guantian Zheng
Leichen Wang
Hang Zhao
Hao Zhao
62
0
0
10 Mar 2025
GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous Driving
Zebin Xing
Xinsong Zhang
Yang Hu
Bo Jiang
Tong He
Qian Zhang
Xiaoxiao Long
Wei Yin
67
3
0
07 Mar 2025
SafeAuto: Knowledge-Enhanced Safe Autonomous Driving with Multimodal Foundation Models
J.N. Zhang
Xuan Yang
Tianfu Wang
Yu Yao
Aleksandr Petiushko
B. Li
38
0
0
28 Feb 2025
Traffic Scene Generation from Natural Language Description for Autonomous Vehicles with Large Language Model
Traffic Scene Generation from Natural Language Description for Autonomous Vehicles with Large Language Model
Bo-Kai Ruan
Hao-Tang Tsui
Yung-Hui Li
Hong-Han Shuai
LM&Ro
86
4
0
20 Feb 2025
DeepRTL: Bridging Verilog Understanding and Generation with a Unified Representation Model
DeepRTL: Bridging Verilog Understanding and Generation with a Unified Representation Model
Yi Liu
Changran Xu
Yunhao Zhou
Zhiyu Li
Qiang Xu
VLM
51
4
0
20 Feb 2025
Embodied Scene Understanding for Vision Language Models via MetaVQA
Embodied Scene Understanding for Vision Language Models via MetaVQA
Weizhen Wang
Chenda Duan
Zhenghao Peng
Yuxin Liu
Bolei Zhou
LM&Ro
44
0
0
17 Jan 2025
DriveLM: Driving with Graph Visual Question Answering
DriveLM: Driving with Graph Visual Question Answering
Chonghao Sima
Katrin Renz
Kashyap Chitta
L. Chen
Hanxue Zhang
Chengen Xie
Jens Beißwenger
Ping Luo
Andreas Geiger
Hongyang Li
108
164
0
17 Jan 2025
H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving
H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving
Tian Jin
Yuxiao Luo
Yue Ma
Yu Qiao
Yali Wang
Mamba
50
1
0
08 Jan 2025
Explanation for Trajectory Planning using Multi-modal Large Language
  Model for Autonomous Driving
Explanation for Trajectory Planning using Multi-modal Large Language Model for Autonomous Driving
Shota Yamazaki
Chenyu Zhang
Takuya Nanri
Akio Shigekane
Siyuan Wang
Jo Nishiyama
Tao Chu
Kohei Yokosawa
LRM
41
1
0
15 Nov 2024
Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5%
  Parameters and 90% Performance
Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance
Zhangwei Gao
Zhe Chen
Erfei Cui
Yiming Ren
Weiyun Wang
...
Lewei Lu
Tong Lu
Yu Qiao
Jifeng Dai
Wenhai Wang
VLM
70
24
0
21 Oct 2024
Robust RL with LLM-Driven Data Synthesis and Policy Adaptation for
  Autonomous Driving
Robust RL with LLM-Driven Data Synthesis and Policy Adaptation for Autonomous Driving
Sihao Wu
Jiaxu Liu
Xiangyu Yin
Guangliang Cheng
Xingyu Zhao
Meng Fang
Xinping Yi
Xiaowei Huang
30
0
0
16 Oct 2024
Dual-AEB: Synergizing Rule-Based and Multimodal Large Language Models
  for Effective Emergency Braking
Dual-AEB: Synergizing Rule-Based and Multimodal Large Language Models for Effective Emergency Braking
Wei Zhang
Pengfei Li
Junli Wang
B. S.
Qihao Jin
...
Shibo Rui
Yang Yu
Wenchao Ding
Peng Li
Yilun Chen
36
0
0
11 Oct 2024
Efficient Driving Behavior Narration and Reasoning on Edge Device Using
  Large Language Models
Efficient Driving Behavior Narration and Reasoning on Edge Device Using Large Language Models
Yizhou Huang
Yihua Cheng
Kezhi Wang
LRM
52
1
0
30 Sep 2024
KARMA: Augmenting Embodied AI Agents with Long-and-short Term Memory Systems
KARMA: Augmenting Embodied AI Agents with Long-and-short Term Memory Systems
Zixuan Wang
Bo Yu
Junzhe Zhao
Wenhao Sun
Sai Hou
Shuai Liang
Xing Hu
Yinhe Han
Yiming Gan
49
1
0
23 Sep 2024
MulCPred: Learning Multi-modal Concepts for Explainable Pedestrian
  Action Prediction
MulCPred: Learning Multi-modal Concepts for Explainable Pedestrian Action Prediction
Yan Feng
Alexander Carballo
Keisuke Fujii
Robin Karlsson
Ming Ding
K. Takeda
31
0
0
14 Sep 2024
Hint-AD: Holistically Aligned Interpretability in End-to-End Autonomous
  Driving
Hint-AD: Holistically Aligned Interpretability in End-to-End Autonomous Driving
Kairui Ding
Boyuan Chen
Yuchen Su
Huan-ang Gao
Bu Jin
...
Wuqiang Zhang
Xiaohui Li
Paul Barsch
Hongyang Li
Hao Zhao
58
3
0
10 Sep 2024
ChatSUMO: Large Language Model for Automating Traffic Scenario
  Generation in Simulation of Urban MObility
ChatSUMO: Large Language Model for Automating Traffic Scenario Generation in Simulation of Urban MObility
Shuyang Li
Talha Azfar
Ruimin Ke
LLMAG
26
8
0
29 Aug 2024
CoVLA: Comprehensive Vision-Language-Action Dataset for Autonomous
  Driving
CoVLA: Comprehensive Vision-Language-Action Dataset for Autonomous Driving
Hidehisa Arai
Keita Miwa
Kento Sasaki
Yu Yamaguchi
Kohei Watanabe
Shunsuke Aoki
Issei Yamamoto
54
9
0
19 Aug 2024
Multi-Frame Vision-Language Model for Long-form Reasoning in Driver
  Behavior Analysis
Multi-Frame Vision-Language Model for Long-form Reasoning in Driver Behavior Analysis
Hiroshi Takato
Hiroshi Tsutsui
Komei Soda
Hidetaka Kamigaito
VLM
35
0
0
03 Aug 2024
Large Language Models for Human-like Autonomous Driving: A Survey
Large Language Models for Human-like Autonomous Driving: A Survey
Yun Li
Kai Katsumata
Ehsan Javanmardi
Manabu Tsukada
LM&MA
52
6
0
27 Jul 2024
Tell Me Where You Are: Multimodal LLMs Meet Place Recognition
Tell Me Where You Are: Multimodal LLMs Meet Place Recognition
Zonglin Lyu
Juexiao Zhang
Mingxuan Lu
Yiming Li
Chen Feng
49
5
0
25 Jun 2024
Do More Details Always Introduce More Hallucinations in LVLM-based Image
  Captioning?
Do More Details Always Introduce More Hallucinations in LVLM-based Image Captioning?
Mingqian Feng
Yunlong Tang
Zeliang Zhang
Chenliang Xu
42
3
0
18 Jun 2024
DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and
  Social Experiences
DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experiences
Yidong Huang
Jacob Sansom
Ziqiao Ma
Felix Gervits
Joyce Chai
44
17
0
05 Jun 2024
PlanAgent: A Multi-modal Large Language Agent for Closed-loop Vehicle
  Motion Planning
PlanAgent: A Multi-modal Large Language Agent for Closed-loop Vehicle Motion Planning
Yupeng Zheng
Zebin Xing
Qichao Zhang
Bu Jin
Pengfei Li
...
Zhongpu Xia
Kun Zhan
Xianpeng Lang
Yaran Chen
Dongbin Zhao
LM&Ro
LRM
LLMAG
65
14
0
03 Jun 2024
Hard Cases Detection in Motion Prediction by Vision-Language Foundation
  Models
Hard Cases Detection in Motion Prediction by Vision-Language Foundation Models
Yi Yang
Qingwen Zhang
Kei Ikemura
Nazre Batool
John Folkesson
VLM
33
1
0
31 May 2024
On the Utility of External Agent Intention Predictor for Human-AI
  Coordination
On the Utility of External Agent Intention Predictor for Human-AI Coordination
Chenxu Wang
Zilong Chen
Angelo Cangelosi
Huaping Liu
39
1
0
03 May 2024
Can Vehicle Motion Planning Generalize to Realistic Long-tail Scenarios?
Can Vehicle Motion Planning Generalize to Realistic Long-tail Scenarios?
Marcel Hallgarten
Julian Zapata
Martin Stoll
Katrin Renz
Andreas Zell
46
10
0
11 Apr 2024
Prompting Multi-Modal Tokens to Enhance End-to-End Autonomous Driving
  Imitation Learning with LLMs
Prompting Multi-Modal Tokens to Enhance End-to-End Autonomous Driving Imitation Learning with LLMs
Yiqun Duan
Qiang Zhang
Renjing Xu
38
9
0
07 Apr 2024
Idea-2-3D: Collaborative LMM Agents Enable 3D Model Generation from
  Interleaved Multimodal Inputs
Idea-2-3D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs
Junhao Chen
Xiang Li
Xiaojun Ye
Chao Li
Zhaoxin Fan
Hao Zhao
VGen
3DV
214
4
0
05 Apr 2024
TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes
TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes
Bu Jin
Yupeng Zheng
Pengfei Li
Weize Li
Yuhang Zheng
...
Kun Zhan
Peng Jia
Xiaoxiao Long
Yilun Chen
Hao Zhao
3DV
79
15
0
28 Mar 2024
P-MapNet: Far-seeing Map Generator Enhanced by both SDMap and HDMap
  Priors
P-MapNet: Far-seeing Map Generator Enhanced by both SDMap and HDMap Priors
Zhou Jiang
Zhenxin Zhu
Pengfei Li
Huan-ang Gao
Tianyuan Yuan
Yongliang Shi
Hang Zhao
Hao Zhao
38
22
0
15 Mar 2024
MonoOcc: Digging into Monocular Semantic Occupancy Prediction
MonoOcc: Digging into Monocular Semantic Occupancy Prediction
Yupeng Zheng
Xiang Li
Pengfei Li
Yuhang Zheng
Bu Jin
Chengliang Zhong
Xiaoxiao Long
Hao Zhao
Qichao Zhang
31
25
0
13 Mar 2024
Embodied Understanding of Driving Scenarios
Embodied Understanding of Driving Scenarios
Yunsong Zhou
Linyan Huang
Qingwen Bu
Jia Zeng
Tianyu Li
Hang Qiu
Hongzi Zhu
Minyi Guo
Yu Qiao
Hongyang Li
LM&Ro
62
31
0
07 Mar 2024
RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented
  In-Context Learning in Multi-Modal Large Language Model
RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language Model
Jianhao Yuan
Shuyang Sun
Daniel Omeiza
Bo Zhao
Paul Newman
Lars Kunze
Matthew Gadd
LRM
36
49
0
16 Feb 2024
Using Left and Right Brains Together: Towards Vision and Language
  Planning
Using Left and Right Brains Together: Towards Vision and Language Planning
Jun Cen
Chenfei Wu
Xiao Liu
Sheng-Siang Yin
Yixuan Pei
Jinglong Yang
Qifeng Chen
Nan Duan
Jianguo Zhang
68
3
0
16 Feb 2024
Delving into Multi-modal Multi-task Foundation Models for Road Scene
  Understanding: From Learning Paradigm Perspectives
Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives
Sheng Luo
Wei Chen
Wanxin Tian
Rui Liu
Luanxuan Hou
...
Ling Shao
Yi Yang
Bojun Gao
Qun Li
Guobin Wu
51
13
0
05 Feb 2024
Prospective Role of Foundation Models in Advancing Autonomous Vehicles
Prospective Role of Foundation Models in Advancing Autonomous Vehicles
Jianhua Wu
B. Gao
Jincheng Gao
Jianhao Yu
Hongqing Chu
...
Xun Gong
Yi Chang
H. E. Tseng
Hong Chen
Jie Chen
45
3
0
08 Dec 2023
LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language
  Model Programs
LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language Model Programs
Yunsheng Ma
Can Cui
Xu Cao
Wenqian Ye
Peiran Liu
...
Rohit Gupta
Kyungtae Han
Aniket Bera
James M. Rehg
Ziran Wang
35
42
0
07 Dec 2023
Empowering Autonomous Driving with Large Language Models: A Safety
  Perspective
Empowering Autonomous Driving with Large Language Models: A Safety Perspective
Yixuan Wang
Ruochen Jiao
Sinong Simon Zhan
Chengtian Lang
Chao Huang
Zhaoran Wang
Zhuoran Yang
Qi Zhu
40
27
0
28 Nov 2023
Applications of Large Scale Foundation Models for Autonomous Driving
Applications of Large Scale Foundation Models for Autonomous Driving
Yu Huang
Yue Chen
Zhu Li
ELM
AI4CE
LRM
ALM
LM&Ro
61
15
0
20 Nov 2023
Human-Centric Autonomous Systems With LLMs for User Command Reasoning
Human-Centric Autonomous Systems With LLMs for User Command Reasoning
Yi Yang
Qingwen Zhang
Ci Li
Daniel Simoes Marta
Nazre Batool
John Folkesson
LRM
70
29
0
14 Nov 2023
What Makes a Fantastic Passenger-Car Driver in Urban Contexts?
Yueteng Yu
Zhijie Yi
Xinyu Yang
Mengdi Chu
Junrong Lu
...
Jialin Song
Xingrui Gu
Jirui Yuan
Guyue Zhou
Jiangtao Gong
31
0
0
07 Nov 2023
Vision Language Models in Autonomous Driving: A Survey and Outlook
Vision Language Models in Autonomous Driving: A Survey and Outlook
Xingcheng Zhou
Mingyu Liu
Ekim Yurtsever
B. L. Žagar
Walter Zimmer
Hu Cao
Alois C. Knoll
VLM
37
39
0
22 Oct 2023
LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving
LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving
Hao Sha
Yao Mu
Yuxuan Jiang
Li Chen
Chenfeng Xu
Ping Luo
Shengbo Eben Li
Masayoshi Tomizuka
Wei Zhan
Mingyu Ding
126
159
0
04 Oct 2023
Driving with LLMs: Fusing Object-Level Vector Modality for Explainable
  Autonomous Driving
Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving
Long Chen
Oleg Sinavski
Jan Hünermann
Alice Karnsund
Andrew James Willmott
Danny Birch
Daniel Maund
Jamie Shotton
MLLM
20
182
0
03 Oct 2023
DriveGPT4: Interpretable End-to-end Autonomous Driving via Large
  Language Model
DriveGPT4: Interpretable End-to-end Autonomous Driving via Large Language Model
Zhenhua Xu
Yujia Zhang
Enze Xie
Zhen Zhao
Yong Guo
Kwan-Yee. K. Wong
Zhenguo Li
Hengshuang Zhao
MLLM
22
255
0
02 Oct 2023
MARS: An Instance-aware, Modular and Realistic Simulator for Autonomous
  Driving
MARS: An Instance-aware, Modular and Realistic Simulator for Autonomous Driving
Zirui Wu
Tianyu Liu
Liyi Luo
Zhide Zhong
Jianteng Chen
...
Xiaoyu Ye
Zike Yan
Yongliang Shi
Yiyi Liao
Hao Zhao
32
121
0
27 Jul 2023
End-to-end Autonomous Driving: Challenges and Frontiers
End-to-end Autonomous Driving: Challenges and Frontiers
Li Chen
Peng Wu
Kashyap Chitta
Bernhard Jaeger
Andreas Geiger
Hongyang Li
3DV
58
264
0
29 Jun 2023
12
Next