DRAMA: Joint Risk Localization and Captioning in Driving


22 September 2022
Srikanth Malla
Chiho Choi
Isht Dwivedi
Joonhyang Choi
Jiachen Li

Papers citing "DRAMA: Joint Risk Localization and Captioning in Driving"

50 / 57 papers shown
Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving
Zongchuang Zhao
Haoyu Fu
Dingkang Liang
Xin Zhou
Dingyuan Zhang
Hongwei Xie
Bing Wang
Xiang Bai
MLLM
VLM
49
0
0
13 May 2025
Deep Learning Advances in Vision-Based Traffic Accident Anticipation: A Comprehensive Review of Methods, Datasets, and Future Directions
Yi Zhang
Wenye Zhou
Ruonan Lin
Xin Yang
Hao Zheng
31
0
0
12 May 2025
DriveSOTIF: Advancing Perception SOTIF Through Multimodal Large Language Models
Shucheng Huang
Freda Shi
Chen Sun
Jiaming Zhong
Minghao Ning
Yufeng Yang
Yukun Lu
Hong Wang
A. Khajepour
26
0
0
11 May 2025
Multimodal Large Language Models for Enhanced Traffic Safety: A Comprehensive Review and Future Trends
M. Tami
Mohammed Elhenawy
Huthaifa I. Ashqar
31
0
0
21 Apr 2025
Are Vision LLMs Road-Ready? A Comprehensive Benchmark for Safety-Critical Driving Video Understanding
Tong Zeng
Longfeng Wu
Liang Shi
Dawei Zhou
Feng Guo
25
0
0
20 Apr 2025
Fine-Grained Evaluation of Large Vision-Language Models in Autonomous Driving
Yue Li
Meng Tian
Zhenyu Lin
Jiangtong Zhu
Dechang Zhu
Haiqiang Liu
Zining Wang
Yueyi Zhang
Zhiwei Xiong
Xinhai Zhao
CoGe
VLM
80
1
0
27 Mar 2025
ATARS: An Aerial Traffic Atomic Activity Recognition and Temporal Segmentation Dataset
Zihao Chen
Hsuanyu Wu
Chi-Hsi Kung
Yi-Ting Chen
Yan-Tsung Peng
42
0
0
24 Mar 2025
DynRsl-VLM: Enhancing Autonomous Driving Perception with Dynamic Resolution Vision-Language Models
Xirui Zhou
Lianlei Shan
Xiaolin Gui
58
0
0
14 Mar 2025
HazardNet: A Small-Scale Vision Language Model for Real-Time Traffic Safety Detection at Edge Devices
M. Tami
Mohammed Elhenawy
Huthaifa I. Ashqar
38
0
0
27 Feb 2025
Embodied Scene Understanding for Vision Language Models via MetaVQA
Weizhen Wang
Chenda Duan
Zhenghao Peng
Yuxin Liu
Bolei Zhou
LM&Ro
44
0
0
17 Jan 2025
DriveLM: Driving with Graph Visual Question Answering
Chonghao Sima
Katrin Renz
Kashyap Chitta
L. Chen
Hanxue Zhang
Chengen Xie
Jens Beißwenger
Ping Luo
Andreas Geiger
Hongyang Li
96
162
0
17 Jan 2025
H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving
S. Chen
Yuxiao Luo
Yue Ma
Yu Qiao
Yali Wang
Mamba
42
1
0
08 Jan 2025
doScenes: An Autonomous Driving Dataset with Natural Language Instruction for Human Interaction and Vision-Language Navigation
Parthib Roy
Srinivasa Perisetla
Shashank Shriram
Harsha Krishnaswamy
Aryan Keskar
Ross Greer
VGen
72
2
0
08 Dec 2024
On-Road Object Importance Estimation: A New Dataset and A Model with Multi-Fold Top-Down Guidance
Zhixiong Nan
Yilong Chen
Tianfei Zhou
Tao Xiang
72
0
0
26 Nov 2024
Explanation for Trajectory Planning using Multi-modal Large Language Model for Autonomous Driving
Shota Yamazaki
Chenyu Zhang
Takuya Nanri
Akio Shigekane
Siyuan Wang
Jo Nishiyama
Tao Chu
Kohei Yokosawa
LRM
36
1
0
15 Nov 2024
Driving with Regulation: Interpretable Decision-Making for Autonomous Vehicles with Retrieval-Augmented Reasoning via LLM
Tianhui Cai
Yifan Liu
Zewei Zhou
Haoxuan Ma
Seth Z. Zhao
Zhiwen Wu
Jiaqi Ma
42
7
0
07 Oct 2024
Video Token Sparsification for Efficient Multimodal LLMs in Autonomous Driving
Yunsheng Ma
Amr Abdelraouf
Rohit Gupta
Ziran Wang
Kyungtae Han
23
3
0
16 Sep 2024
Hint-AD: Holistically Aligned Interpretability in End-to-End Autonomous Driving
Kairui Ding
Boyuan Chen
Yuchen Su
Huan-ang Gao
Bu Jin
...
Wuqiang Zhang
Xiaohui Li
Paul Barsch
Hongyang Li
Hao Zhao
50
3
0
10 Sep 2024
How Could Generative AI Support Compliance with the EU AI Act? A Review for Safe Automated Driving Perception
Mert Keser
Youssef Shoeb
Alois Knoll
39
2
0
30 Aug 2024
CoVLA: Comprehensive Vision-Language-Action Dataset for Autonomous Driving
Hidehisa Arai
Keita Miwa
Kento Sasaki
Yu Yamaguchi
Kohei Watanabe
Shunsuke Aoki
Issei Yamamoto
48
9
0
19 Aug 2024
WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding
Quan Kong
Yuki Kawana
Rajat Saini
Ashutosh Kumar
Jingjing Pan
...
Yohei Ozao
Balázs Opra
D. Anastasiu
Yoichi Sato
N. Kobori
VGen
31
7
0
22 Jul 2024
VisionTrap: Vision-Augmented Trajectory Prediction Guided by Textual Descriptions
Seokha Moon
Hyun Woo
Hongbeen Park
Haeji Jung
R. Mahjourian
Hyung-Gun Chi
Hyerin Lim
Sangpil Kim
Jinkyu Kim
36
5
0
17 Jul 2024
WOMD-Reasoning: A Large-Scale Language Dataset for Interaction and Driving Intentions Reasoning
Yiheng Li
Chongjian Ge
Chenran Li
Chenfeng Xu
M. Tomizuka
Chen Tang
Mingyu Ding
Wei Zhan
VGen
LRM
34
0
0
05 Jul 2024
Using Multimodal Large Language Models for Automated Detection of Traffic Safety Critical Events
M. Tami
Huthaifa I. Ashqar
Mohammed Elhenawy
37
3
0
19 Jun 2024
OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning
Shihao Wang
Zhiding Yu
Xiaohui Jiang
Shiyi Lan
Min Shi
Nadine Chang
Jan Kautz
Ying Li
Jose M. Alvarez
LRM
40
47
0
02 May 2024
Instance-free Text to Point Cloud Localization with Relative Position Awareness
Lichao Wang
Zhihao Yuan
Jinke Ren
Shuguang Cui
Zhen Li
36
0
0
27 Apr 2024
Physical Backdoor Attack can Jeopardize Driving with Vision-Large-Language Models
Zhenyang Ni
Rui Ye
Yuxian Wei
Zhen Xiang
Yanfeng Wang
Siheng Chen
AAML
32
9
0
19 Apr 2024
Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases
Kai Chen
Yanze Li
Wenhua Zhang
Yanxin Liu
Pengxiang Li
...
Xinhai Zhao
Zhenguo Li
Dit-Yan Yeung
Huchuan Lu
Xu Jia
ELM
MLLM
54
28
0
16 Apr 2024
IDD-X: A Multi-View Dataset for Ego-relative Important Object Localization and Explanation in Dense and Unstructured Traffic
Chirag Parikh
Rohit Saluja
C. V. Jawahar
Ravi Kiran Sarvadevabhatla
27
2
0
12 Apr 2024
TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes
Bu Jin
Yupeng Zheng
Pengfei Li
Weize Li
Yuhang Zheng
...
Kun Zhan
Peng Jia
Xiaoxiao Long
Yilun Chen
Hao Zhao
3DV
71
15
0
28 Mar 2024
DriveCoT: Integrating Chain-of-Thought Reasoning with End-to-End Driving
Tianqi Wang
Enze Xie
Ruihang Chu
Zhenguo Li
Ping Luo
LRM
34
15
0
25 Mar 2024
Hallucination Detection in Foundation Models for Decision-Making: A Flexible Definition and Review of the State of the Art
Neeloy Chakraborty
Melkior Ornik
Katherine Driggs-Campbell
LRM
57
9
0
25 Mar 2024
Driving Style Alignment for LLM-powered Driver Agent
Ruoxuan Yang
Xinyue Zhang
Anais Fernandez-Laaksonen
Xin Ding
Jiangtao Gong
33
10
0
17 Mar 2024
Embodied Understanding of Driving Scenarios
Yunsong Zhou
Linyan Huang
Qingwen Bu
Jia Zeng
Tianyu Li
Hang Qiu
Hongzi Zhu
Minyi Guo
Yu Qiao
Hongyang Li
LM&Ro
60
31
0
07 Mar 2024
DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models
Xiaoyu Tian
Junru Gu
Bailin Li
Yicheng Liu
Yang Wang
Chenxu Hu
Kun Zhan
Peng Jia
Xianpeng Lang
Hang Zhao
VLM
67
124
0
19 Feb 2024
Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives
Sheng Luo
Wei-Neng Chen
Wanxin Tian
Rui Liu
Luanxuan Hou
...
Ling Shao
Yi Yang
Bojun Gao
Qun Li
Guobin Wu
51
13
0
05 Feb 2024
Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities
Xu Yan
Haiming Zhang
Yingjie Cai
Jingming Guo
Weichao Qiu
...
Lihui Jiang
Wei Zhang
Hongbo Zhang
Dengxin Dai
Bingbing Liu
54
17
0
16 Jan 2024
A Survey on Autonomous Driving Datasets: Statistics, Annotation Quality, and a Future Outlook
Mingyu Liu
Ekim Yurtsever
Jonathan Fossaert
Xingcheng Zhou
Walter Zimmer
Yuning Cui
B. L. Žagar
Alois C. Knoll
56
36
0
02 Jan 2024
Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models
Xinpeng Ding
Jianhua Han
Hang Xu
Xiaodan Liang
Wei Zhang
Xiaomeng Li
31
38
0
02 Jan 2024
NuScenes-MQA: Integrated Evaluation of Captions and QA for Autonomous Driving Datasets using Markup Annotations
Yuichi Inoue
Yuki Yada
Kotaro Tanahashi
Yu Yamaguchi
27
17
0
11 Dec 2023
Towards Knowledge-driven Autonomous Driving
Xin Li
Yeqi Bai
Pinlong Cai
Licheng Wen
Daocheng Fu
...
Yikang Li
Botian Shi
Yong-Jin Liu
Liang He
Yu Qiao
32
26
0
07 Dec 2023
Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving
Ming-Jun Nie
Renyuan Peng
Chunwei Wang
Xinyue Cai
Jianhua Han
Hang Xu
Li Zhang
LRM
29
45
0
06 Dec 2023
RiskBench: A Scenario-based Benchmark for Risk Identification
Chi-Hsi Kung
Chieh-Chi Yang
Pang-Yuan Pao
Shu-Wei Lu
Pin-Lun Chen
Hsin-Cheng Lu
Yi-Ting Chen
39
5
0
04 Dec 2023
Empowering Autonomous Driving with Large Language Models: A Safety Perspective
Yixuan Wang
Ruochen Jiao
Sinong Simon Zhan
Chengtian Lang
Chao Huang
Zhaoran Wang
Zhuoran Yang
Qi Zhu
26
27
0
28 Nov 2023
A Survey on Multimodal Large Language Models for Autonomous Driving
Can Cui
Yunsheng Ma
Xu Cao
Wenqian Ye
Yang Zhou
...
Xinrui Yan
Shuqi Mei
Jianguo Cao
Ziran Wang
Chao Zheng
41
251
0
21 Nov 2023
Human-Centric Autonomous Systems With LLMs for User Command Reasoning
Yi Yang
Qingwen Zhang
Ci Li
Daniel Simoes Marta
Nazre Batool
John Folkesson
LRM
62
29
0
14 Nov 2023
DRUformer: Enhancing the driving scene Important object detection with driving relationship self-understanding
Yingjie Niu
Ming Ding
Keisuke Fujii
Kento Ohtani
Alexander Carballo
K. Takeda
ViT
36
0
0
11 Nov 2023
LLM4Drive: A Survey of Large Language Models for Autonomous Driving
Zhenjie Yang
Xiaosong Jia
Hongyang Li
Junchi Yan
ELM
36
94
0
02 Nov 2023
Driving through the Concept Gridlock: Unraveling Explainability Bottlenecks in Automated Driving
J. Echterhoff
An Yan
Kyungtae Han
Amr Abdelraouf
Rohit Gupta
Julian McAuley
13
7
0
25 Oct 2023
Vision Language Models in Autonomous Driving: A Survey and Outlook
Xingcheng Zhou
Mingyu Liu
Ekim Yurtsever
B. L. Žagar
Walter Zimmer
Hu Cao
Alois C. Knoll
VLM
29
35
0
22 Oct 2023