Title |
---|
HE-Drive: Human-Like End-to-End Driving with Vision Language Models. Junming Wang, Xingyu Zhang, Zebin Xing, Songen Gu, Xiaoyang Guo, Yang Hu, Ziying Song, Qian Zhang, Xiaoxiao Long, Wei Yin |
Hint-AD: Holistically Aligned Interpretability in End-to-End Autonomous Driving. Kairui Ding, Boyuan Chen, Yuchen Su, Huan-ang Gao, Bu Jin, ..., Wuqiang Zhang, Xiaohui Li, Paul Barsch, Hongyang Li, Hao Zhao |
MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans? Yi-Fan Zhang, Huanyu Zhang, Haochen Tian, Chaoyou Fu, Shuangqing Zhang, ..., Qingsong Wen, Zhang Zhang, L. Wang, Rong Jin, Tieniu Tan |