2503.14948
ChatStitch: Visualizing Through Structures via Surround-View Unsupervised Deep Image Stitching with Collaborative LLM-Agents
19 March 2025
Hao Liang
Zhipeng Dong
Kaixin Chen
M. Fu
Yufeng Yue
Yi Yang
Mengyin Fu
Papers citing "ChatStitch: Visualizing Through Structures via Surround-View Unsupervised Deep Image Stitching with Collaborative LLM-Agents"
50 / 51 papers shown
Title
YOLOv10: Real-Time End-to-End Object Detection
Ao Wang
Hui Chen
Lihao Liu
Kai Chen
Zijia Lin
Jungong Han
Guiguang Ding
3DH
103
1,119
0
23 May 2024
VSRD: Instance-Aware Volumetric Silhouette Rendering for Weakly Supervised 3D Object Detection
Zihua Liu
Hiroki Sakuma
Masatoshi Okutomi
121
3
0
29 Mar 2024
Eliminating Warping Shakes for Unsupervised Online Video Stitching
Lang Nie
Chunyu Lin
K. Liao
Yun Zhang
Shuaicheng Liu
Rui Ai
Yao Zhao
56
4
0
11 Mar 2024
PEM: Prototype-based Efficient MaskFormer for Image Segmentation
Niccolò Cavagnero
Gabriele Rosi
Claudia Cuttano
Francesca Pistilli
Marco Ciccone
Giuseppe Averta
Fabio Cermelli
95
22
0
29 Feb 2024
UniMODE: Unified Monocular 3D Object Detection
Zhuoling Li
Xiaogang Xu
Sernam Lim
Hengshuang Zhao
74
3
0
28 Feb 2024
Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents
Yuxi Wei
Zi Wang
Yifan Lu
Chenxin Xu
Chang-rui Liu
Hao Zhao
Siheng Chen
Yanfeng Wang
VGen
105
75
0
08 Feb 2024
SCTNet: Single-Branch CNN with Transformer Semantic Information for Real-Time Segmentation
Zhengze Xu
Dongyue Wu
Changqian Yu
Xiangxiang Chu
Nong Sang
Changxin Gao
ViT
95
56
0
28 Dec 2023
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
Yunyang Xiong
Bala Varadarajan
Lemeng Wu
Xiaoyu Xiang
Fanyi Xiao
...
Dilin Wang
Fei Sun
Forrest N. Iandola
Raghuraman Krishnamoorthi
Vikas Chandra
VLM
87
156
0
01 Dec 2023
Receive, Reason, and React: Drive as You Say with Large Language Models in Autonomous Vehicles
Can Cui
Yunsheng Ma
Xu Cao
Wenqian Ye
Ziran Wang
80
89
0
12 Oct 2023
DrivingDiffusion: Layout-Guided multi-view driving scene video generation with latent diffusion model
Xiaofan Li
Yifu Zhang
Xiaoqing Ye
VGen
111
78
0
11 Oct 2023
MagicDrive: Street View Generation with Diverse 3D Geometry Control
Ruiyuan Gao
Kai Chen
Enze Xie
Lanqing Hong
Zhenguo Li
Dit-Yan Yeung
Qiang Xu
DiffM
80
121
0
04 Oct 2023
DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
Xiaofeng Wang
Zheng Hua Zhu
Guan Huang
Xinze Chen
Jiagang Zhu
Jiwen Lu
VGen
96
164
0
18 Sep 2023
UniSim: A Neural Closed-Loop Sensor Simulator
Ze Yang
Yun Chen
Jingkang Wang
S. Manivasagam
Wei-Chiu Ma
A. Yang
R. Urtasun
94
200
0
03 Aug 2023
BEVControl: Accurately Controlling Street-view Elements with Multi-perspective Consistency via BEV Sketch Layout
Kairui Yang
Enhui Ma
Jibing Peng
Qing Guo
Di Lin
Kaicheng Yu
DiffM
80
65
0
03 Aug 2023
MARS: An Instance-aware, Modular and Realistic Simulator for Autonomous Driving
Zirui Wu
Tianyu Liu
Liyi Luo
Zhide Zhong
Jianteng Chen
...
Xiaoyu Ye
Zike Yan
Yongliang Shi
Yiyi Liao
Hao Zhao
139
130
0
27 Jul 2023
RepViT: Revisiting Mobile CNN From ViT Perspective
Ao Wang
Hui Chen
Zijia Lin
Hengjun Pu
Guiguang Ding
66
207
0
18 Jul 2023
HM-ViT: Hetero-modal Vehicle-to-Vehicle Cooperative perception with vision transformer
Hao Xiang
Runsheng Xu
Jiaqi Ma
60
51
0
20 Apr 2023
Collaboration Helps Camera Overtake LiDAR in 3D Detection
Yue Hu
Yifan Lu
Runsheng Xu
Weidi Xie
Siheng Chen
Yanfeng Wang
80
74
0
23 Mar 2023
VIMI: Vehicle-Infrastructure Multi-view Intermediate Fusion for Camera-based 3D Object Detection
Zhe Wang
Siqi Fan
Xiao Huo
Tongda Xu
Yan Wang
Jingjing Liu
Yilun Chen
Ya Zhang
3DPC
52
17
0
20 Mar 2023
Parallax-Tolerant Unsupervised Deep Image Stitching
Lang Nie
Chunyu Lin
K. Liao
Shuaicheng Liu
Yao Zhao
61
49
0
16 Feb 2023
Collaborative Perception in Autonomous Driving: Methods, Datasets and Challenges
Yushan Han
Hui Zhang
Huifang Li
Yi Jin
Congyan Lang
Yidong Li
69
110
0
16 Jan 2023
Street-View Image Generation from a Bird's-Eye View Layout
Alexander Swerdlow
Runsheng Xu
Bolei Zhou
82
73
0
11 Jan 2023
Visual Programming: Compositional visual reasoning without training
Tanmay Gupta
Aniruddha Kembhavi
ReLM
VLM
LRM
141
438
0
18 Nov 2022
Where2comm: Communication-Efficient Collaborative Perception via Spatial Confidence Maps
Yue Hu
Shaoheng Fang
Zixing Lei
Yiqi Zhong
Siheng Chen
127
230
0
26 Sep 2022
Weakly-Supervised Stitching Network for Real-World Panoramic Image Generation
Dae-Young Song
Geonsoo Lee
H. Lee
Gi-Mun Um
Donghyeon Cho
68
23
0
13 Sep 2022
Latency-Aware Collaborative Perception
Zixing Lei
Shunli Ren
Yue Hu
Wenjun Zhang
Siheng Chen
84
98
0
18 Jul 2022
CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse Transformers
Runsheng Xu
Zhengzhong Tu
Hao Xiang
Wei Shao
Bolei Zhou
Jiaqi Ma
102
222
0
05 Jul 2022
DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure Cooperative 3D Object Detection
Haibao Yu
Yizhen Luo
Mao Shu
Yiyi Huo
Zebang Yang
...
Zhenglong Guo
Hanyu Li
Xing Hu
Jirui Yuan
Zaiqing Nie
3DPC
77
330
0
12 Apr 2022
V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer
Runsheng Xu
Hao Xiang
Zhengzhong Tu
Xin Xia
Ming-Hsuan Yang
Jiaqi Ma
ViT
200
376
0
20 Mar 2022
V2X-Sim: Multi-Agent Collaborative Perception Dataset and Benchmark for Autonomous Driving
Yiming Li
Dekun Ma
Ziyan An
Zixun Wang
Yiqi Zhong
Siheng Chen
Chen Feng
84
224
0
17 Feb 2022
Instant Neural Graphics Primitives with a Multiresolution Hash Encoding
Thomas Müller
Alex Evans
Christoph Schied
A. Keller
336
4,045
0
16 Jan 2022
Multi-Robot Collaborative Perception with Graph Neural Networks
Yang Zhou
Jiuhong Xiao
Yuee Zhou
Giuseppe Loianno
65
66
0
05 Jan 2022
PandaSet: Advanced Sensor Suite Dataset for Autonomous Driving
Pengchuan Xiao
Zhenlei Shao
Steven Hao
Zishuo Zhang
Xiaolin Chai
...
Jian Wu
Kai Sun
Kun Jiang
Yunlong Wang
Diange Yang
3DV
3DPC
62
190
0
23 Dec 2021
Learning Distilled Collaboration Graph for Multi-Agent Perception
Yiming Li
Shunli Ren
Pengxiang Wu
Siheng Chen
Chen Feng
Wenjun Zhang
90
243
0
01 Nov 2021
Keypoints-Based Deep Feature Fusion for Cooperative Vehicle Detection of Autonomous Driving
Yunshuang Yuan
Hao Cheng
Monika Sester
148
91
0
23 Sep 2021
OPV2V: An Open Benchmark Dataset and Fusion Pipeline for Perception with Vehicle-to-Vehicle Communication
Runsheng Xu
Hao Xiang
Xin Xia
Xu Han
Jinlong Liu
Jiaqi Ma
55
373
0
16 Sep 2021
Depth-Aware Multi-Grid Deep Homography Estimation with Contextual Correlation
Lang Nie
Chunyu Lin
K. Liao
Shuaicheng Liu
Yao Zhao
66
66
0
06 Jul 2021
Unsupervised Deep Image Stitching: Reconstructing Stitched Features to Images
Lang Nie
Chunyu Lin
K. Liao
Shuaicheng Liu
Yao-Min Zhao
75
170
0
24 Jun 2021
V2VNet: Vehicle-to-Vehicle Communication for Joint Perception and Prediction
Tsun-Hsuan Wang
S. Manivasagam
Ming Liang
Bin Yang
Wenyuan Zeng
James Tu
R. Urtasun
61
398
0
17 Aug 2020
When2com: Multi-Agent Perception via Communication Graph Grouping
Yen-Cheng Liu
Junjiao Tian
Nathan Glaser
Z. Kira
AI4CE
97
211
0
30 May 2020
Who2com: Collaborative Perception via Learnable Handshake Communication
Yen-Cheng Liu
Junjiao Tian
Chih-Yao Ma
Nathan Glaser
Chia-Wen Kuo
Z. Kira
71
165
0
21 Mar 2020
Cooperative Perception for 3D Object Detection in Driving Scenarios using Infrastructure Sensors
Eduardo Arnold
M. Dianati
R. de Temple
Saber Fallah
3DPC
46
245
0
18 Dec 2019
Scalability in Perception for Autonomous Driving: Waymo Open Dataset
Pei Sun
Henrik Kretzschmar
Xerxes Dotiwalla
Aurelien Chouard
Vijaysai Patnaik
...
Shuyang Cheng
Yu Zhang
Jonathon Shlens
Zhifeng Chen
Dragomir Anguelov
139
2,902
0
10 Dec 2019
Cooper: Cooperative Perception for Connected Autonomous Vehicles based on 3D Point Clouds
Qi Chen
Sihai Tang
Qing Yang
Song Fu
3DPC
79
306
0
13 May 2019
nuScenes: A multimodal dataset for autonomous driving
Holger Caesar
Varun Bankiti
Alex H. Lang
Sourabh Vora
Venice Erin Liong
Qiang Xu
Anush Krishnan
Yuxin Pan
G. Baldan
Oscar Beijbom
3DPC
301
5,770
0
26 Mar 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.8K
95,114
0
11 Oct 2018
Single-Perspective Warps in Natural Image Stitching
Tianli Liao
Nan Li
36
115
0
13 Feb 2018
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
728
132,199
0
12 Jun 2017
AirSim: High-Fidelity Visual and Physical Simulation for Autonomous Vehicles
S. Shah
Debadeepta Dey
Chris Lovett
Ashish Kapoor
94
2,001
0
15 May 2017
Deep Image Homography Estimation
Daniel DeTone
Tomasz Malisiewicz
Andrew Rabinovich
59
441
0
13 Jun 2016