Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2408.01343
Cited By
StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation
2 August 2024
Yue Duan
Zhangxuan Gu
ZhenZhe Ying
Changhua Meng
Xuelong Li
Re-assign community
ArXiv
PDF
HTML
Papers citing
"StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation"
18 / 18 papers shown
Title
Benchmarking Multi-modal Semantic Segmentation under Sensor Failures: Missing and Noisy Modality Robustness
Chenfei Liao
Kaiyu Lei
Xu Zheng
Junha Moon
Zhixiong Wang
Yansen Wang
Danda Pani Paudel
Luc Van Gool
Xuming Hu
VLM
87
7
0
24 Mar 2025
Sitcom-Crafter: A Plot-Driven Human Motion Generation System in 3D Scenes
Jianqi Chen
Panwen Hu
Xiaojun Chang
Z. Shi
Michael C. Kampffmeyer
Xiaodan Liang
65
5
0
14 Oct 2024
Multi-interactive Feature Learning and a Full-time Multi-modality Benchmark for Image Fusion and Segmentation
Jinyuan Liu
Zhu Liu
Guanyao Wu
Long Ma
Risheng Liu
Wei Zhong
Zhongxuan Luo
Xin-Yue Fan
28
123
0
04 Aug 2023
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers
Jiaming Zhang
Huayao Liu
Kailun Yang
Xinxin Hu
Ruiping Liu
Rainer Stiefelhagen
ViT
41
307
0
09 Mar 2022
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers
Enze Xie
Wenhai Wang
Zhiding Yu
Anima Anandkumar
J. Álvarez
Ping Luo
ViT
60
4,903
0
31 May 2021
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
ViT
153
21,051
0
25 Mar 2021
Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
Sixiao Zheng
Jiachen Lu
Hengshuang Zhao
Xiatian Zhu
Zekun Luo
...
Yanwei Fu
Jianfeng Feng
Tao Xiang
Philip Torr
Li Zhang
ViT
49
2,866
0
31 Dec 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
55
39,900
0
22 Oct 2020
Dynamic Region-Aware Convolution
Jin Chen
Xijun Wang
Zichao Guo
Xinming Zhang
Jian Sun
24
116
0
27 Mar 2020
MMTM: Multimodal Transfer Module for CNN Fusion
Hamid Reza Vaezi Joze
Amirreza Shaban
Michael L. Iuzzolino
K. Koishida
34
277
0
20 Nov 2019
PST900: RGB-Thermal Calibration, Dataset and Segmentation Network
Shreyas S. Shivakumar
Neil Rodrigues
Alex Zhou
Ian D. Miller
Vijay Kumar
Camillo J Taylor
13
177
0
20 Sep 2019
CCNet: Criss-Cross Attention for Semantic Segmentation
Zilong Huang
Xinggang Wang
Yunchao Wei
Lichao Huang
Humphrey Shi
Wenyu Liu
Chang Huang
VOS
38
2,528
0
28 Nov 2018
Dual Attention Network for Scene Segmentation
J. Fu
Qingbin Liu
Haijie Tian
Yong Li
Yongjun Bao
Zhiwei Fang
Hanqing Lu
SSeg
150
5,073
0
09 Sep 2018
Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
Liang-Chieh Chen
Yukun Zhu
George Papandreou
Florian Schroff
Hartwig Adam
SSeg
44
13,005
0
07 Feb 2018
Rethinking Atrous Convolution for Semantic Image Segmentation
Liang-Chieh Chen
George Papandreou
Florian Schroff
Hartwig Adam
SSeg
113
8,402
0
17 Jun 2017
Pyramid Scene Parsing Network
Hengshuang Zhao
Jianping Shi
Xiaojuan Qi
Xiaogang Wang
Jiaya Jia
VOS
SSeg
104
11,915
0
04 Dec 2016
Fully Convolutional Networks for Semantic Segmentation
Evan Shelhamer
Jonathan Long
Trevor Darrell
VOS
SSeg
171
37,667
0
20 May 2016
U-Net: Convolutional Networks for Biomedical Image Segmentation
Olaf Ronneberger
Philipp Fischer
Thomas Brox
SSeg
3DV
503
76,398
0
18 May 2015
1