Vision Transformer Adapters for Generalizable Multitask Learning


23 August 2023
Deblina Bhattacharjee
Sabine Süsstrunk
Mathieu Salzmann
    ViT

Papers citing "Vision Transformer Adapters for Generalizable Multitask Learning"

39 / 39 papers shown
Title
GIT: A Generative Image-to-text Transformer for Vision and Language
Jianfeng Wang
Zhengyuan Yang
Xiaowei Hu
Linjie Li
Kevin Qinghong Lin
Zhe Gan
Zicheng Liu
Ce Liu
Lijuan Wang
VLM
134
556
0
27 May 2022
Vision Transformer Adapter for Dense Predictions
Zhe Chen
Yuchen Duan
Wenhai Wang
Junjun He
Tong Lu
Jifeng Dai
Yu Qiao
129
564
0
17 May 2022
MulT: An End-to-End Multitask Learning Transformer
Deblina Bhattacharjee
Tong Zhang
Sabine Süsstrunk
Mathieu Salzmann
ViT
91
67
0
17 May 2022
TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation
Wenqiang Zhang
Zilong Huang
Guozhong Luo
Tao Chen
Xinggang Wang
Wenyu Liu
Gang Yu
Chunhua Shen
ViT
103
208
0
12 Apr 2022
InvPT: Inverted Pyramid Multi-task Transformer for Dense Scene Understanding
Hanrong Ye
Dan Xu
ViT
57
88
0
15 Mar 2022
Multi-class Token Transformer for Weakly Supervised Semantic Segmentation
Lian Xu
Wanli Ouyang
Bennamoun
F. Boussaïd
Dan Xu
ViT
82
213
0
06 Mar 2022
MViTv2: Improved Multiscale Vision Transformers for Classification and Detection
Yanghao Li
Chao-Yuan Wu
Haoqi Fan
K. Mangalam
Bo Xiong
Jitendra Malik
Christoph Feichtenhofer
ViT
148
690
0
02 Dec 2021
Benchmarking Detection Transfer Learning with Vision Transformers
Yanghao Li
Saining Xie
Xinlei Chen
Piotr Dollar
Kaiming He
Ross B. Girshick
72
168
0
22 Nov 2021
Efficiently Identifying Task Groupings for Multi-Task Learning
Christopher Fifty
Ehsan Amid
Zhe Zhao
Tianhe Yu
Rohan Anil
Chelsea Finn
282
255
1
10 Sep 2021
RINDNet: Edge Detection for Discontinuity in Reflectance, Illumination, Normal and Depth
Mengyang Pu
Yaping Huang
Q. Guan
Haibin Ling
63
46
0
02 Aug 2021
Per-Pixel Classification is Not All You Need for Semantic Segmentation
Bowen Cheng
Alex Schwing
Alexander Kirillov
VLM ViT
208
1,540
0
13 Jul 2021
CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows
Xiaoyi Dong
Jianmin Bao
Dongdong Chen
Weiming Zhang
Nenghai Yu
Lu Yuan
Dong Chen
B. Guo
ViT
145
982
0
01 Jul 2021
Focal Self-attention for Local-Global Interactions in Vision Transformers
Jianwei Yang
Chunyuan Li
Pengchuan Zhang
Xiyang Dai
Bin Xiao
Lu Yuan
Jianfeng Gao
ViT
78
435
0
01 Jul 2021
PVT v2: Improved Baselines with Pyramid Vision Transformer
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT AI4TS
106
1,675
0
25 Jun 2021
FoveaTer: Foveated Transformer for Image Classification
Aditya Jonnalagadda
Wenjie Wang
B. S. Manjunath
Miguel P. Eckstein
ViT
71
24
0
29 May 2021
Learning to Relate Depth and Semantics for Unsupervised Domain Adaptation
Suman Saha
Anton Obukhov
D. Paudel
Menelaos Kanakis
Yuhua Chen
Stamatios Georgoulis
Luc Van Gool
OOD
59
57
0
17 May 2021
Segmenter: Transformer for Semantic Segmentation
Robin Strudel
Ricardo Garcia Pinel
Ivan Laptev
Cordelia Schmid
ViT
203
1,467
0
12 May 2021
Exploring Relational Context for Multi-Task Dense Prediction
David Brüggemann
Menelaos Kanakis
Anton Obukhov
Stamatios Georgoulis
Luc Van Gool
83
77
0
28 Apr 2021
Twins: Revisiting the Design of Spatial Attention in Vision Transformers
Xiangxiang Chu
Zhi Tian
Yuqing Wang
Bo Zhang
Haibing Ren
Xiaolin K. Wei
Huaxia Xia
Chunhua Shen
ViT
82
1,020
0
28 Apr 2021
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
Chun-Fu Chen
Quanfu Fan
Rameswar Panda
ViT
71
1,478
0
27 Mar 2021
Vision Transformers for Dense Prediction
René Ranftl
Alexey Bochkovskiy
V. Koltun
ViT MDE
136
1,734
0
24 Mar 2021
UniT: Multimodal Multitask Learning with a Unified Transformer
Ronghang Hu
Amanpreet Singh
ViT
84
300
0
22 Feb 2021
Pre-Trained Image Processing Transformer
Hanting Chen
Yunhe Wang
Tianyu Guo
Chang Xu
Yiping Deng
Zhenhua Liu
Siwei Ma
Chunjing Xu
Chao Xu
Wen Gao
VLM ViT
138
1,676
0
01 Dec 2020
General Multi-label Image Classification with Transformers
Jack Lanchantin
Tianlu Wang
Vicente Ordonez
Yanjun Qi
ViT
60
266
0
27 Nov 2020
Robust Learning Through Cross-Task Consistency
Amir Zamir
Alexander Sax
Teresa Yeo
Oğuzhan Fatih Kar
Nikhil Cheerla
Rohan Suri
Zhangjie Cao
Jitendra Malik
Leonidas Guibas
OOD
55
158
0
07 Jun 2020
Virtual KITTI 2
Yohann Cabon
Naila Murray
Martin Humenberger
3DPC
66
286
0
29 Jan 2020
MTI-Net: Multi-Scale Task Interaction Networks for Multi-Task Learning
Simon Vandenhende
Stamatios Georgoulis
Luc Van Gool
61
223
0
19 Jan 2020
Gradient Surgery for Multi-Task Learning
Tianhe Yu
Saurabh Kumar
Abhishek Gupta
Sergey Levine
Karol Hausman
Chelsea Finn
174
1,221
0
19 Jan 2020
An Empirical Study of Batch Normalization and Group Normalization in Conditional Computation
Vincent Michalski
Vikram S. Voleti
Samira Ebrahimi Kahou
Anthony Ortiz
Pascal Vincent
C. Pal
Doina Precup
BDL
34
6
0
31 Jul 2019
Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer
René Ranftl
Katrin Lasinger
David Hafner
Konrad Schindler
V. Koltun
MDE
204
1,793
0
02 Jul 2019
Which Tasks Should Be Learned Together in Multi-task Learning?
Trevor Scott Standley
Amir Zamir
Dawn Chen
Leonidas Guibas
Jitendra Malik
Silvio Savarese
103
517
0
18 May 2019
Representation Similarity Analysis for Efficient Task taxonomy & Transfer Learning
Kshitij Dwivedi
Gemma Roig
68
152
0
26 Apr 2019
Task2Vec: Task Embedding for Meta-Learning
Alessandro Achille
Michael Lam
Rahul Tewari
Avinash Ravichandran
Subhransu Maji
Charless C. Fowlkes
Stefano Soatto
Pietro Perona
SSL
77
315
0
10 Feb 2019
PAD-Net: Multi-Tasks Guided Prediction-and-Distillation Network for Simultaneous Depth Estimation and Scene Parsing
Dan Xu
Wanli Ouyang
Xiaogang Wang
N. Sebe
MoE
53
477
0
11 May 2018
Taskonomy: Disentangling Task Transfer Learning
Amir Zamir
Alexander Sax
Bokui (William) Shen
Leonidas Guibas
Jitendra Malik
Silvio Savarese
123
1,220
0
23 Apr 2018
Decoupled Weight Decay Regularization
I. Loshchilov
Frank Hutter
OffRL
144
2,142
0
14 Nov 2017
Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics
Alex Kendall
Y. Gal
R. Cipolla
3DH
272
3,123
0
19 May 2017
Layer Normalization
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
413
10,494
0
21 Jul 2016
The Cityscapes Dataset for Semantic Urban Scene Understanding
Marius Cordts
Mohamed Omran
Sebastian Ramos
Timo Rehfeld
Markus Enzweiler
Rodrigo Benenson
Uwe Franke
Stefan Roth
Bernt Schiele
1.1K
11,623
0
06 Apr 2016