ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.00112
  4. Cited By
Transformer in Transformer
v1v2v3 (latest)

Transformer in Transformer

27 February 2021
Kai Han
An Xiao
Enhua Wu
Jianyuan Guo
Chunjing Xu
Yunhe Wang
    ViT
ArXiv (abs)PDFHTMLGithub (4228★)

Papers citing "Transformer in Transformer"

50 / 558 papers shown
Title
Reliable Few-shot Learning under Dual Noises
Reliable Few-shot Learning under Dual Noises
Ji Zhang
Jingkuan Song
Lianli Gao
N. Sebe
Heng Tao Shen
NoLa
31
0
0
19 Jun 2025
From Transformers to Large Language Models: A systematic review of AI applications in the energy sector towards Agentic Digital Twins
From Transformers to Large Language Models: A systematic review of AI applications in the energy sector towards Agentic Digital Twins
Gabriel Antonesi
T. Cioara
I. Anghel
Vasilis Michalakopoulos
Elissaios Sarmas
Liana Toderean
LLMAGMedImAI4CE
20
0
0
03 Jun 2025
Leaner Transformers: More Heads, Less Depth
Leaner Transformers: More Heads, Less Depth
Hemanth Saratchandran
Damien Teney
Simon Lucey
34
0
0
27 May 2025
TESSER: Transfer-Enhancing Adversarial Attacks from Vision Transformers via Spectral and Semantic Regularization
TESSER: Transfer-Enhancing Adversarial Attacks from Vision Transformers via Spectral and Semantic Regularization
Amira Guesmi
B. Ouni
Muhammad Shafique
AAML
233
0
0
26 May 2025
A Smart Healthcare System for Monkeypox Skin Lesion Detection and Tracking
A Smart Healthcare System for Monkeypox Skin Lesion Detection and Tracking
Huda Alghoraibi
Nuha Alqurashi
Sarah Alotaibi
Renad Alkhudaydi
Bdoor Aldajani
Lubna Alqurashi
Jood Batweel
Maha A. Thafar
12
0
0
25 May 2025
AnchorFormer: Differentiable Anchor Attention for Efficient Vision Transformer
AnchorFormer: Differentiable Anchor Attention for Efficient Vision Transformer
Jiquan Shan
Junxiao Wang
Lifeng Zhao
Liang Cai
Hongyuan Zhang
Ioannis Liritzis
ViT
245
0
0
22 May 2025
Boosting Global-Local Feature Matching via Anomaly Synthesis for Multi-Class Point Cloud Anomaly Detection
Boosting Global-Local Feature Matching via Anomaly Synthesis for Multi-Class Point Cloud Anomaly Detection
Yuqi Cheng
Yunkang Cao
Dongfang Wang
Nong Sang
Wenlong Li
109
1
0
12 May 2025
Image Recognition with Online Lightweight Vision Transformer: A Survey
Image Recognition with Online Lightweight Vision Transformer: A Survey
Zherui Zhang
Rongtao Xu
Jie Zhou
Changwei Wang
Xingtian Pei
...
Jiguang Zhang
Li Guo
Longxiang Gao
Wenyuan Xu
Shibiao Xu
ViT
527
0
0
06 May 2025
SCFormer: Structured Channel-wise Transformer with Cumulative Historical State for Multivariate Time Series Forecasting
SCFormer: Structured Channel-wise Transformer with Cumulative Historical State for Multivariate Time Series Forecasting
Shiwei Guo
Zheyu Chen
Yupeng Ma
Yunfei Han
Yi Wang
AI4TS
429
0
0
05 May 2025
Hybrid Knowledge Transfer through Attention and Logit Distillation for On-Device Vision Systems in Agricultural IoT
Hybrid Knowledge Transfer through Attention and Logit Distillation for On-Device Vision Systems in Agricultural IoT
Stanley Mugisha
Rashid Kisitu
Florence Tushabe
72
0
0
21 Apr 2025
You Don't Need All Attentions: Distributed Dynamic Fine-Tuning for Foundation Models
You Don't Need All Attentions: Distributed Dynamic Fine-Tuning for Foundation Models
Shiwei Ding
Lan Zhang
Zhenlin Wang
Giuseppe Ateniese
Xiaoyong Yuan
70
0
0
16 Apr 2025
Multi-Modal Brain Tumor Segmentation via 3D Multi-Scale Self-attention and Cross-attention
Multi-Modal Brain Tumor Segmentation via 3D Multi-Scale Self-attention and Cross-attention
Yonghao Huang
Leiting Chen
Chuan Zhou
ViTMedIm
72
0
0
12 Apr 2025
Structured Knowledge Accumulation: The Principle of Entropic Least Action in Forward-Only Neural Learning
Structured Knowledge Accumulation: The Principle of Entropic Least Action in Forward-Only Neural Learning
Bouarfa Mahi Quantiota
87
0
0
04 Apr 2025
HGFormer: Topology-Aware Vision Transformer with HyperGraph Learning
HGFormer: Topology-Aware Vision Transformer with HyperGraph Learning
Hao Wang
Shuo Zhang
Biao Leng
ViT
281
1
0
03 Apr 2025
Forward Learning with Differential Privacy
Forward Learning with Differential Privacy
Mingqian Feng
Zeliang Zhang
Jinyang Jiang
Yijie Peng
Chenliang Xu
98
0
0
01 Apr 2025
Mixed-granularity Implicit Representation for Continuous Hyperspectral Compressive Reconstruction
Mixed-granularity Implicit Representation for Continuous Hyperspectral Compressive Reconstruction
Jianan Li
Huan Chen
Wangcai Zhao
Rui Chen
Tingfa Xu
112
0
0
17 Mar 2025
UStyle: Waterbody Style Transfer of Underwater Scenes by Depth-Guided Feature Synthesis
UStyle: Waterbody Style Transfer of Underwater Scenes by Depth-Guided Feature Synthesis
Md Abu Bakr Siddique
Vaishnav Ramesh
Junliang Liu
Piyush Singh
Md Jahidul Islam
DiffM
112
0
0
14 Mar 2025
Ev-Layout: A Large-scale Event-based Multi-modal Dataset for Indoor Layout Estimation and Tracking
Xucheng Guo
Yiran Shen
Xiaofang Xiao
Yuanfeng Zhou
Lin Wang
3DV3DPCMDE
155
0
0
11 Mar 2025
A Transformer-in-Transformer Network Utilizing Knowledge Distillation for Image Recognition
A Transformer-in-Transformer Network Utilizing Knowledge Distillation for Image Recognition
Dewan Tauhid Rahman
Yeahia Sarker
Antar Mazumder
Md. Shamim Anower
ViT
65
0
0
24 Feb 2025
Infrared Image Super-Resolution: Systematic Review, and Future Trends
Infrared Image Super-Resolution: Systematic Review, and Future Trends
Y. Huang
Tomo Miyazaki
Xiao-Fang Liu
S. Omachi
SupR
156
14
0
21 Feb 2025
Cross-Domain Continual Learning for Edge Intelligence in Wireless ISAC Networks
Cross-Domain Continual Learning for Edge Intelligence in Wireless ISAC Networks
Jingzhi Hu
Xin Li
Zhou Su
Jun Luo
185
0
0
18 Feb 2025
Empirical evaluation of LLMs in predicting fixes of Configuration bugs in Smart Home System
Empirical evaluation of LLMs in predicting fixes of Configuration bugs in Smart Home System
Sheikh Moonwara Anjum Monisha
Atul Bharadwaj
104
0
0
16 Feb 2025
Low-altitude Friendly-Jamming for Satellite-Maritime Communications via Generative AI-enabled Deep Reinforcement Learning
Jiawei Huang
Aimin Wang
Geng Sun
Jiahui Li
Jiacheng Wang
Dusit Niyato
Victor C. M. Leung
115
0
0
28 Jan 2025
Unified CNNs and transformers underlying learning mechanism reveals multi-head attention modus vivendi
Unified CNNs and transformers underlying learning mechanism reveals multi-head attention modus vivendi
Ella Koresh
Ronit D. Gross
Yuval Meir
Yarden Tzach
Tal Halevi
Ido Kanter
ViT
134
1
0
22 Jan 2025
Protego: Detecting Adversarial Examples for Vision Transformers via Intrinsic Capabilities
Protego: Detecting Adversarial Examples for Vision Transformers via Intrinsic Capabilities
Jialin Wu
Kaikai Pan
Yanjiao Chen
Jiangyi Deng
Shengyuan Pang
Wei Dong
ViTAAML
125
0
0
13 Jan 2025
Research on the Proximity Relationships of Psychosomatic Disease
  Knowledge Graph Modules Extracted by Large Language Models
Research on the Proximity Relationships of Psychosomatic Disease Knowledge Graph Modules Extracted by Large Language Models
Zihan Zhou
Ziyi Zeng
Wenhao Jiang
Yihui Zhu
Jiaxin Mao
Yonggui Yuan
Min Xia
Shubin Zhao
Mengyu Yao
Yunqian Chen
46
0
0
24 Dec 2024
Semantic Alignment and Reinforcement for Data-Free Quantization of Vision Transformers
Semantic Alignment and Reinforcement for Data-Free Quantization of Vision Transformers
Mingliang Xu
Yuyao Zhou
Yuxin Zhang
Shen Li
Shen Li
Yong Li
Zhanpeng Zeng
Rongrong Ji
MQ
341
0
0
21 Dec 2024
A Full Transformer-based Framework for Automatic Pain Estimation using
  Videos
A Full Transformer-based Framework for Automatic Pain Estimation using Videos
Stefanos Gkikas
Manolis Tsiknakis
MedImViT
144
8
0
19 Dec 2024
One Pixel is All I Need
One Pixel is All I Need
Deng Siqin
Zhou Xiaoyi
ViT
453
0
0
14 Dec 2024
Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric
Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric
Zhichao Zhang
Wei Sun
Xinyue Li
Yunhao Li
Qihang Ge
...
Zhongpeng Ji
Fengyu Sun
Shangling Jui
Xiongkuo Min
Guangtao Zhai
EGVM
250
1
0
25 Nov 2024
GCI-ViTAL: Gradual Confidence Improvement with Vision Transformers for
  Active Learning on Label Noise
GCI-ViTAL: Gradual Confidence Improvement with Vision Transformers for Active Learning on Label Noise
Moseli Motsóehli
Kyungim Baek
94
1
0
08 Nov 2024
DCT-HistoTransformer: Efficient Lightweight Vision Transformer with DCT
  Integration for histopathological image analysis
DCT-HistoTransformer: Efficient Lightweight Vision Transformer with DCT Integration for histopathological image analysis
Mahtab Ranjbar
Mehdi Mohebbi
Mahdi Cherakhloo
Bijan Vosoughi. Vahdat
MedIm
87
1
0
24 Oct 2024
S$^4$ST: A Strong, Self-transferable, faSt, and Simple Scale Transformation for Transferable Targeted Attack
S4^44ST: A Strong, Self-transferable, faSt, and Simple Scale Transformation for Transferable Targeted Attack
Yongxiang Liu
Bowen Peng
Li Liu
Xuzhao Li
383
0
0
13 Oct 2024
On the Adversarial Transferability of Generalized "Skip Connections"
On the Adversarial Transferability of Generalized "Skip Connections"
Yisen Wang
Yichuan Mo
Dongxian Wu
Mingjie Li
Xingjun Ma
Zhouchen Lin
AAML
72
2
0
11 Oct 2024
BA-Net: Bridge Attention in Deep Neural Networks
BA-Net: Bridge Attention in Deep Neural Networks
Ronghui Zhang
Runzong Zou
Yue Zhao
Zirui Zhang
Junzhou Chen
Yue Cao
Chuan Hu
Houbing Song
62
1
0
10 Oct 2024
LecPrompt: A Prompt-based Approach for Logical Error Correction with
  CodeBERT
LecPrompt: A Prompt-based Approach for Logical Error Correction with CodeBERT
Zhenyu Xu
Victor S. Sheng
KELM
84
0
0
10 Oct 2024
AP-LDM: Attentive and Progressive Latent Diffusion Model for
  Training-Free High-Resolution Image Generation
AP-LDM: Attentive and Progressive Latent Diffusion Model for Training-Free High-Resolution Image Generation
Boyuan Cao
Jiaxin Ye
Yujie Wei
Hongming Shan
80
4
0
08 Oct 2024
CVVLSNet: Vehicle Location and Speed Estimation Using Partial Connected
  Vehicle Trajectory Data
CVVLSNet: Vehicle Location and Speed Estimation Using Partial Connected Vehicle Trajectory Data
Jiachen Ye
Dingyu Wang
Shaocheng Jia
Xin Pei
Zi Yang
Yi Zhang
S. Wong
61
0
0
30 Sep 2024
Enhancing Recommendation with Denoising Auxiliary Task
Enhancing Recommendation with Denoising Auxiliary Task
Pengsheng Liu
Linan Zheng
Jiale Chen
Guangfa Zhang
Yang Xu
Jinyun Fang
NoLa
63
0
0
25 Sep 2024
Adversarial Backdoor Defense in CLIP
Adversarial Backdoor Defense in CLIP
Junhao Kuang
Siyuan Liang
Jiawei Liang
Kuanrong Liu
Xiaochun Cao
AAML
85
3
0
24 Sep 2024
A Survey of the Self Supervised Learning Mechanisms for Vision Transformers
A Survey of the Self Supervised Learning Mechanisms for Vision Transformers
Asifullah Khan
A. Sohail
Mustansar Fiaz
Mehdi Hassan
Tariq Habib Afridi
...
Muhammad Zaigham Zaheer
Kamran Ali
Tangina Sultana
Ziaurrehman Tanoli
Naeem Akhter
284
5
0
30 Aug 2024
Hierarchical Network Fusion for Multi-Modal Electron Micrograph
  Representation Learning with Foundational Large Language Models
Hierarchical Network Fusion for Multi-Modal Electron Micrograph Representation Learning with Foundational Large Language Models
Sakhinana Sagar Srinivas
Geethan Sannidhi
Venkataramana Runkana
106
0
0
24 Aug 2024
Preliminary Investigations of a Multi-Faceted Robust and Synergistic
  Approach in Semiconductor Electron Micrograph Analysis: Integrating Vision
  Transformers with Large Language and Multimodal Models
Preliminary Investigations of a Multi-Faceted Robust and Synergistic Approach in Semiconductor Electron Micrograph Analysis: Integrating Vision Transformers with Large Language and Multimodal Models
Sakhinana Sagar Srinivas
Geethan Sannidhi
Sreeja Gangasani
Chidaksh Ravuru
Venkataramana Runkana
109
0
0
24 Aug 2024
CAS-ViT: Convolutional Additive Self-attention Vision Transformers for
  Efficient Mobile Applications
CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications
Tianfang Zhang
Lei Li
Yang Zhou
Wentao Liu
Chen Qian
Xiangyang Ji
ViT
94
17
0
07 Aug 2024
LaMamba-Diff: Linear-Time High-Fidelity Diffusion Models Based on Local
  Attention and Mamba
LaMamba-Diff: Linear-Time High-Fidelity Diffusion Models Based on Local Attention and Mamba
Yunxiang Fu
Chaoqi Chen
Yizhou Yu
Mamba
118
4
0
05 Aug 2024
HAIGEN: Towards Human-AI Collaboration for Facilitating Creativity and
  Style Generation in Fashion Design
HAIGEN: Towards Human-AI Collaboration for Facilitating Creativity and Style Generation in Fashion Design
Juzheng Zhang
Di Wu
Hanhui Deng
Yidan Long
Wenyi Tang
Yongqiang Chen
Can Liu
Zhanpeng Jin
Wenlei Zhang
Tangquan Qi
73
10
0
01 Aug 2024
VSSD: Vision Mamba with Non-Causal State Space Duality
VSSD: Vision Mamba with Non-Causal State Space Duality
Yuheng Shi
Minjing Dong
Mingjia Li
Chang Xu
Mamba
103
8
0
26 Jul 2024
Fairness Definitions in Language Models Explained
Fairness Definitions in Language Models Explained
Thang Viet Doan
Zhibo Chu
Zichong Wang
Wenbin Zhang
ALM
113
10
0
26 Jul 2024
Rate-Distortion-Cognition Controllable Versatile Neural Image
  Compression
Rate-Distortion-Cognition Controllable Versatile Neural Image Compression
Jinming Liu
Ruoyu Feng
Yunpeng Qi
Qiuyu Chen
Zhibo Chen
Wenjun Zeng
Xin Jin
101
2
0
16 Jul 2024
TCFormer: Visual Recognition via Token Clustering Transformer
TCFormer: Visual Recognition via Token Clustering Transformer
Wang Zeng
Sheng Jin
Lumin Xu
Wentao Liu
Chao Qian
Wanli Ouyang
Ping Luo
Xiaogang Wang
77
5
0
16 Jul 2024
1234...101112
Next