Transformer in Transformer

27 February 2021
Kai Han
An Xiao
Enhua Wu
Jianyuan Guo
Chunjing Xu
Yunhe Wang
    ViT

Papers citing "Transformer in Transformer"

50 / 553 papers shown
A Comprehensive Taxonomy and Analysis of Talking Head Synthesis: Techniques for Portrait Generation, Driving Mechanisms, and Editing
Ming Meng
Yufei Zhao
Bo Zhang
Yonggui Zhu
Weimin Shi
Maxwell Wen
Zhaoxin Fan
VGen
46
1
0
15 Jun 2024
Recent Advances in Federated Learning Driven Large Language Models: A Survey on Architecture, Performance, and Security
Youyang Qu
Ming Liu
Tianqing Zhu
Longxiang Gao
Shui Yu
Wanlei Zhou
MU
FedML
65
2
0
14 Jun 2024
SecureNet: A Comparative Study of DeBERTa and Large Language Models for Phishing Detection
Sakshi Mahendru
Tejul Pandit
31
1
0
10 Jun 2024
Convolutional Neural Networks and Vision Transformers for Fashion MNIST Classification: A Literature Review
Sonia Bbouzidi
Ghazala Hcini
Imen Jdey
Fadoua Drira
29
4
0
05 Jun 2024
Block Transformer: Global-to-Local Language Modeling for Fast Inference
Namgyu Ho
Sangmin Bae
Taehyeon Kim
Hyunjik Jo
Yireun Kim
Tal Schuster
Adam Fisch
James Thorne
Se-Young Yun
47
8
0
04 Jun 2024
Mamba as Decision Maker: Exploring Multi-scale Sequence Modeling in Offline Reinforcement Learning
Jiahang Cao
Qiang Zhang
Ziqing Wang
Jiaxu Wang
Hao Cheng
Yecheng Shao
Wen Zhao
Gang Han
Yijie Guo
Renjing Xu
Mamba
59
2
0
04 Jun 2024
Automatic Channel Pruning for Multi-Head Attention
Eunho Lee
Youngbae Hwang
ViT
40
1
0
31 May 2024
Activator: GLU Activation Function as the Core Component of a Vision Transformer
Abdullah Nazhat Abdullah
Tarkan Aydin
ViT
43
0
0
24 May 2024
PerLLM: Personalized Inference Scheduling with Edge-Cloud Collaboration for Diverse LLM Services
Zheming Yang
Yuanhao Yang
Chang Zhao
Qi Guo
Wenkai He
Wen Ji
50
13
0
23 May 2024
Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model
Yuheng Shi
Minjing Dong
Chang Xu
Mamba
48
32
0
23 May 2024
Configuring Data Augmentations to Reduce Variance Shift in Positional Embedding of Vision Transformers
Bum Jun Kim
Sang Woo Kim
ViT
43
1
0
23 May 2024
From Human-to-Human to Human-to-Bot Conversations in Software Engineering
Ranim Khojah
Francisco Gomes de Oliveira Neto
Philipp Leitner
25
2
0
21 May 2024
Multi-View Attentive Contextualization for Multi-View 3D Object Detection
Xianpeng Liu
Ce Zheng
Ming Qian
Nan Xue
Chong Chen
Zhebin Zhang
Chen Li
Tianfu Wu
41
2
0
20 May 2024
Large Language Models for Medicine: A Survey
Yanxin Zheng
Wensheng Gan
Zefeng Chen
Zhenlian Qi
Qian Liang
Philip S. Yu
LM&MA
23
15
0
20 May 2024
A Survey of Generative Techniques for Spatial-Temporal Data Mining
Qianru Zhang
Haixin Wang
Cheng Long
Liangcai Su
Xingwei He
...
Tailin Wu
Hongzhi Yin
Siu-Ming Yiu
Qi Tian
Christian S. Jensen
AI4TS
52
7
0
15 May 2024
A Semantic and Motion-Aware Spatiotemporal Transformer Network for Action Detection
Matthew Korban
Peter Youngs
Scott T. Acton
ViT
29
6
0
13 May 2024
HybridHash: Hybrid Convolutional and Self-Attention Deep Hashing for Image Retrieval
Chao He
Hongxi Wei
22
6
0
13 May 2024
ExplainableDetector: Exploring Transformer-based Language Modeling Approach for SMS Spam Detection with Explainability Analysis
Mohammad Amaz Uddin
Muhammad Nazrul Islam
Leandros A. Maglaras
Helge Janicke
Iqbal H. Sarker
42
2
0
12 May 2024
G-SAP: Graph-based Structure-Aware Prompt Learning over Heterogeneous Knowledge for Commonsense Reasoning
Ruiting Dai
Yuqiao Tan
Lisi Mo
Shuang Liang
Guohao Huo
Jiayi Luo
Yao Cheng
ReLM
RALM
LRM
43
0
0
09 May 2024
CNN-LSTM and Transfer Learning Models for Malware Classification based on Opcodes and API Calls
A. Bensaoud
Jugal Kalita
30
13
0
04 May 2024
Revolutionizing Traffic Sign Recognition: Unveiling the Potential of Vision Transformers
Susano Mingwin
Yulong Shisu
Yongshuai Wanwag
Sunshin Huing
46
2
0
29 Apr 2024
Data-independent Module-aware Pruning for Hierarchical Vision Transformers
Yang He
Qiufeng Wang
ViT
50
3
0
21 Apr 2024
Nested-TNT: Hierarchical Vision Transformers with Multi-Scale Feature Processing
Yuang Liu
Zhiheng Qiu
Xiaokai Qin
ViT
33
0
0
20 Apr 2024
X-Light: Cross-City Traffic Signal Control Using Transformer on Transformer as Meta Multi-Agent Reinforcement Learner
Haoyuan Jiang
Ziyue Li
Hua Wei
Xuantang Xiong
Jingqing Ruan
Jiaming Lu
Hangyu Mao
Rui Zhao
33
8
0
18 Apr 2024
Unblind Text Inputs: Predicting Hint-text of Text Input in Mobile Apps via LLM
Zhe Liu
Chunyang Chen
Junjie Wang
Mengzhuo Chen
Boyu Wu
Yuekai Huang
Jun Hu
Qing Wang
34
10
0
03 Apr 2024
Scene Adaptive Sparse Transformer for Event-based Object Detection
Yansong Peng
Hebei Li
Yueyi Zhang
Xiaoyan Sun
Feng Wu
ViT
46
13
0
02 Apr 2024
Heracles: A Hybrid SSM-Transformer Model for High-Resolution Image and Time-Series Analysis
Badri N. Patro
Suhas Ranganath
Vinay P. Namboodiri
Vijay Srinivas Agneeswaran
43
2
0
26 Mar 2024
Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression
Hancheng Ye
Chong Yu
Peng Ye
Renqiu Xia
Yansong Tang
Jiwen Lu
Tao Chen
Bo-Wen Zhang
56
3
0
23 Mar 2024
Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention
Bin Gao
Zhuomin He
Puru Sharma
Qingxuan Kang
Djordje Jevdjic
Junbo Deng
Xingkun Yang
Zhou Yu
Pengfei Zuo
71
45
0
23 Mar 2024
Accelerating ViT Inference on FPGA through Static and Dynamic Pruning
Dhruv Parikh
Shouyi Li
Bingyi Zhang
Rajgopal Kannan
Carl E. Busart
Viktor Prasanna
40
1
0
21 Mar 2024
Transformer based Multitask Learning for Image Captioning and Object Detection
Debolena Basak
P. K. Srijith
M. Desarkar
24
1
0
10 Mar 2024
NiNformer: A Network in Network Transformer with Token Mixing Generated Gating Function
Abdullah Nazhat Abdullah
Tarkan Aydin
39
0
0
04 Mar 2024
MiM-ISTD: Mamba-in-Mamba for Efficient Infrared Small Target Detection
Tianxiang Chen
Zi Ye
Zhentao Tan
Tao Gong
Yue-bo Wu
Qi Chu
Bin Liu
Nenghai Yu
Jieping Ye
Mamba
59
46
0
04 Mar 2024
AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation
Haonan Wang
Qixiang Zhang
Yi Li
Xiaomeng Li
43
16
0
04 Mar 2024
GIN-SD: Source Detection in Graphs with Incomplete Nodes via Positional Encoding and Attentive Fusion
Le Cheng
Peican Zhu
Keke Tang
Chao Gao
Zhen Wang
26
17
0
27 Feb 2024
A Comprehensive Survey of Convolutions in Deep Learning: Applications, Challenges, and Future Trends
Abolfazl Younesi
Mohsen Ansari
Mohammadamin Fazli
A. Ejlali
Muhammad Shafique
Joerg Henkel
3DV
52
44
0
23 Feb 2024
Label-efficient multi-organ segmentation with a diffusion model
Yongzhi Huang
Jinxin Zhu
Haseeb Hassan
Liyilei Su
Jingyu Li
Binding Huang
Yun Peng
Jingyu Li
Jun Ma
Bingding Huang
DiffM
MedIm
36
0
0
23 Feb 2024
WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition
Lianghui Zhu
Junwei Zhou
Yan Liu
Xin Hao
Wenyu Liu
Xinggang Wang
VLM
33
5
0
22 Feb 2024
An Explainable Transformer-based Model for Phishing Email Detection: A Large Language Model Approach
Mohammad Amaz Uddin
Iqbal H. Sarker
36
15
0
21 Feb 2024
TDViT: Temporal Dilated Video Transformer for Dense Video Tasks
Guanxiong Sun
Yang Hua
Guosheng Hu
N. Robertson
ViT
27
1
0
14 Feb 2024
Masked LoGoNet: Fast and Accurate 3D Image Analysis for Medical Domain
Amin Karimi Monsefi
Payam Karisani
Mengxi Zhou
Stacey S. Choi
Nathan Doble
Heng Ji
Srinivasan Parthasarathy
R. Ramnath
43
5
0
09 Feb 2024
Expediting In-Network Federated Learning by Voting-Based Consensus Model Compression
Xiaoxin Su
Yipeng Zhou
Laizhong Cui
Song Guo
FedML
30
3
0
06 Feb 2024
A Survey on Transformer Compression
Yehui Tang
Yunhe Wang
Jianyuan Guo
Zhijun Tu
Kai Han
Hailin Hu
Dacheng Tao
37
28
0
05 Feb 2024
NOAH: Learning Pairwise Object Category Attentions for Image Classification
Chao Li
Aojun Zhou
Anbang Yao
VLM
38
2
0
04 Feb 2024
ParZC: Parametric Zero-Cost Proxies for Efficient NAS
Peijie Dong
Lujun Li
Xinglin Pan
Zimian Wei
Xiang Liu
Qiang-qiang Wang
Xiaowen Chu
66
3
0
03 Feb 2024
Investigating Recurrent Transformers with Dynamic Halt
Jishnu Ray Chowdhury
Cornelia Caragea
39
1
0
01 Feb 2024
A Manifold Representation of the Key in Vision Transformers
Li Meng
Morten Goodwin
Anis Yazidi
P. Engelstad
29
0
0
01 Feb 2024
SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design
Seokju Yun
Youngmin Ro
ViT
44
29
0
29 Jan 2024
Do deep neural networks utilize the weight space efficiently?
Onur Can Koyun
B. U. Toreyin
21
0
0
26 Jan 2024
Speech Swin-Transformer: Exploring a Hierarchical Transformer with Shifted Windows for Speech Emotion Recognition
Yong Wang
Cheng Lu
Hailun Lian
Yan Zhao
Bjorn Schuller
Yuan Zong
Wenming Zheng
23
10
0
19 Jan 2024