Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2104.05704
Cited By
Escaping the Big Data Paradigm with Compact Transformers
12 April 2021
Ali Hassani
Steven Walton
Nikhil Shah
Abulikemu Abuduweili
Jiachen Li
Humphrey Shi
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Escaping the Big Data Paradigm with Compact Transformers"
16 / 216 papers shown
Title
SERF: Towards better training of deep neural networks using log-Softplus ERror activation Function
Sayan Nag
Mayukh Bhattacharyya
LLMSV
27
22
0
21 Aug 2021
Escaping the Gradient Vanishing: Periodic Alternatives of Softmax in Attention Mechanism
Shulun Wang
Bin Liu
Feng Liu
25
16
0
16 Aug 2021
TriTransNet: RGB-D Salient Object Detection with a Triplet Transformer Embedding Network
Zhengyi Liu
Yuan Wang
Zhengzheng Tu
Yun Xiao
Bin Tang
ViT
32
142
0
09 Aug 2021
Vision Transformer for femur fracture classification
L. Tanzi
A. Audisio
G. Cirrincione
A. Aprato
E. Vezzetti
MedIm
38
64
0
07 Aug 2021
Vision Xformers: Efficient Attention for Image Classification
Pranav Jeevan
Amit Sethi
ViT
25
13
0
05 Jul 2021
MSN: Efficient Online Mask Selection Network for Video Instance Segmentation
Vidit Goel
Jiachen Li
Shubhika Garg
Harsh Maheshwari
Humphrey Shi
19
7
0
19 Jun 2021
On Deep Neural Network Calibration by Regularization and its Impact on Refinement
Aditya Singh
Alessandro Bay
B. Sengupta
Andrea Mirabile
AAML
27
2
0
17 Jun 2021
Shuffle Transformer: Rethinking Spatial Shuffle for Vision Transformer
Zilong Huang
Youcheng Ben
Guozhong Luo
Pei Cheng
Gang Yu
Bin-Bin Fu
ViT
19
182
0
07 Jun 2021
A Little Robustness Goes a Long Way: Leveraging Robust Features for Targeted Transfer Attacks
Jacob Mitchell Springer
Melanie Mitchell
Garrett Kenyon
AAML
31
43
0
03 Jun 2021
Nested Hierarchical Transformer: Towards Accurate, Data-Efficient and Interpretable Visual Understanding
Zizhao Zhang
Han Zhang
Long Zhao
Ting Chen
Sercan Ö. Arik
Tomas Pfister
ViT
22
169
0
26 May 2021
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
283
1,984
0
09 Feb 2021
CheXtransfer: Performance and Parameter Efficiency of ImageNet Models for Chest X-Ray Interpretation
Alexander Ke
William Ellsworth
Oishi Banerjee
A. Ng
Pranav Rajpurkar
MedIm
73
101
0
18 Jan 2021
Transformers in Vision: A Survey
Salman Khan
Muzammal Naseer
Munawar Hayat
Syed Waqas Zamir
Fahad Shahbaz Khan
M. Shah
ViT
227
2,431
0
04 Jan 2021
Union-net: A deep neural network model adapted to small data sets
Qingfang He
Guang Cheng
Zhiying Lin
PINN
31
6
0
24 Dec 2020
Stochastic-Sign SGD for Federated Learning with Theoretical Guarantees
Richeng Jin
Yufan Huang
Xiaofan He
H. Dai
Tianfu Wu
FedML
22
63
0
25 Feb 2020
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
218
7,926
0
17 Aug 2015
Previous
1
2
3
4
5