Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2207.07858
Cited By
The Lottery Ticket Hypothesis for Self-attention in Convolutional Neural Network
16 July 2022
Zhongzhan Huang
Senwei Liang
Mingfu Liang
Wei He
Haizhao Yang
Liang Lin
Re-assign community
ArXiv
PDF
HTML
Papers citing
"The Lottery Ticket Hypothesis for Self-attention in Convolutional Neural Network"
14 / 14 papers shown
Title
EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets
Xiaohan Chen
Yu Cheng
Shuohang Wang
Zhe Gan
Zhangyang Wang
Jingjing Liu
68
100
0
31 Dec 2020
Optimal Lottery Tickets via SubsetSum: Logarithmic Over-Parameterization is Sufficient
Ankit Pensia
Shashank Rajput
Alliot Nagle
Harit Vishwakarma
Dimitris Papailiopoulos
48
103
0
14 Jun 2020
Efficient Crowd Counting via Structured Knowledge Transfer
Lingbo Liu
Jiaqi Chen
Hefeng Wu
Tianshui Chen
Guanbin Li
Liang Lin
40
64
0
23 Mar 2020
Proving the Lottery Ticket Hypothesis: Pruning is All You Need
Eran Malach
Gilad Yehudai
Shai Shalev-Shwartz
Ohad Shamir
96
272
0
03 Feb 2020
What's Hidden in a Randomly Weighted Neural Network?
Vivek Ramanujan
Mitchell Wortsman
Aniruddha Kembhavi
Ali Farhadi
Mohammad Rastegari
52
354
0
29 Nov 2019
FairNAS: Rethinking Evaluation Fairness of Weight Sharing Neural Architecture Search
Xiangxiang Chu
Bo Zhang
Ruijun Xu
51
332
0
03 Jul 2019
GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond
Yue Cao
Jiarui Xu
Stephen Lin
Fangyun Wei
Han Hu
ISeg
68
1,561
0
25 Apr 2019
Gradient Descent Provably Optimizes Over-parameterized Neural Networks
S. Du
Xiyu Zhai
Barnabás Póczós
Aarti Singh
MLT
ODL
161
1,261
0
04 Oct 2018
Benchmark Analysis of Representative Deep Neural Network Architectures
Simone Bianco
Rémi Cadène
Luigi Celona
Paolo Napoletano
BDL
52
673
0
01 Oct 2018
CBAM: Convolutional Block Attention Module
Sanghyun Woo
Jongchan Park
Joon-Young Lee
In So Kweon
187
16,337
0
17 Jul 2018
DARTS: Differentiable Architecture Search
Hanxiao Liu
Karen Simonyan
Yiming Yang
176
4,326
0
24 Jun 2018
CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes
Yuhong Li
Xiaofan Zhang
Deming Chen
117
1,325
0
27 Feb 2018
Efficient Neural Architecture Search via Parameter Sharing
Hieu H. Pham
M. Guan
Barret Zoph
Quoc V. Le
J. Dean
81
2,761
0
09 Feb 2018
Learning Transferable Architectures for Scalable Image Recognition
Barret Zoph
Vijay Vasudevan
Jonathon Shlens
Quoc V. Le
148
5,577
0
21 Jul 2017
1