ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2207.07858
  4. Cited By
The Lottery Ticket Hypothesis for Self-attention in Convolutional Neural
  Network

The Lottery Ticket Hypothesis for Self-attention in Convolutional Neural Network

16 July 2022
Zhongzhan Huang
Senwei Liang
Mingfu Liang
Wei He
Haizhao Yang
Liang Lin
ArXivPDFHTML

Papers citing "The Lottery Ticket Hypothesis for Self-attention in Convolutional Neural Network"

14 / 14 papers shown
Title
EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets
EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets
Xiaohan Chen
Yu Cheng
Shuohang Wang
Zhe Gan
Zhangyang Wang
Jingjing Liu
68
100
0
31 Dec 2020
Optimal Lottery Tickets via SubsetSum: Logarithmic Over-Parameterization
  is Sufficient
Optimal Lottery Tickets via SubsetSum: Logarithmic Over-Parameterization is Sufficient
Ankit Pensia
Shashank Rajput
Alliot Nagle
Harit Vishwakarma
Dimitris Papailiopoulos
48
103
0
14 Jun 2020
Efficient Crowd Counting via Structured Knowledge Transfer
Efficient Crowd Counting via Structured Knowledge Transfer
Lingbo Liu
Jiaqi Chen
Hefeng Wu
Tianshui Chen
Guanbin Li
Liang Lin
40
64
0
23 Mar 2020
Proving the Lottery Ticket Hypothesis: Pruning is All You Need
Proving the Lottery Ticket Hypothesis: Pruning is All You Need
Eran Malach
Gilad Yehudai
Shai Shalev-Shwartz
Ohad Shamir
96
272
0
03 Feb 2020
What's Hidden in a Randomly Weighted Neural Network?
What's Hidden in a Randomly Weighted Neural Network?
Vivek Ramanujan
Mitchell Wortsman
Aniruddha Kembhavi
Ali Farhadi
Mohammad Rastegari
52
354
0
29 Nov 2019
FairNAS: Rethinking Evaluation Fairness of Weight Sharing Neural
  Architecture Search
FairNAS: Rethinking Evaluation Fairness of Weight Sharing Neural Architecture Search
Xiangxiang Chu
Bo Zhang
Ruijun Xu
51
332
0
03 Jul 2019
GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond
GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond
Yue Cao
Jiarui Xu
Stephen Lin
Fangyun Wei
Han Hu
ISeg
68
1,561
0
25 Apr 2019
Gradient Descent Provably Optimizes Over-parameterized Neural Networks
Gradient Descent Provably Optimizes Over-parameterized Neural Networks
S. Du
Xiyu Zhai
Barnabás Póczós
Aarti Singh
MLT
ODL
161
1,261
0
04 Oct 2018
Benchmark Analysis of Representative Deep Neural Network Architectures
Benchmark Analysis of Representative Deep Neural Network Architectures
Simone Bianco
Rémi Cadène
Luigi Celona
Paolo Napoletano
BDL
52
673
0
01 Oct 2018
CBAM: Convolutional Block Attention Module
CBAM: Convolutional Block Attention Module
Sanghyun Woo
Jongchan Park
Joon-Young Lee
In So Kweon
187
16,337
0
17 Jul 2018
DARTS: Differentiable Architecture Search
DARTS: Differentiable Architecture Search
Hanxiao Liu
Karen Simonyan
Yiming Yang
176
4,326
0
24 Jun 2018
CSRNet: Dilated Convolutional Neural Networks for Understanding the
  Highly Congested Scenes
CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes
Yuhong Li
Xiaofan Zhang
Deming Chen
117
1,325
0
27 Feb 2018
Efficient Neural Architecture Search via Parameter Sharing
Efficient Neural Architecture Search via Parameter Sharing
Hieu H. Pham
M. Guan
Barret Zoph
Quoc V. Le
J. Dean
81
2,761
0
09 Feb 2018
Learning Transferable Architectures for Scalable Image Recognition
Learning Transferable Architectures for Scalable Image Recognition
Barret Zoph
Vijay Vasudevan
Jonathon Shlens
Quoc V. Le
148
5,577
0
21 Jul 2017
1