Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1803.03635
Cited By
v1
v2
v3
v4
v5 (latest)
The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks
9 March 2018
Jonathan Frankle
Michael Carbin
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks"
50 / 2,030 papers shown
Title
Do Multilingual Neural Machine Translation Models Contain Language Pair Specific Attention Heads?
Zae Myung Kim
Laurent Besacier
Vassilina Nikoulina
D. Schwab
MILM
84
8
0
31 May 2021
RED : Looking for Redundancies for Data-Free Structured Compression of Deep Neural Networks
Edouard Yvinec
Arnaud Dapogny
Matthieu Cord
Kévin Bailly
CVBM
73
23
0
31 May 2021
1xN Pattern for Pruning Convolutional Neural Networks
Mingbao Lin
Yu-xin Zhang
Yuchao Li
Bohong Chen
Chia-Wen Lin
Mengdi Wang
Shen Li
Yonghong Tian
Rongrong Ji
3DPC
115
43
0
31 May 2021
LEAP: Learnable Pruning for Transformer-based Models
Z. Yao
Xiaoxia Wu
Linjian Ma
Sheng Shen
Kurt Keutzer
Michael W. Mahoney
Yuxiong He
60
7
0
30 May 2021
Sparse Uncertainty Representation in Deep Learning with Inducing Weights
H. Ritter
Martin Kukla
Chen Zhang
Yingzhen Li
UQCV
BDL
81
17
0
30 May 2021
Embedding Principle of Loss Landscape of Deep Neural Networks
Yaoyu Zhang
Zhongwang Zhang
Yaoyu Zhang
Z. Xu
67
38
0
30 May 2021
Neural Network Training Using
ℓ
1
\ell_1
ℓ
1
-Regularization and Bi-fidelity Data
Subhayan De
Alireza Doostan
71
25
0
27 May 2021
Joint-DetNAS: Upgrade Your Detector with NAS, Pruning and Dynamic Distillation
Lewei Yao
Renjie Pi
Hang Xu
Wei Zhang
Zhenguo Li
Tong Zhang
139
40
0
27 May 2021
Search Spaces for Neural Model Training
Darko Stosic
Dusan Stosic
78
4
0
27 May 2021
Geometry of the Loss Landscape in Overparameterized Neural Networks: Symmetries and Invariances
Berfin cSimcsek
François Ged
Arthur Jacot
Francesco Spadaro
Clément Hongler
W. Gerstner
Johanni Brea
AI4CE
87
102
0
25 May 2021
Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization
Chen Liang
Simiao Zuo
Minshuo Chen
Haoming Jiang
Xiaodong Liu
Pengcheng He
T. Zhao
Weizhu Chen
59
69
0
25 May 2021
AirNet: Neural Network Transmission over the Air
Mikolaj Jankowski
Deniz Gunduz
K. Mikolajczyk
102
1
0
24 May 2021
Properties of the After Kernel
Philip M. Long
66
29
0
21 May 2021
Understanding Uncertainty in Bayesian Deep Learning
Cooper Lorsung
BDL
UQCV
21
0
0
21 May 2021
A Probabilistic Approach to Neural Network Pruning
Xin-Yao Qian
Diego Klabjan
91
17
0
20 May 2021
Learning Language Specific Sub-network for Multilingual Machine Translation
Zehui Lin
Liwei Wu
Mingxuan Wang
Lei Li
78
83
0
19 May 2021
Livewired Neural Networks: Making Neurons That Fire Together Wire Together
Thomas Schumacher
70
4
0
17 May 2021
A brain basis of dynamical intelligence for AI and computational neuroscience
J. Monaco
Kanaka Rajan
Grace M. Hwang
AI4CE
51
6
0
15 May 2021
BERT Busters: Outlier Dimensions that Disrupt Transformers
Olga Kovaleva
Saurabh Kulshreshtha
Anna Rogers
Anna Rumshisky
117
92
0
14 May 2021
Troubleshooting Blind Image Quality Models in the Wild
Zhihua Wang
Haotao Wang
Tianlong Chen
Zhangyang Wang
Kede Ma
66
20
0
14 May 2021
Dynamic Multi-Branch Layers for On-Device Neural Machine Translation
Zhixing Tan
Zeyuan Yang
Meng Zhang
Qun Liu
Maosong Sun
Yang Liu
AI4CE
56
4
0
14 May 2021
BWCP: Probabilistic Learning-to-Prune Channels for ConvNets via Batch Whitening
Wenqi Shao
Hang Yu
Zhaoyang Zhang
Hang Xu
Zhenguo Li
Ping Luo
AAML
19
2
0
13 May 2021
Model Pruning Based on Quantified Similarity of Feature Maps
Zidu Wang
Xue-jun Liu
Long Huang
Yuxiang Chen
Yufei Zhang
Zhikang Lin
Rui Wang
39
18
0
13 May 2021
Dynamical Isometry: The Missing Ingredient for Neural Network Pruning
Huan Wang
Can Qin
Yue Bai
Y. Fu
48
5
0
12 May 2021
Pruning of Deep Spiking Neural Networks through Gradient Rewiring
Yanqing Chen
Zhaofei Yu
Wei Fang
Tiejun Huang
Yonghong Tian
83
67
0
11 May 2021
A Bregman Learning Framework for Sparse Neural Networks
Leon Bungert
Tim Roith
Daniel Tenbrinck
Martin Burger
89
18
0
10 May 2021
What Kinds of Functions do Deep Neural Networks Learn? Insights from Variational Spline Theory
Rahul Parhi
Robert D. Nowak
MLT
125
71
0
07 May 2021
Adapting by Pruning: A Case Study on BERT
Yang Gao
Nicolo Colombo
Wen Wang
49
17
0
07 May 2021
Network Pruning That Matters: A Case Study on Retraining Variants
Duong H. Le
Binh-Son Hua
89
41
0
07 May 2021
Structured Ensembles: an Approach to Reduce the Memory Footprint of Ensemble Methods
Jary Pomponi
Simone Scardapane
A. Uncini
UQCV
56
7
0
06 May 2021
Encoding Weights of Irregular Sparsity for Fixed-to-Fixed Model Compression
Baeseong Park
S. Kwon
Daehwan Oh
Byeongwook Kim
Dongsoo Lee
53
4
0
05 May 2021
On Lottery Tickets and Minimal Task Representations in Deep Reinforcement Learning
Marc Aurel Vischer
R. T. Lange
Henning Sprekeler
OOD
UQCV
OffRL
91
24
0
04 May 2021
Initialization and Regularization of Factorized Neural Layers
M. Khodak
Neil A. Tenenholtz
Lester W. Mackey
Nicolò Fusi
147
57
0
03 May 2021
Effective Sparsification of Neural Networks with Global Sparsity Constraint
Xiao Zhou
Weizhong Zhang
Hang Xu
Tong Zhang
152
63
0
03 May 2021
Personalized Federated Learning by Structured and Unstructured Pruning under Data Heterogeneity
Saeed Vahidian
Mahdi Morafah
Bill Lin
115
61
0
02 May 2021
Studying the Consistency and Composability of Lottery Ticket Pruning Masks
Rajiv Movva
Jonathan Frankle
Michael Carbin
MoMe
3DPC
37
3
0
30 Apr 2021
Filter Distribution Templates in Convolutional Networks for Image Classification Tasks
Ramon Izquierdo-Cordova
Walterio W. Mayol-Cuevas
VLM
27
0
0
28 Apr 2021
MineGAN++: Mining Generative Models for Efficient Knowledge Transfer to Limited Data Domains
Yaxing Wang
Abel Gonzalez-Garcia
Chenshen Wu
Luis Herranz
Fahad Shahbaz Khan
Shangling Jui
Joost van de Weijer
63
6
0
28 Apr 2021
Policy Manifold Search: Exploring the Manifold Hypothesis for Diversity-based Neuroevolution
Nemanja Rakićević
Antoine Cully
Petar Kormushev
51
34
0
27 Apr 2021
Sifting out the features by pruning: Are convolutional networks the winning lottery ticket of fully connected ones?
Franco Pellegrini
Giulio Biroli
109
6
0
27 Apr 2021
Communication-Efficient and Personalized Federated Lottery Ticket Learning
Sejin Seo
Seung-Woo Ko
Jihong Park
Seong-Lyun Kim
M. Bennis
FedML
86
15
0
26 Apr 2021
Extract then Distill: Efficient and Effective Task-Agnostic BERT Distillation
Cheng Chen
Yichun Yin
Lifeng Shang
Zhi Wang
Xin Jiang
Xiao Chen
Qun Liu
FedML
70
7
0
24 Apr 2021
Playing Lottery Tickets with Vision and Language
Zhe Gan
Yen-Chun Chen
Linjie Li
Tianlong Chen
Yu Cheng
Shuohang Wang
Jingjing Liu
Lijuan Wang
Zicheng Liu
VLM
140
56
0
23 Apr 2021
Analyzing Monotonic Linear Interpolation in Neural Network Loss Landscapes
James Lucas
Juhan Bae
Michael Ruogu Zhang
Stanislav Fort
R. Zemel
Roger C. Grosse
MoMe
234
28
0
22 Apr 2021
Distilling Knowledge via Knowledge Review
Pengguang Chen
Shu Liu
Hengshuang Zhao
Jiaya Jia
197
449
0
19 Apr 2021
Case-based Reasoning for Natural Language Queries over Knowledge Bases
Rajarshi Das
Manzil Zaheer
Dung Ngoc Thai
Ameya Godbole
Ethan Perez
Jay Yoon Lee
Lizhen Tan
L. Polymenakos
Andrew McCallum
111
169
0
18 Apr 2021
Lottery Jackpots Exist in Pre-trained Models
Yuxin Zhang
Mingbao Lin
Yan Wang
Chia-Wen Lin
Rongrong Ji
95
16
0
18 Apr 2021
Rethinking Network Pruning -- under the Pre-train and Fine-tune Paradigm
Dongkuan Xu
Ian En-Hsu Yen
Jinxi Zhao
Zhibin Xiao
VLM
AAML
92
58
0
18 Apr 2021
Adaptive Sparse Transformer for Multilingual Translation
Hongyu Gong
Xian Li
Dmitriy Genzel
69
14
0
15 Apr 2021
Disentangling Representations of Text by Masking Transformers
Xiongyi Zhang
Jan-Willem van de Meent
Byron C. Wallace
DRL
61
21
0
14 Apr 2021
Previous
1
2
3
...
30
31
32
...
39
40
41
Next