The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks

9 March 2018

Papers citing "The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks"

50 / 745 papers shown

Title
Sparse neural networks with skip-connections for identification of aluminum electrolysis cell E. Lundby Haakon Robinson Adil Rasheed I. Halvorsen J. Gravdahl 30 2 0 02 Jan 2023
NIRVANA: Neural Implicit Representations of Videos with Adaptive Networks and Autoregressive Patch-wise Modeling Shishira R. Maiya Sharath Girish Max Ehrlich Hanyu Wang Kwot Sin Lee Patrick Poirson Pengxiang Wu Chen Wang Abhinav Shrivastava VGen 47 40 0 30 Dec 2022
COLT: Cyclic Overlapping Lottery Tickets for Faster Pruning of Convolutional Neural Networks Md. Ismail Hossain Mohammed Rakib M. M. L. Elahi Nabeel Mohammed Shafin Rahman 21 1 0 24 Dec 2022
Hyperspherical Quantization: Toward Smaller and More Accurate Models Dan Liu X. Chen Chen Ma Xue Liu MQ 35 3 0 24 Dec 2022
Constructing Organism Networks from Collaborative Self-Replicators Steffen Illium Maximilian Zorn Cristian Lenta Michael Kolle Claudia Linnhoff-Popien Thomas Gabor 21 0 0 20 Dec 2022
Dynamic Sparse Network for Time Series Classification: Learning What to "see'' Qiao Xiao Boqian Wu Yu Zhang Shiwei Liu Mykola Pechenizkiy Elena Mocanu Decebal Constantin Mocanu AI4TS 43 28 0 19 Dec 2022
NusaCrowd: Open Source Initiative for Indonesian NLP Resources Samuel Cahyawijaya Holy Lovenia Alham Fikri Aji Genta Indra Winata Bryan Wilie ... Timothy Baldwin Sebastian Ruder Herry Sujaini S. Sakti Ayu Purwarianti 44 48 0 19 Dec 2022
BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting Zheng-Xin Yong Hailey Schoelkopf Niklas Muennighoff Alham Fikri Aji David Ifeoluwa Adelani ... Genta Indra Winata Stella Biderman Edward Raff Dragomir R. Radev Vassilina Nikoulina CLL VLM AI4CE LRM 35 81 0 19 Dec 2022
Can We Find Strong Lottery Tickets in Generative Models? Sangyeop Yeo Yoojin Jang Jy-yong Sohn Dongyoon Han Jaejun Yoo 20 6 0 16 Dec 2022
On the Relationship Between Explanation and Prediction: A Causal View Amir-Hossein Karimi Krikamol Muandet Simon Kornblith Bernhard Schölkopf Been Kim FAtt CML 40 14 0 13 Dec 2022
Statistical guarantees for sparse deep learning Johannes Lederer 24 11 0 11 Dec 2022
AP: Selective Activation for De-sparsifying Pruned Neural Networks Shiyu Liu Rohan Ghosh Dylan Tan Mehul Motani AAML 26 0 0 09 Dec 2022
Multi-Concept Customization of Text-to-Image Diffusion Nupur Kumari Bin Zhang Richard Y. Zhang Eli Shechtman Jun-Yan Zhu 67 827 0 08 Dec 2022
A Rubric for Human-like Agents and NeuroAI Ida Momennejad 60 14 0 08 Dec 2022
On-device Training: A First Overview on Existing Systems Shuai Zhu Thiemo Voigt Jeonggil Ko Fatemeh Rahimian 34 14 0 01 Dec 2022
The Effect of Data Dimensionality on Neural Network Prunability Zachary Ankner Alex Renda Gintare Karolina Dziugaite Jonathan Frankle Tian Jin 36 5 0 01 Dec 2022
Towards Practical Few-shot Federated NLP Dongqi Cai Yaozong Wu Haitao Yuan Shangguang Wang F. Lin Mengwei Xu FedML 42 6 0 01 Dec 2022
Nonlinear Advantage: Trained Networks Might Not Be As Complex as You Think Christian H. X. Ali Mehmeti-Göpel Jan Disselhoff 18 5 0 30 Nov 2022
You Can Have Better Graph Neural Networks by Not Training Weights at All: Finding Untrained GNNs Tickets Tianjin Huang Tianlong Chen Meng Fang Vlado Menkovski Jiaxu Zhao ... Yulong Pei Decebal Constantin Mocanu Zhangyang Wang Mykola Pechenizkiy Shiwei Liu GNN 52 14 0 28 Nov 2022
Class-based Quantization for Neural Networks Wenhao Sun Grace Li Zhang Huaxi Gu Bing Li Ulf Schlichtmann MQ 24 7 0 27 Nov 2022
Why Neural Networks Work Sayan Mukherjee Bernardo A. Huberman 19 2 0 26 Nov 2022
Deep Learning Training Procedure Augmentations Cristian Simionescu 11 1 0 25 Nov 2022
LU decomposition and Toeplitz decomposition of a neural network Yucong Liu Simiao Jiao Lek-Heng Lim 30 7 0 25 Nov 2022
Towards Practical Control of Singular Values of Convolutional Layers Alexandra Senderovich Ekaterina Bulatova Anton Obukhov M. Rakhuba AAML 19 9 0 24 Nov 2022
Two Facets of SDE Under an Information-Theoretic Lens: Generalization of SGD via Training Trajectories and via Terminal States Ziqiao Wang Yongyi Mao 35 10 0 19 Nov 2022
Compressing Transformer-based self-supervised models for speech processing Tzu-Quan Lin Tsung-Huan Yang Chun-Yao Chang Kuang-Ming Chen Tzu-hsun Feng Hung-yi Lee Hao Tang 45 6 0 17 Nov 2022
REPAIR: REnormalizing Permuted Activations for Interpolation Repair Keller Jordan Hanie Sedghi O. Saukh R. Entezari Behnam Neyshabur MoMe 46 94 0 15 Nov 2022
SNIPER Training: Single-Shot Sparse Training for Text-to-Speech Perry Lam Huayun Zhang Nancy F. Chen Berrak Sisman Dorien Herremans VLM 40 0 0 14 Nov 2022
Efficient Adversarial Training with Robust Early-Bird Tickets Zhiheng Xi Rui Zheng Tao Gui Qi Zhang Xuanjing Huang AAML 46 9 0 14 Nov 2022
Partial Binarization of Neural Networks for Budget-Aware Efficient Learning Udbhav Bamba Neeraj Anand Saksham Aggarwal Dilip K Prasad D. K. Gupta MQ 26 0 0 12 Nov 2022
Exploiting the Partly Scratch-off Lottery Ticket for Quantization-Aware Training Mingliang Xu Gongrui Nan Yuxin Zhang Rongrong Ji Rongrong Ji MQ 20 3 0 12 Nov 2022
Astronomia ex machina: a history, primer, and outlook on neural networks in astronomy Michael J. Smith James E. Geach 40 32 0 07 Nov 2022
Efficient Traffic State Forecasting using Spatio-Temporal Network Dependencies: A Sparse Graph Neural Network Approach Bin Lei Shaoyi Huang Caiwen Ding Monika Filipovska GNN AI4TS 22 0 0 06 Nov 2022
Robust Lottery Tickets for Pre-trained Language Models Rui Zheng Rong Bao Yuhao Zhou Di Liang Sirui Wang Wei Wu Tao Gui Qi Zhang Xuanjing Huang AAML 32 13 0 06 Nov 2022
An Adversarial Robustness Perspective on the Topology of Neural Networks Morgane Goibert Thomas Ricatte Elvis Dohmatob AAML 21 2 0 04 Nov 2022
Data Models for Dataset Drift Controls in Machine Learning With Optical Images Luis Oala Marco Aversa Gabriel Nobis Kurt Willis Yoan Neuenschwander ... E. Pomarico Wojciech Samek Roderick Murray-Smith Christoph Clausen B. Sanguinetti 36 5 0 04 Nov 2022
Once-for-All Sequence Compression for Self-Supervised Speech Models Hsuan-Jui Chen Yen Meng Hung-yi Lee 33 5 0 04 Nov 2022
Continual Learning of Neural Machine Translation within Low Forgetting Risk Regions Shuhao Gu Bojie Hu Yang Feng CLL 44 13 0 03 Nov 2022
Speeding up NAS with Adaptive Subset Selection Vishak Prasad Colin White P. Jain Sibasis Nayak Ganesh Ramakrishnan BDL 26 5 0 02 Nov 2022
Learning Neural Implicit Representations with Surface Signal Parameterizations Yanran Guan Andrei Chubarau Ruby Rao Derek Nowrouzezahrai AI4CE 27 4 0 01 Nov 2022
Higher-order mutual information reveals synergistic sub-networks for multi-neuron importance Kenzo Clauw S. Stramaglia Daniele Marinazzo SSL FAtt 30 6 0 01 Nov 2022
Data-Efficient Cross-Lingual Transfer with Language-Specific Subnetworks Rochelle Choenni Dan Garrette Ekaterina Shutova 33 2 0 31 Oct 2022
LOFT: Finding Lottery Tickets through Filter-wise Training Qihan Wang Chen Dun Fangshuo Liao C. Jermaine Anastasios Kyrillidis 33 3 0 28 Oct 2022
Desiderata for next generation of ML model serving Sherif Akoush Andrei Paleyes A. V. Looveren Clive Cox 38 5 0 26 Oct 2022
Compressing And Debiasing Vision-Language Pre-Trained Models for Visual Question Answering Q. Si Yuanxin Liu Zheng Lin Peng Fu Weiping Wang VLM 42 1 0 26 Oct 2022
Auxiliary task discovery through generate-and-test Banafsheh Rafiee Sina Ghiassian Jun Jin R. Sutton Jun Luo Adam White 21 0 0 25 Oct 2022
Exploring Mode Connectivity for Pre-trained Language Models Yujia Qin Cheng Qian Jing Yi Weize Chen Yankai Lin Xu Han Zhiyuan Liu Maosong Sun Jie Zhou 31 20 0 25 Oct 2022
Gradient-based Weight Density Balancing for Robust Dynamic Sparse Training Mathias Parger Alexander Ertl Paul Eibensteiner J. H. Mueller Martin Winter M. Steinberger 34 0 0 25 Oct 2022
OLLA: Optimizing the Lifetime and Location of Arrays to Reduce the Memory Usage of Neural Networks Benoit Steiner Mostafa Elhoushi Jacob Kahn James Hegarty 31 8 0 24 Oct 2022
Compressing Explicit Voxel Grid Representations: fast NeRFs become also small C. Deng Enzo Tartaglione GNN 34 52 0 23 Oct 2022