The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks

9 March 2018

Papers citing "The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks"

50 / 729 papers shown

Title
BINGO: A Novel Pruning Mechanism to Reduce the Size of Neural Networks Aditya Panangat 21 0 0 15 May 2025
Are Spatial-Temporal Graph Convolution Networks for Human Action Recognition Over-Parameterized? Jianyang Xie Yitian Zhao Y. Meng He Zhao Anh Nguyen Yalin Zheng 22 0 0 15 May 2025
Uniform Loss vs. Specialized Optimization: A Comparative Analysis in Multi-Task Learning Gabriel S. Gama Valdir Grassi Jr MoMe 52 0 0 15 May 2025
Efficient Unstructured Pruning of Mamba State-Space Models for Resource-Constrained Environments Ibne Farabi Shihab Sanjeda Akter Anuj Sharma Mamba 54 0 0 13 May 2025
Low-Complexity Inference in Continual Learning via Compressed Knowledge Transfer Zhenrong Liu Janne M. J. Huttunen Mikko Honkala CLL 49 0 0 13 May 2025
Super-fast rates of convergence for Neural Networks Classifiers under the Hard Margin Condition Nathanael Tepakbong Ding-Xuan Zhou Xiang Zhou 46 0 0 13 May 2025
ICE-Pruning: An Iterative Cost-Efficient Pruning Pipeline for Deep Neural Networks Wenhao Hu Paul Henderson José Cano 32 0 0 12 May 2025
Sparse Training from Random Initialization: Aligning Lottery Ticket Masks using Weight Symmetry Mohammed Adnan Rohan Jain Ekansh Sharma Rahul Krishnan Yani Andrew Ioannou 61 0 0 08 May 2025
Guiding Evolutionary AutoEncoder Training with Activation-Based Pruning Operators Steven Jorgensen Erik Hemberg J. Toutouh Una-May O’Reilly 56 0 0 08 May 2025
How to Train Your Metamorphic Deep Neural Network Thomas Sommariva Simone Calderara Angelo Porrello 35 0 0 07 May 2025
Optimizing LLMs for Resource-Constrained Environments: A Survey of Model Compression Techniques Sanjay Surendranath Girija Shashank Kapoor Lakshit Arora Dipen Pradhan Aman Raj Ankit Shetgaonkar 57 0 0 05 May 2025
PASCAL: Precise and Efficient ANN- SNN Conversion using Spike Accumulation and Adaptive Layerwise Activation Pranav Ramesh Gopalakrishnan Srinivasan 36 0 0 03 May 2025
Efficient Shapley Value-based Non-Uniform Pruning of Large Language Models Chuan Sun Han Yu Lizhen Cui Xiaoxiao Li 175 0 0 03 May 2025
FineScope : Precision Pruning for Domain-Specialized Large Language Models Using SAE-Guided Self-Data Cultivation Chaitali Bhattacharyya Yeseong Kim 50 0 0 01 May 2025
GDI-Bench: A Benchmark for General Document Intelligence with Vision and Reasoning Decoupling Siqi Li Yufan Shen Xiangnan Chen Jiayi Chen Hengwei Ju ... Botian Shi Y. Liu Xinyu Cai Yu Qiao Yu Qiao VLM ELM 98 0 0 30 Apr 2025
Sparse-to-Sparse Training of Diffusion Models Inês Cardoso Oliveira Decebal Constantin Mocanu Luis A. Leiva DiffM 86 0 0 30 Apr 2025
Model Connectomes: A Generational Approach to Data-Efficient Language Models Klemen Kotar Greta Tuckute 60 0 0 29 Apr 2025
Hadamard product in deep learning: Introduction, Advances and Challenges Grigorios G. Chrysos Yongtao Wu Razvan Pascanu Philip Torr V. Cevher AAML 98 1 0 17 Apr 2025
Evolved Hierarchical Masking for Self-Supervised Learning Zhanzhou Feng Shiliang Zhang 51 0 0 12 Apr 2025
Neuroplasticity in Artificial Intelligence -- An Overview and Inspirations on Drop In & Out Learning Yupei Li M. Milling Björn Schuller AI4CE 107 0 0 27 Mar 2025
GranQ: Granular Zero-Shot Quantization with Channel-Wise Activation Scaling in QAT Inpyo Hong Youngwan Jo Hyojeong Lee Sunghyun Ahn Sanghyun Park MQ 67 0 0 24 Mar 2025
Adaptive Rank Allocation: Speeding Up Modern Transformers with RaNA Adapters Roberto Garcia Jerry Liu Daniel Sorvisto Sabri Eyuboglu 95 0 0 23 Mar 2025
FeNeC: Enhancing Continual Learning via Feature Clustering with Neighbor- or Logit-Based Classification Kamil Książek Hubert Jastrzębski Bartosz Trojan Krzysztof Pniaczek Michał Karp Jacek Tabor CLL 84 0 0 18 Mar 2025
Are formal and functional linguistic mechanisms dissociated in language models? Michael Hanna Sandro Pezzelle Yonatan Belinkov 52 0 0 14 Mar 2025
Accelerate 3D Object Detection Models via Zero-Shot Attention Key Pruning Lizhen Xu Xiuxiu Bai Xiaojun Jia Jianwu Fang Shanmin Pang 63 0 0 13 Mar 2025
PRISM: Privacy-Preserving Improved Stochastic Masking for Federated Generative Models Kyeongkook Seo Dong-Jun Han Jaejun Yoo 45 0 0 11 Mar 2025
How can representation dimension dominate structurally pruned LLMs? Mingxue Xu Lisa Alazraki Danilo Mandic 58 0 0 06 Mar 2025
Wanda++: Pruning Large Language Models via Regional Gradients Yifan Yang Kai Zhen Bhavana Ganesh Aram Galstyan Goeric Huybrechts ... S. Bodapati Nathan Susanj Zheng Zhang Jack FitzGerald Abhishek Kumar 66 0 0 06 Mar 2025
FitLight: Federated Imitation Learning for Plug-and-Play Autonomous Traffic Signal Control Yutong Ye Yingbo Zhou Zhusen Liu Xiao Du Hao Zhou Xiang Lian Mingsong Chen 70 0 0 17 Feb 2025
Forget the Data and Fine-Tuning! Just Fold the Network to Compress Dong Wang Haris Šikić Lothar Thiele O. Saukh 68 0 0 17 Feb 2025
Graph Neural Networks at a Fraction Rucha Bhalchandra Joshi Sagar Prakash Barad Nidhi Tiwari Subhankar Mishra GNN 102 0 0 10 Feb 2025
The impact of allocation strategies in subset learning on the expressive power of neural networks Ofir Schlisselberg Ran Darshan 98 0 0 10 Feb 2025
Contrastive Representation Distillation via Multi-Scale Feature Decoupling Cuipeng Wang Tieyuan Chen Haipeng Wang 54 0 0 09 Feb 2025
Modular Training of Neural Networks aids Interpretability Satvik Golechha Maheep Chaudhary Joan Velja Alessandro Abate Nandi Schoots 79 0 0 04 Feb 2025
Deep Weight Factorization: Sparse Learning Through the Lens of Artificial Symmetries Chris Kolb T. Weber Bernd Bischl David Rügamer 115 0 0 04 Feb 2025
Symmetric Pruning of Large Language Models Kai Yi Peter Richtárik AAML VLM 73 0 0 31 Jan 2025
An Invitation to Neuroalgebraic Geometry Giovanni Luca Marchetti Vahid Shahverdi Stefano Mereta Matthew Trager Kathlén Kohn 119 2 0 31 Jan 2025
Information Consistent Pruning: How to Efficiently Search for Sparse Networks? Soheil Gharatappeh Salimeh Yasaei Sekeh 59 0 0 28 Jan 2025
Meta-Sparsity: Learning Optimal Sparse Structures in Multi-task Networks through Meta-learning Richa Upadhyay Ronald Phlypo Rajkumar Saini Marcus Liwicki 42 0 0 21 Jan 2025
Playing the Lottery With Concave Regularizers for Sparse Trainable Neural Networks Giulia Fracastoro Sophie M. Fosson Andrea Migliorati G. Calafiore 45 1 0 19 Jan 2025
DynST: Dynamic Sparse Training for Resource-Constrained Spatio-Temporal Forecasting Hao Wu Haomin Wen Guibin Zhang Yutong Xia Kai Wang Keli Zhang Yu Zheng Kun Wang 73 2 0 17 Jan 2025
Tailored-LLaMA: Optimizing Few-Shot Learning in Pruned LLaMA Models with Task-Specific Prompts Danyal Aftab Steven Davy ALM 49 0 0 10 Jan 2025
Hierarchical Light Transformer Ensembles for Multimodal Trajectory Forecasting Adrien Lafage Mathieu Barbier Gianni Franchi David Filliat 50 3 0 08 Jan 2025
Training-free Heterogeneous Model Merging Zhengqi Xu Han Zheng Jie Song Li Sun Mingli Song MoMe 77 1 0 03 Jan 2025
Pushing the Limits of Sparsity: A Bag of Tricks for Extreme Pruning Andy Li A. Durrant Milan Markovic Lu Yin Georgios Leontidis Tianlong Chen Lu Yin Georgios Leontidis 80 0 0 20 Nov 2024
Zeroth-Order Adaptive Neuron Alignment Based Pruning without Re-Training Elia Cunegatti Leonardo Lucio Custode Giovanni Iacca 52 0 0 11 Nov 2024
Navigating Extremes: Dynamic Sparsity in Large Output Spaces Nasib Ullah Erik Schultheis Mike Lasby Yani Andrew Ioannou Rohit Babbar 35 0 0 05 Nov 2024
Neural Network Matrix Product Operator: A Multi-Dimensionally Integrable Machine Learning Potential Kentaro Hino Yuki Kurashige 34 0 0 31 Oct 2024
Mutual Information Preserving Neural Network Pruning Charles Westphal Stephen Hailes Mirco Musolesi 57 1 0 31 Oct 2024
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA Sangmin Bae Adam Fisch Hrayr Harutyunyan Ziwei Ji Seungyeon Kim Tal Schuster KELM 84 5 0 28 Oct 2024