v1v2v3v4 (latest)

Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs

27 February 2018

Dmitry Vetrov

Papers citing "Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs"

31 / 31 papers shown

Title
Understanding Mode Connectivity via Parameter Space Symmetry B. Zhao Nima Dehmamy Robin Walters Rose Yu 225 8 0 29 May 2025
Inference-Time Decomposition of Activations (ITDA): A Scalable Approach to Interpreting Large Language Models Patrick Leask Neel Nanda Noura Al Moubayed 81 1 0 23 May 2025
A Combinatorial Theory of Dropout: Subnetworks, Graph Geometry, and Generalization Sahil Rajesh Dhayalkar 124 1 0 20 Apr 2025
Adiabatic Fine-Tuning of Neural Quantum States Enables Detection of Phase Transitions in Weight Space Vinicius Hernandes Thomas Spriggs Saqar Khaleefah E. Greplova 85 1 0 21 Mar 2025
Self-Ensembling Gaussian Splatting for Few-Shot Novel View Synthesis Chen Zhao Xuan Wang Tong Zhang Saqib Javed Mathieu Salzmann 3DGS 559 0 0 13 Mar 2025
SplatPose: Geometry-Aware 6-DoF Pose Estimation from Single RGB Image via 3D Gaussian Splatting Linqi Yang Xiongwei Zhao Qihao Sun Ke Wang Ao Chen Peng Kang 3DGS 134 6 0 07 Mar 2025
High-dimensional manifold of solutions in neural networks: insights from statistical physics Enrico M. Malatesta 105 4 0 20 Feb 2025
SuperMerge: An Approach For Gradient-Based Model Merging Haoyu Yang Zheng Zhang Saket Sathe MoMe 211 0 0 17 Feb 2025
CENSOR: Defense Against Gradient Inversion via Orthogonal Subspace Bayesian Sampling Kaiyuan Zhang Siyuan Cheng Guangyu Shen Bruno Ribeiro Shengwei An Pin-Yu Chen Xinming Zhang Ninghui Li 337 2 0 28 Jan 2025
Meta Curvature-Aware Minimization for Domain Generalization Zhaoyu Chen Yiwen Ye Feilong Tang Yongsheng Pan Yong-quan Xia BDL 424 1 0 16 Dec 2024
TabM: Advancing Tabular Deep Learning with Parameter-Efficient Ensembling Yury Gorishniy Akim Kotelnikov Artem Babenko LMTD MoE 266 13 0 31 Oct 2024
Sampling from Bayesian Neural Network Posteriors with Symmetric Minibatch Splitting Langevin Dynamics Daniel Paulin Peter Whalley Neil K. Chada Benedict Leimkuhler BDL 102 4 0 14 Oct 2024
Network Fission Ensembles for Low-Cost Self-Ensembles Hojung Lee Jong-Seok Lee UQCV 144 1 0 05 Aug 2024
Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better En-hao Liu Junyi Zhu Zinan Lin Xuefei Ning Shuaiqi Wang ... Sergey Yekhanin Guohao Dai Huazhong Yang Yu Wang Yu Wang MoMe 165 4 0 02 Apr 2024
Arcee's MergeKit: A Toolkit for Merging Large Language Models Charles Goddard Shamane Siriwardhana Malikeh Ehghaghi Luke Meyers Vladimir Karpukhin Brian Benedict Mark McQuade Jacob Solawetz MoMe KELM 169 101 0 20 Mar 2024
MedMerge: Merging Models for Effective Transfer Learning to Medical Imaging Tasks Ibrahim Almakky Santosh Sanjeev Anees Ur Rehman Hashmi Mohammad Areeb Qazi Mohammad Yaqub Mohammad Yaqub FedML MoMe 149 4 0 18 Mar 2024
Federated Learning over Connected Modes Dennis Grinwald Philipp Wiesner Shinichi Nakajima FedML 182 0 0 05 Mar 2024
Analysis of Linear Mode Connectivity via Permutation-Based Weight Matching: With Insights into Other Permutation Search Methods Akira Ito Masanori Yamada Atsutoshi Kumagai MoMe 136 6 0 06 Feb 2024
Beyond Random Matrix Theory for Deep Networks Diego Granziol 114 16 0 13 Jun 2020
Extrapolation for Large-batch Training in Deep Learning Tao R. Lin Lingjing Kong Sebastian U. Stich Martin Jaggi 90 36 0 10 Jun 2020
Averaging Weights Leads to Wider Optima and Better Generalization Pavel Izmailov Dmitrii Podoprikhin T. Garipov Dmitry Vetrov A. Wilson FedML MoMe 143 1,673 0 14 Mar 2018
Essentially No Barriers in Neural Network Energy Landscape Felix Dräxler K. Veschgini M. Salmhofer Fred Hamprecht MoMe 122 435 0 02 Mar 2018
Visualizing the Loss Landscape of Neural Nets Hao Li Zheng Xu Gavin Taylor Christoph Studer Tom Goldstein 266 1,901 0 28 Dec 2017
On Calibration of Modern Neural Networks Chuan Guo Geoff Pleiss Yu Sun Kilian Q. Weinberger UQCV 299 5,877 0 14 Jun 2017
Snapshot Ensembles: Train 1, get M for free Gao Huang Yixuan Li Geoff Pleiss Zhuang Liu John E. Hopcroft Kilian Q. Weinberger OOD FedML UQCV 147 953 0 01 Apr 2017
Topology and Geometry of Half-Rectified Network Optimization C. Freeman Joan Bruna 224 235 0 04 Nov 2016
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima N. Keskar Dheevatsa Mudigere J. Nocedal M. Smelyanskiy P. T. P. Tang ODL 436 2,946 0 15 Sep 2016
Wide Residual Networks Sergey Zagoruyko N. Komodakis 362 8,005 0 23 May 2016
TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems Martín Abadi Ashish Agarwal P. Barham E. Brevdo Zhiwen Chen ... Pete Warden Martin Wattenberg Martin Wicke Yuan Yu Xiaoqiang Zheng 294 11,155 0 14 Mar 2016
Qualitatively characterizing neural network optimization problems Ian Goodfellow Oriol Vinyals Andrew M. Saxe ODL 116 524 0 19 Dec 2014
Horizontal and Vertical Ensemble with Deep Representation for Classification Jingjing Xie Bing Xu Chuang Zhang SSL 109 76 0 12 Jun 2013