Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.10026
Cited By
v1
v2
v3
v4 (latest)
Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs
27 February 2018
T. Garipov
Pavel Izmailov
Dmitrii Podoprikhin
Dmitry Vetrov
A. Wilson
UQCV
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs"
31 / 31 papers shown
Title
Understanding Mode Connectivity via Parameter Space Symmetry
B. Zhao
Nima Dehmamy
Robin Walters
Rose Yu
225
8
0
29 May 2025
Inference-Time Decomposition of Activations (ITDA): A Scalable Approach to Interpreting Large Language Models
Patrick Leask
Neel Nanda
Noura Al Moubayed
81
1
0
23 May 2025
A Combinatorial Theory of Dropout: Subnetworks, Graph Geometry, and Generalization
Sahil Rajesh Dhayalkar
124
1
0
20 Apr 2025
Adiabatic Fine-Tuning of Neural Quantum States Enables Detection of Phase Transitions in Weight Space
Vinicius Hernandes
Thomas Spriggs
Saqar Khaleefah
E. Greplova
85
1
0
21 Mar 2025
Self-Ensembling Gaussian Splatting for Few-Shot Novel View Synthesis
Chen Zhao
Xuan Wang
Tong Zhang
Saqib Javed
Mathieu Salzmann
3DGS
559
0
0
13 Mar 2025
SplatPose: Geometry-Aware 6-DoF Pose Estimation from Single RGB Image via 3D Gaussian Splatting
Linqi Yang
Xiongwei Zhao
Qihao Sun
Ke Wang
Ao Chen
Peng Kang
3DGS
134
6
0
07 Mar 2025
High-dimensional manifold of solutions in neural networks: insights from statistical physics
Enrico M. Malatesta
105
4
0
20 Feb 2025
SuperMerge: An Approach For Gradient-Based Model Merging
Haoyu Yang
Zheng Zhang
Saket Sathe
MoMe
211
0
0
17 Feb 2025
CENSOR: Defense Against Gradient Inversion via Orthogonal Subspace Bayesian Sampling
Kaiyuan Zhang
Siyuan Cheng
Guangyu Shen
Bruno Ribeiro
Shengwei An
Pin-Yu Chen
Xinming Zhang
Ninghui Li
337
2
0
28 Jan 2025
Meta Curvature-Aware Minimization for Domain Generalization
Zhaoyu Chen
Yiwen Ye
Feilong Tang
Yongsheng Pan
Yong-quan Xia
BDL
424
1
0
16 Dec 2024
TabM: Advancing Tabular Deep Learning with Parameter-Efficient Ensembling
Yury Gorishniy
Akim Kotelnikov
Artem Babenko
LMTD
MoE
266
13
0
31 Oct 2024
Sampling from Bayesian Neural Network Posteriors with Symmetric Minibatch Splitting Langevin Dynamics
Daniel Paulin
Peter Whalley
Neil K. Chada
Benedict Leimkuhler
BDL
102
4
0
14 Oct 2024
Network Fission Ensembles for Low-Cost Self-Ensembles
Hojung Lee
Jong-Seok Lee
UQCV
144
1
0
05 Aug 2024
Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better
En-hao Liu
Junyi Zhu
Zinan Lin
Xuefei Ning
Shuaiqi Wang
...
Sergey Yekhanin
Guohao Dai
Huazhong Yang
Yu Wang
Yu Wang
MoMe
165
4
0
02 Apr 2024
Arcee's MergeKit: A Toolkit for Merging Large Language Models
Charles Goddard
Shamane Siriwardhana
Malikeh Ehghaghi
Luke Meyers
Vladimir Karpukhin
Brian Benedict
Mark McQuade
Jacob Solawetz
MoMe
KELM
169
101
0
20 Mar 2024
MedMerge: Merging Models for Effective Transfer Learning to Medical Imaging Tasks
Ibrahim Almakky
Santosh Sanjeev
Anees Ur Rehman Hashmi
Mohammad Areeb Qazi
Mohammad Yaqub
Mohammad Yaqub
FedML
MoMe
149
4
0
18 Mar 2024
Federated Learning over Connected Modes
Dennis Grinwald
Philipp Wiesner
Shinichi Nakajima
FedML
182
0
0
05 Mar 2024
Analysis of Linear Mode Connectivity via Permutation-Based Weight Matching: With Insights into Other Permutation Search Methods
Akira Ito
Masanori Yamada
Atsutoshi Kumagai
MoMe
136
6
0
06 Feb 2024
Beyond Random Matrix Theory for Deep Networks
Diego Granziol
114
16
0
13 Jun 2020
Extrapolation for Large-batch Training in Deep Learning
Tao R. Lin
Lingjing Kong
Sebastian U. Stich
Martin Jaggi
90
36
0
10 Jun 2020
Averaging Weights Leads to Wider Optima and Better Generalization
Pavel Izmailov
Dmitrii Podoprikhin
T. Garipov
Dmitry Vetrov
A. Wilson
FedML
MoMe
143
1,673
0
14 Mar 2018
Essentially No Barriers in Neural Network Energy Landscape
Felix Dräxler
K. Veschgini
M. Salmhofer
Fred Hamprecht
MoMe
122
435
0
02 Mar 2018
Visualizing the Loss Landscape of Neural Nets
Hao Li
Zheng Xu
Gavin Taylor
Christoph Studer
Tom Goldstein
266
1,901
0
28 Dec 2017
On Calibration of Modern Neural Networks
Chuan Guo
Geoff Pleiss
Yu Sun
Kilian Q. Weinberger
UQCV
299
5,877
0
14 Jun 2017
Snapshot Ensembles: Train 1, get M for free
Gao Huang
Yixuan Li
Geoff Pleiss
Zhuang Liu
John E. Hopcroft
Kilian Q. Weinberger
OOD
FedML
UQCV
147
953
0
01 Apr 2017
Topology and Geometry of Half-Rectified Network Optimization
C. Freeman
Joan Bruna
224
235
0
04 Nov 2016
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
436
2,946
0
15 Sep 2016
Wide Residual Networks
Sergey Zagoruyko
N. Komodakis
362
8,005
0
23 May 2016
TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
Martín Abadi
Ashish Agarwal
P. Barham
E. Brevdo
Zhiwen Chen
...
Pete Warden
Martin Wattenberg
Martin Wicke
Yuan Yu
Xiaoqiang Zheng
294
11,155
0
14 Mar 2016
Qualitatively characterizing neural network optimization problems
Ian Goodfellow
Oriol Vinyals
Andrew M. Saxe
ODL
116
524
0
19 Dec 2014
Horizontal and Vertical Ensemble with Deep Representation for Classification
Jingjing Xie
Bing Xu
Chuang Zhang
SSL
109
76
0
12 Jun 2013
1