ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.10026
  4. Cited By
Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs
v1v2v3v4 (latest)

Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs

27 February 2018
T. Garipov
Pavel Izmailov
Dmitrii Podoprikhin
Dmitry Vetrov
A. Wilson
    UQCV
ArXiv (abs)PDFHTML

Papers citing "Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs"

31 / 31 papers shown
Title
Understanding Mode Connectivity via Parameter Space Symmetry
Understanding Mode Connectivity via Parameter Space Symmetry
B. Zhao
Nima Dehmamy
Robin Walters
Rose Yu
225
8
0
29 May 2025
Inference-Time Decomposition of Activations (ITDA): A Scalable Approach to Interpreting Large Language Models
Inference-Time Decomposition of Activations (ITDA): A Scalable Approach to Interpreting Large Language Models
Patrick Leask
Neel Nanda
Noura Al Moubayed
81
1
0
23 May 2025
A Combinatorial Theory of Dropout: Subnetworks, Graph Geometry, and Generalization
A Combinatorial Theory of Dropout: Subnetworks, Graph Geometry, and Generalization
Sahil Rajesh Dhayalkar
124
1
0
20 Apr 2025
Adiabatic Fine-Tuning of Neural Quantum States Enables Detection of Phase Transitions in Weight Space
Adiabatic Fine-Tuning of Neural Quantum States Enables Detection of Phase Transitions in Weight Space
Vinicius Hernandes
Thomas Spriggs
Saqar Khaleefah
E. Greplova
85
1
0
21 Mar 2025
Self-Ensembling Gaussian Splatting for Few-Shot Novel View Synthesis
Self-Ensembling Gaussian Splatting for Few-Shot Novel View Synthesis
Chen Zhao
Xuan Wang
Tong Zhang
Saqib Javed
Mathieu Salzmann
3DGS
559
0
0
13 Mar 2025
SplatPose: Geometry-Aware 6-DoF Pose Estimation from Single RGB Image via 3D Gaussian Splatting
Linqi Yang
Xiongwei Zhao
Qihao Sun
Ke Wang
Ao Chen
Peng Kang
3DGS
134
6
0
07 Mar 2025
High-dimensional manifold of solutions in neural networks: insights from statistical physics
High-dimensional manifold of solutions in neural networks: insights from statistical physics
Enrico M. Malatesta
105
4
0
20 Feb 2025
SuperMerge: An Approach For Gradient-Based Model Merging
SuperMerge: An Approach For Gradient-Based Model Merging
Haoyu Yang
Zheng Zhang
Saket Sathe
MoMe
211
0
0
17 Feb 2025
CENSOR: Defense Against Gradient Inversion via Orthogonal Subspace Bayesian Sampling
Kaiyuan Zhang
Siyuan Cheng
Guangyu Shen
Bruno Ribeiro
Shengwei An
Pin-Yu Chen
Xinming Zhang
Ninghui Li
337
2
0
28 Jan 2025
Meta Curvature-Aware Minimization for Domain Generalization
Meta Curvature-Aware Minimization for Domain Generalization
Zhaoyu Chen
Yiwen Ye
Feilong Tang
Yongsheng Pan
Yong-quan Xia
BDL
424
1
0
16 Dec 2024
TabM: Advancing Tabular Deep Learning with Parameter-Efficient Ensembling
TabM: Advancing Tabular Deep Learning with Parameter-Efficient Ensembling
Yury Gorishniy
Akim Kotelnikov
Artem Babenko
LMTDMoE
266
13
0
31 Oct 2024
Sampling from Bayesian Neural Network Posteriors with Symmetric Minibatch Splitting Langevin Dynamics
Sampling from Bayesian Neural Network Posteriors with Symmetric Minibatch Splitting Langevin Dynamics
Daniel Paulin
Peter Whalley
Neil K. Chada
Benedict Leimkuhler
BDL
102
4
0
14 Oct 2024
Network Fission Ensembles for Low-Cost Self-Ensembles
Network Fission Ensembles for Low-Cost Self-Ensembles
Hojung Lee
Jong-Seok Lee
UQCV
144
1
0
05 Aug 2024
Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better
Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better
En-hao Liu
Junyi Zhu
Zinan Lin
Xuefei Ning
Shuaiqi Wang
...
Sergey Yekhanin
Guohao Dai
Huazhong Yang
Yu Wang
Yu Wang
MoMe
165
4
0
02 Apr 2024
Arcee's MergeKit: A Toolkit for Merging Large Language Models
Arcee's MergeKit: A Toolkit for Merging Large Language Models
Charles Goddard
Shamane Siriwardhana
Malikeh Ehghaghi
Luke Meyers
Vladimir Karpukhin
Brian Benedict
Mark McQuade
Jacob Solawetz
MoMeKELM
169
101
0
20 Mar 2024
MedMerge: Merging Models for Effective Transfer Learning to Medical Imaging Tasks
MedMerge: Merging Models for Effective Transfer Learning to Medical Imaging Tasks
Ibrahim Almakky
Santosh Sanjeev
Anees Ur Rehman Hashmi
Mohammad Areeb Qazi
Mohammad Yaqub
Mohammad Yaqub
FedMLMoMe
149
4
0
18 Mar 2024
Federated Learning over Connected Modes
Federated Learning over Connected Modes
Dennis Grinwald
Philipp Wiesner
Shinichi Nakajima
FedML
182
0
0
05 Mar 2024
Analysis of Linear Mode Connectivity via Permutation-Based Weight Matching: With Insights into Other Permutation Search Methods
Analysis of Linear Mode Connectivity via Permutation-Based Weight Matching: With Insights into Other Permutation Search Methods
Akira Ito
Masanori Yamada
Atsutoshi Kumagai
MoMe
136
6
0
06 Feb 2024
Beyond Random Matrix Theory for Deep Networks
Beyond Random Matrix Theory for Deep Networks
Diego Granziol
114
16
0
13 Jun 2020
Extrapolation for Large-batch Training in Deep Learning
Extrapolation for Large-batch Training in Deep Learning
Tao R. Lin
Lingjing Kong
Sebastian U. Stich
Martin Jaggi
90
36
0
10 Jun 2020
Averaging Weights Leads to Wider Optima and Better Generalization
Averaging Weights Leads to Wider Optima and Better Generalization
Pavel Izmailov
Dmitrii Podoprikhin
T. Garipov
Dmitry Vetrov
A. Wilson
FedMLMoMe
143
1,673
0
14 Mar 2018
Essentially No Barriers in Neural Network Energy Landscape
Essentially No Barriers in Neural Network Energy Landscape
Felix Dräxler
K. Veschgini
M. Salmhofer
Fred Hamprecht
MoMe
122
435
0
02 Mar 2018
Visualizing the Loss Landscape of Neural Nets
Visualizing the Loss Landscape of Neural Nets
Hao Li
Zheng Xu
Gavin Taylor
Christoph Studer
Tom Goldstein
266
1,901
0
28 Dec 2017
On Calibration of Modern Neural Networks
On Calibration of Modern Neural Networks
Chuan Guo
Geoff Pleiss
Yu Sun
Kilian Q. Weinberger
UQCV
299
5,877
0
14 Jun 2017
Snapshot Ensembles: Train 1, get M for free
Snapshot Ensembles: Train 1, get M for free
Gao Huang
Yixuan Li
Geoff Pleiss
Zhuang Liu
John E. Hopcroft
Kilian Q. Weinberger
OODFedMLUQCV
147
953
0
01 Apr 2017
Topology and Geometry of Half-Rectified Network Optimization
Topology and Geometry of Half-Rectified Network Optimization
C. Freeman
Joan Bruna
224
235
0
04 Nov 2016
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp
  Minima
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
436
2,946
0
15 Sep 2016
Wide Residual Networks
Wide Residual Networks
Sergey Zagoruyko
N. Komodakis
362
8,005
0
23 May 2016
TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed
  Systems
TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
Martín Abadi
Ashish Agarwal
P. Barham
E. Brevdo
Zhiwen Chen
...
Pete Warden
Martin Wattenberg
Martin Wicke
Yuan Yu
Xiaoqiang Zheng
294
11,155
0
14 Mar 2016
Qualitatively characterizing neural network optimization problems
Qualitatively characterizing neural network optimization problems
Ian Goodfellow
Oriol Vinyals
Andrew M. Saxe
ODL
116
524
0
19 Dec 2014
Horizontal and Vertical Ensemble with Deep Representation for
  Classification
Horizontal and Vertical Ensemble with Deep Representation for Classification
Jingjing Xie
Bing Xu
Chuang Zhang
SSL
109
76
0
12 Jun 2013
1