Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1803.00885
Cited By
v1
v2
v3
v4
v5 (latest)
Essentially No Barriers in Neural Network Energy Landscape
2 March 2018
Felix Dräxler
K. Veschgini
M. Salmhofer
Fred Hamprecht
MoMe
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Essentially No Barriers in Neural Network Energy Landscape"
50 / 295 papers shown
Title
Hidden Breakthroughs in Language Model Training
Sara Kangaslahti
Elan Rosenfeld
Naomi Saphra
28
0
0
18 Jun 2025
Symmetry in Neural Network Parameter Spaces
Bo Zhao
Robin Walters
Rose Yu
27
0
0
16 Jun 2025
Circumventing Backdoor Space via Weight Symmetry
Jie Peng
Hongwei Yang
Jing Zhao
Hengji Dong
Hui He
Weizhe Zhang
Haoyu He
AAML
17
0
0
09 Jun 2025
Walking the Weight Manifold: a Topological Approach to Conditioning Inspired by Neuromodulation
Ari S. Benjamin
Kyle Daruwalla
Christian Pehle
Anthony M. Zador
55
0
0
29 May 2025
Understanding Mode Connectivity via Parameter Space Symmetry
B. Zhao
Nima Dehmamy
Robin Walters
Rose Yu
236
8
0
29 May 2025
Decom-Renorm-Merge: Model Merging on the Right Space Improves Multitasking
Yuatyong Chaichana
Thanapat Trachu
Peerat Limkonchotiwat
Konpat Preechakul
Tirasan Khandhawit
Ekapol Chuangsuwanich
MoMe
82
0
0
29 May 2025
SGD as Free Energy Minimization: A Thermodynamic View on Neural Network Training
Ildus Sadrtdinov
Ivan Klimov
E. Lobacheva
Dmitry Vetrov
35
0
0
29 May 2025
Update Your Transformer to the Latest Release: Re-Basin of Task Vectors
Filippo Rinaldi
Giacomo Capitani
Lorenzo Bonicelli
Donato Crisostomi
Federico Bolelli
E. Ficarra
Emanuele Rodolà
Simone Calderara
Angelo Porrello
26
0
0
28 May 2025
Benignity of loss landscape with weight decay requires both large overparametrization and initialization
Etienne Boursier
Matthew Bowditch
Matthias Englert
R. Lazic
56
0
0
28 May 2025
Semantic Aware Linear Transfer by Recycling Pre-trained Language Models for Cross-lingual Transfer
Seungyoon Lee
Seongtae Hong
Hyeonseok Moon
Heuiseok Lim
KELM
90
0
0
16 May 2025
Low-Loss Space in Neural Networks is Continuous and Fully Connected
Yongding Tian
Zaid Al-Ars
Maksim Kitsak
P. Hofstee
3DPC
116
1
0
05 May 2025
Towards Quantifying the Hessian Structure of Neural Networks
Zhaorui Dong
Yushun Zhang
Zhi-Quan Luo
Jianfeng Yao
Ruoyu Sun
77
1
0
05 May 2025
Dynamic Fisher-weighted Model Merging via Bayesian Optimization
Sanwoo Lee
Jiahao Liu
Qifan Wang
Jiadong Wang
Xunliang Cai
Yunfang Wu
MoMe
464
1
0
26 Apr 2025
A Combinatorial Theory of Dropout: Subnetworks, Graph Geometry, and Generalization
Sahil Rajesh Dhayalkar
130
1
0
20 Apr 2025
Understanding Machine Unlearning Through the Lens of Mode Connectivity
Jiali Cheng
Hadi Amiri
MU
458
1
0
08 Apr 2025
Finding Stable Subnetworks at Initialization with Dataset Distillation
Luke McDermott
Rahul Parhi
DD
87
0
0
23 Mar 2025
Analyzing the Role of Permutation Invariance in Linear Mode Connectivity
Keyao Zhan
Puheng Li
Lei Wu
MoMe
113
0
0
13 Mar 2025
SplatPose: Geometry-Aware 6-DoF Pose Estimation from Single RGB Image via 3D Gaussian Splatting
Linqi Yang
Xiongwei Zhao
Qihao Sun
Ke Wang
Ao Chen
Peng Kang
3DGS
138
0
0
07 Mar 2025
Paths and Ambient Spaces in Neural Loss Landscapes
Daniel Dold
Julius Kobialka
Nicolai Palm
Emanuel Sommer
David Rügamer
Oliver Durr
AI4CE
126
0
0
05 Mar 2025
Deep Learning is Not So Mysterious or Different
Andrew Gordon Wilson
101
6
0
03 Mar 2025
Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You Think
Jie Tian
Xiaoye Qu
Zhenyi Lu
Xiaoye Qu
Sichen Liu
Yu Cheng
DiffM
VGen
81
4
0
02 Mar 2025
Low-Rank and Sparse Model Merging for Multi-Lingual Speech Recognition and Translation
Qiuming Zhao
Guangzhi Sun
Chao Zhang
MoMe
VLM
457
1
0
24 Feb 2025
Rethinking Spiking Neural Networks from an Ensemble Learning Perspective
Lin Zuo
Yongqi Ding
Mengmeng Jing
Pei He
Hanpu Deng
100
3
0
20 Feb 2025
High-dimensional manifold of solutions in neural networks: insights from statistical physics
Enrico M. Malatesta
114
4
0
20 Feb 2025
Unveiling Mode Connectivity in Graph Neural Networks
Bingheng Li
Z. Chen
Haoyu Han
Shenglai Zeng
J. Liu
Jiliang Tang
88
1
0
18 Feb 2025
In-Context Parametric Inference: Point or Distribution Estimators?
Sarthak Mittal
Yoshua Bengio
Nikolay Malkin
Guillaume Lajoie
130
0
0
17 Feb 2025
Dissecting a Small Artificial Neural Network
Xiguang Yang
Krish Arora
Michael Bachmann
89
0
0
03 Jan 2025
Parameter-Efficient Interventions for Enhanced Model Merging
Marcin Osial
Daniel Marczak
Bartosz Zieliñski
MoMe
150
1
0
22 Dec 2024
Non-Uniform Parameter-Wise Model Merging
Albert Manuel Orozco Camacho
Stefan Horoi
Guy Wolf
Eugene Belilovsky
MoMe
FedML
142
0
0
20 Dec 2024
Implicit Neural Compression of Point Clouds
Hongning Ruan
Yulin Shao
Qianqian Yang
Liang Zhao
Zhaoyang Zhang
Dusit Niyato
3DPC
116
1
0
11 Dec 2024
Task Arithmetic Through The Lens Of One-Shot Federated Learning
Zhixu Tao
I. Mason
Sanjeev R. Kulkarni
Xavier Boix
MoMe
FedML
131
7
0
27 Nov 2024
CopRA: A Progressive LoRA Training Strategy
Zhan Zhuang
Xiequn Wang
Yulong Zhang
Wei Li
Yu Zhang
Ying Wei
116
1
0
30 Oct 2024
Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging
Li Shen
Anke Tang
Enneng Yang
G. Guo
Yong Luo
Lefei Zhang
Xiaochun Cao
Di Lin
Dacheng Tao
MoMe
83
9
0
29 Oct 2024
Subgraph Aggregation for Out-of-Distribution Generalization on Graphs
Bowen Liu
Haoyang Li
Shuning Wang
Shuo Nie
Shanghang Zhang
OODD
CML
172
0
0
29 Oct 2024
Model merging with SVD to tie the Knots
George Stoica
Pratik Ramesh
B. Ecsedi
Leshem Choshen
Judy Hoffman
MoMe
99
19
0
25 Oct 2024
In Search of the Successful Interpolation: On the Role of Sharpness in CLIP Generalization
Alireza Abdollahpoorrostam
57
0
0
21 Oct 2024
Uncovering, Explaining, and Mitigating the Superficial Safety of Backdoor Defense
Rui Min
Zeyu Qin
Nevin L. Zhang
Li Shen
Minhao Cheng
AAML
91
4
0
13 Oct 2024
What Matters for Model Merging at Scale?
Prateek Yadav
Tu Vu
Jonathan Lai
Alexandra Chronopoulou
Manaal Faruqui
Joey Tianyi Zhou
Tsendsuren Munkhdalai
MoMe
107
22
0
04 Oct 2024
Foldable SuperNets: Scalable Merging of Transformers with Different Initializations and Tasks
Edan Kinderman
Itay Hubara
Haggai Maron
Daniel Soudry
MoMe
104
2
0
02 Oct 2024
Input Space Mode Connectivity in Deep Neural Networks
Jakub Vrabel
Ori Shem-Ur
Yaron Oz
David Krueger
112
1
0
09 Sep 2024
Weight Scope Alignment: A Frustratingly Easy Method for Model Merging
Yichu Xu
Xin-Chun Li
Le Gan
De-Chuan Zhan
MoMe
91
0
0
22 Aug 2024
SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models
Anke Tang
Li Shen
Yong Luo
Shuai Xie
Han Hu
Lefei Zhang
Di Lin
Dacheng Tao
MoMe
104
4
0
19 Aug 2024
Enhancing material property prediction with ensemble deep graph convolutional networks
Chowdhury Mohammad Abid Rahman
Ghadendra B. Bhandari
Nasser M. Nasrabadi
Aldo H. Romero
P. Gyawali
AI4CE
84
4
0
26 Jul 2024
Flatness-aware Sequential Learning Generates Resilient Backdoors
Hoang Pham
The-Anh Ta
Anh Tran
Khoa D. Doan
FedML
AAML
76
1
0
20 Jul 2024
Exploring End-to-end Differentiable Neural Charged Particle Tracking -- A Loss Landscape Perspective
T. Kortus
Ralf Keidel
N.R. Gauger
133
0
0
18 Jul 2024
Harmony in Diversity: Merging Neural Networks with Canonical Correlation Analysis
Stefan Horoi
Albert Manuel Orozco Camacho
Eugene Belilovsky
Guy Wolf
FedML
MoMe
58
10
0
07 Jul 2024
Adaptive Stochastic Weight Averaging
Caglar Demir
Arnab Sharma
Axel-Cyrille Ngonga Ngomo
MoMe
69
1
0
27 Jun 2024
MD tree: a model-diagnostic tree grown on loss landscape
Yefan Zhou
Jianlong Chen
Qinxue Cao
Konstantin Schürholt
Yaoqing Yang
102
2
0
24 Jun 2024
Landscaping Linear Mode Connectivity
Sidak Pal Singh
Linara Adilova
Michael Kamp
Asja Fischer
Bernhard Scholkopf
Thomas Hofmann
120
6
0
24 Jun 2024
Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging
Zhenyi Lu
Chenghao Fan
Wei Wei
Xiaoye Qu
Dangyang Chen
Yu Cheng
MoMe
126
63
0
17 Jun 2024
1
2
3
4
5
6
Next