1803.05407
Averaging Weights Leads to Wider Optima and Better Generalization
14 March 2018
Pavel Izmailov
Dmitrii Podoprikhin
T. Garipov
Dmitry Vetrov
A. Wilson
FedML
MoMe
Papers citing
"Averaging Weights Leads to Wider Optima and Better Generalization"
50 / 1,040 papers shown
A Comprehensive Study on Robustness of Image Classification Models: Benchmarking and Rethinking
Chang-Shu Liu
Yinpeng Dong
Wenzhao Xiang
Xiaohu Yang
Hang Su
Junyi Zhu
YueFeng Chen
Yuan He
H. Xue
Shibao Zheng
OOD
VLM
AAML
115
85
0
28 Feb 2023
Analyzing Populations of Neural Networks via Dynamical Model Embedding
Jordan S. Cotler
Kai Sheng Tai
Felipe Hernández
Blake Elias
David Sussillo
100
4
0
27 Feb 2023
Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?
Ruisi Cai
Zhenyu Zhang
Zhangyang Wang
AAML
OOD
91
12
0
24 Feb 2023
Domain Generalisation via Domain Adaptation: An Adversarial Fourier Amplitude Approach
Minyoung Kim
Da Li
Timothy M. Hospedales
OOD
54
11
0
23 Feb 2023
Personalized Privacy-Preserving Framework for Cross-Silo Federated Learning
Van Tuan Tran
Huy Hieu Pham
Kok-Seng Wong
FedML
98
8
0
22 Feb 2023
Seasoning Model Soups for Robustness to Adversarial and Natural Distribution Shifts
Francesco Croce
Sylvestre-Alvise Rebuffi
Evan Shelhamer
Sven Gowal
AAML
79
18
0
20 Feb 2023
Why is parameter averaging beneficial in SGD? An objective smoothing perspective
Atsushi Nitanda
Ryuhei Kikuchi
Shugo Maeda
Denny Wu
FedML
51
0
0
18 Feb 2023
Calibrating the Rigged Lottery: Making All Tickets Reliable
Bowen Lei
Ruqi Zhang
Dongkuan Xu
Bani Mallick
UQCV
111
7
0
18 Feb 2023
Scalable Bayesian optimization with high-dimensional outputs using randomized prior networks
Mohamed Aziz Bhouri
M. Joly
Robert Yu
S. Sarkar
P. Perdikaris
BDL
UQCV
AI4CE
77
1
0
14 Feb 2023
A Modern Look at the Relationship between Sharpness and Generalization
Maksym Andriushchenko
Francesco Croce
Maximilian Müller
Matthias Hein
Nicolas Flammarion
3DH
138
63
0
14 Feb 2023
Revisiting Weighted Aggregation in Federated Learning with Neural Networks
Zexi Li
Tao R. Lin
Xinyi Shang
Chao-Xiang Wu
FedML
102
65
0
14 Feb 2023
FilFL: Client Filtering for Optimized Client Participation in Federated Learning
Fares Fourati
Salma Kharrat
Vaneet Aggarwal
Mohamed-Slim Alouini
Marco Canini
FedML
75
4
0
13 Feb 2023
Contour-based Interactive Segmentation
Danil Galeev
Polina Popenova
Anna Vorontsova
Anton Konushin
84
5
0
13 Feb 2023
Autoselection of the Ensemble of Convolutional Neural Networks with Second-Order Cone Programming
Buse Çisil Güldoğuş
Abdullah Nazhat Abdullah
Muhammad Ammar Ali
Süreyya Özögür-Akyüz
73
0
0
12 Feb 2023
Sparse Mutation Decompositions: Fine Tuning Deep Neural Networks with Subspace Evolution
Tim Whitaker
L. D. Whitley
57
0
0
12 Feb 2023
Data efficiency and extrapolation trends in neural network interatomic potentials
Joshua A Vita
Daniel Schwalbe-Koda
73
17
0
12 Feb 2023
Making Substitute Models More Bayesian Can Enhance Transferability of Adversarial Examples
Qizhang Li
Yiwen Guo
W. Zuo
Hao Chen
AAML
125
37
0
10 Feb 2023
Toward Degree Bias in Embedding-Based Knowledge Graph Completion
Harry Shomer
Wei Jin
Wentao Wang
Jiliang Tang
47
25
0
10 Feb 2023
Better Diffusion Models Further Improve Adversarial Training
Zekai Wang
Tianyu Pang
Chao Du
Min Lin
Weiwei Liu
Shuicheng Yan
DiffM
106
228
0
09 Feb 2023
Generalization in Graph Neural Networks: Improved PAC-Bayesian Bounds on Graph Diffusion
Haotian Ju
Dongyue Li
Aneesh Sharma
Hongyang R. Zhang
59
41
0
09 Feb 2023
Exploring and Exploiting Decision Boundary Dynamics for Adversarial Robustness
Yuancheng Xu
Yanchao Sun
Micah Goldblum
Tom Goldstein
Furong Huang
AAML
92
38
0
06 Feb 2023
Flat Seeking Bayesian Neural Networks
Van-Anh Nguyen
L. Vuong
Hoang Phan
Thanh-Toan Do
Dinh Q. Phung
Trung Le
BDL
100
10
0
06 Feb 2023
Variational Inference on the Final-Layer Output of Neural Networks
Yadi Wei
Roni Khardon
BDL
UQCV
91
0
0
05 Feb 2023
Generalized Uncertainty of Deep Neural Networks: Taxonomy and Applications
Chengyu Dong
OOD
UQCV
BDL
AI4CE
128
0
0
02 Feb 2023
A Survey of Deep Learning: From Activations to Transformers
Johannes Schneider
Michalis Vlachos
ViT
MedIm
AI4TS
AI4CE
112
10
0
01 Feb 2023
A Comprehensive Survey of Continual Learning: Theory, Method and Application
Liyuan Wang
Xingxing Zhang
Hang Su
Jun Zhu
KELM
CLL
236
714
0
31 Jan 2023
Cross-Architectural Positive Pairs improve the effectiveness of Self-Supervised Learning
P. Singh
Jacopo Cirrone
SSL
117
0
0
27 Jan 2023
Exploring the Effect of Multi-step Ascent in Sharpness-Aware Minimization
Hoki Kim
Jinseong Park
Yujin Choi
Woojin Lee
Jaewook Lee
44
9
0
27 Jan 2023
Backward Compatibility During Data Updates by Weight Interpolation
Raphael Schumann
Elman Mansimov
Yi-An Lai
Nikolaos Pappas
Xibin Gao
Yi Zhang
44
5
0
25 Jan 2023
Model soups to increase inference without increasing compute time
Charles Dansereau
Milo Sobral
Maninder Bhogal
Mehdi Zalai
23
2
0
24 Jan 2023
Stability Analysis of Sharpness-Aware Minimization
Hoki Kim
Jinseong Park
Yujin Choi
Jaewook Lee
78
13
0
16 Jan 2023
Training trajectories, mini-batch losses and the curious role of the learning rate
Mark Sandler
A. Zhmoginov
Max Vladymyrov
Nolan Miller
ODL
90
12
0
05 Jan 2023
Audio-Visual Efficient Conformer for Robust Speech Recognition
Maxime Burchi
Radu Timofte
VLM
78
35
0
04 Jan 2023
Recent Advances on Federated Learning: A Systematic Survey
Bingyan Liu
Nuoyan Lv
Yuanchun Guo
Yawen Li
FedML
118
89
0
03 Jan 2023
Self-Activating Neural Ensembles for Continual Reinforcement Learning
Sam Powers
Eliot Xing
Abhinav Gupta
KELM
CLL
86
5
0
31 Dec 2022
Do Bayesian Variational Autoencoders Know What They Don't Know?
Misha Glazunov
Apostolis Zarras
UQCV
BDL
63
5
0
29 Dec 2022
Frequency Regularization for Improving Adversarial Robustness
Binxiao Huang
Chaofan Tao
R. Lin
Ngai Wong
AAML
34
4
0
24 Dec 2022
Training Integer-Only Deep Recurrent Neural Networks
V. Nia
Eyyub Sari
Vanessa Courville
M. Asgharian
MQ
96
2
0
22 Dec 2022
Revisiting Residual Networks for Adversarial Robustness: An Architectural Perspective
Shihua Huang
Zhichao Lu
Kalyanmoy Deb
Vishnu Boddeti
OOD
102
45
0
21 Dec 2022
KL Regularized Normalization Framework for Low Resource Tasks
Neeraj Kumar
Ankur Narang
Brejesh Lall
58
1
0
21 Dec 2022
Model Ratatouille: Recycling Diverse Models for Out-of-Distribution Generalization
Alexandre Ramé
Kartik Ahuja
Jianyu Zhang
Matthieu Cord
Léon Bottou
David Lopez-Paz
MoMe
OODD
128
86
0
20 Dec 2022
Variational Factorization Machines for Preference Elicitation in Large-Scale Recommender Systems
Jill-Jênn Vie
Tomas Rigaux
H. Kashima
BDL
131
1
0
20 Dec 2022
Dataless Knowledge Fusion by Merging Weights of Language Models
Xisen Jin
Xiang Ren
Daniel Preoţiuc-Pietro
Pengxiang Cheng
FedML
MoMe
99
250
0
19 Dec 2022
A Probabilistic Framework for Lifelong Test-Time Adaptation
Dhanajit Brahma
Piyush Rai
TTA
68
36
0
19 Dec 2022
The Underlying Correlated Dynamics in Neural Training
Rotem Turjeman
Tom Berkov
I. Cohen
Guy Gilboa
70
3
0
18 Dec 2022
Bayesian posterior approximation with stochastic ensembles
Oleksandr Balabanov
Bernhard Mehlig
Hampus Linander
BDL
UQCV
120
5
0
15 Dec 2022
Generative Robust Classification
Xuwang Yin
TPM
53
0
0
14 Dec 2022
Efficient Bayesian Uncertainty Estimation for nnU-Net
Yidong Zhao
Changchun Yang
Artur M. Schweidtmann
Qian Tao
UQCV
BDL
62
22
0
12 Dec 2022
Improving Generalization of Pre-trained Language Models via Stochastic Weight Averaging
Peng Lu
I. Kobyzev
Mehdi Rezagholizadeh
Ahmad Rashid
A. Ghodsi
Philippe Langlais
MoMe
100
11
0
12 Dec 2022
A New Linear Scaling Rule for Private Adaptive Hyperparameter Optimization
Ashwinee Panda
Xinyu Tang
Saeed Mahloujifar
Vikash Sehwag
Prateek Mittal
126
12
0
08 Dec 2022