ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.05407
  4. Cited By
Averaging Weights Leads to Wider Optima and Better Generalization
v1v2v3 (latest)

Averaging Weights Leads to Wider Optima and Better Generalization

14 March 2018
Pavel Izmailov
Dmitrii Podoprikhin
T. Garipov
Dmitry Vetrov
A. Wilson
    FedMLMoMe
ArXiv (abs)PDFHTML

Papers citing "Averaging Weights Leads to Wider Optima and Better Generalization"

50 / 1,040 papers shown
Title
MEDFAIR: Benchmarking Fairness for Medical Imaging
MEDFAIR: Benchmarking Fairness for Medical Imaging
Yongshuo Zong
Yongxin Yang
Timothy M. Hospedales
OOD
173
66
0
04 Oct 2022
Stability Analysis and Generalization Bounds of Adversarial Training
Stability Analysis and Generalization Bounds of Adversarial Training
Jiancong Xiao
Yanbo Fan
Ruoyu Sun
Jue Wang
Zhimin Luo
AAML
85
31
0
03 Oct 2022
Ensembling improves stability and power of feature selection for deep
  learning models
Ensembling improves stability and power of feature selection for deep learning models
P. Gyawali
Xiaoxia Liu
James Zou
Zihuai He
OODFedML
123
6
0
02 Oct 2022
Adaptive Smoothness-weighted Adversarial Training for Multiple
  Perturbations with Its Stability Analysis
Adaptive Smoothness-weighted Adversarial Training for Multiple Perturbations with Its Stability Analysis
Jiancong Xiao
Zeyu Qin
Yanbo Fan
Baoyuan Wu
Jue Wang
Zhimin Luo
AAML
124
7
0
02 Oct 2022
Improving Robustness with Adaptive Weight Decay
Improving Robustness with Adaptive Weight Decay
Amin Ghiasi
Ali Shafahi
R. Ardekani
OOD
46
8
0
30 Sep 2022
Stop Wasting My Time! Saving Days of ImageNet and BERT Training with
  Latest Weight Averaging
Stop Wasting My Time! Saving Days of ImageNet and BERT Training with Latest Weight Averaging
Jean Kaddour
MoMe3DH
85
41
0
29 Sep 2022
Learning Gradient-based Mixup towards Flatter Minima for Domain
  Generalization
Learning Gradient-based Mixup towards Flatter Minima for Domain Generalization
Danni Peng
Sinno Jialin Pan
64
3
0
29 Sep 2022
Two-Tailed Averaging: Anytime, Adaptive, Once-in-a-While Optimal Weight
  Averaging for Better Generalization
Two-Tailed Averaging: Anytime, Adaptive, Once-in-a-While Optimal Weight Averaging for Better Generalization
Gábor Melis
MoMe
93
1
0
26 Sep 2022
Strong Transferable Adversarial Attacks via Ensembled Asymptotically
  Normal Distribution Learning
Strong Transferable Adversarial Attacks via Ensembled Asymptotically Normal Distribution Learning
Zhengwei Fang
Rui Wang
Tao Huang
L. Jing
AAML
73
8
0
24 Sep 2022
Random initialisations performing above chance and how to find them
Random initialisations performing above chance and how to find them
Frederik Benzing
Simon Schug
Robert Meier
J. Oswald
Yassir Akram
Nicolas Zucchet
Laurence Aitchison
Angelika Steger
ODL
119
26
0
15 Sep 2022
Git Re-Basin: Merging Models modulo Permutation Symmetries
Git Re-Basin: Merging Models modulo Permutation Symmetries
Samuel K. Ainsworth
J. Hayase
S. Srinivasa
MoMe
331
344
0
11 Sep 2022
MMV_Im2Im: An Open Source Microscopy Machine Vision Toolbox for
  Image-to-Image Transformation
MMV_Im2Im: An Open Source Microscopy Machine Vision Toolbox for Image-to-Image Transformation
Justin Sonneck
Jianxu Chen
VLMMedIm
96
5
0
06 Sep 2022
Generalisation under gradient descent via deterministic PAC-Bayes
Generalisation under gradient descent via deterministic PAC-Bayes
Eugenio Clerico
Tyler Farghly
George Deligiannidis
Benjamin Guedj
Arnaud Doucet
152
4
0
06 Sep 2022
Investigating the Impact of Model Misspecification in Neural
  Simulation-based Inference
Investigating the Impact of Model Misspecification in Neural Simulation-based Inference
Patrick W Cannon
Daniel Ward
Sebastian M. Schmon
78
36
0
05 Sep 2022
Ensembling Neural Networks for Improved Prediction and Privacy in Early
  Diagnosis of Sepsis
Ensembling Neural Networks for Improved Prediction and Privacy in Early Diagnosis of Sepsis
Shigehiko Schamoni
Michael Hagmann
Stefan Riezler
FedML
58
4
0
01 Sep 2022
Semi-Supervised and Unsupervised Deep Visual Learning: A Survey
Semi-Supervised and Unsupervised Deep Visual Learning: A Survey
Yanbei Chen
Massimiliano Mancini
Xiatian Zhu
Zeynep Akata
157
121
0
24 Aug 2022
Lottery Pools: Winning More by Interpolating Tickets without Increasing
  Training or Inference Cost
Lottery Pools: Winning More by Interpolating Tickets without Increasing Training or Inference Cost
Lu Yin
Shiwei Liu
Fang Meng
Tianjin Huang
Vlado Menkovski
Mykola Pechenizkiy
54
13
0
23 Aug 2022
A Unified Analysis of Mixed Sample Data Augmentation: A Loss Function
  Perspective
A Unified Analysis of Mixed Sample Data Augmentation: A Loss Function Perspective
Chanwoo Park
Sangdoo Yun
Sanghyuk Chun
AAML
83
32
0
21 Aug 2022
Uncertainty Quantification for Traffic Forecasting: A Unified Approach
Uncertainty Quantification for Traffic Forecasting: A Unified Approach
Weizhu Qian
Dalin Zhang
Yan Zhao
Kai Zheng
James Jianqiao Yu
BDLAI4TS
69
23
0
11 Aug 2022
Semi-supervised Vision Transformers at Scale
Semi-supervised Vision Transformers at Scale
Zhaowei Cai
Avinash Ravichandran
Paolo Favaro
Manchen Wang
Davide Modolo
Rahul Bhotika
Zhuowen Tu
Stefano Soatto
ViT
108
58
0
11 Aug 2022
Patching open-vocabulary models by interpolating weights
Patching open-vocabulary models by interpolating weights
Gabriel Ilharco
Mitchell Wortsman
S. Gadre
Shuran Song
Hannaneh Hajishirzi
Simon Kornblith
Ali Farhadi
Ludwig Schmidt
VLMKELM
144
176
0
10 Aug 2022
Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language
  Models
Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language Models
Margaret Li
Suchin Gururangan
Tim Dettmers
M. Lewis
Tim Althoff
Noah A. Smith
Luke Zettlemoyer
MoMe
110
154
0
05 Aug 2022
Interpretable Uncertainty Quantification in AI for HEP
Interpretable Uncertainty Quantification in AI for HEP
Thomas Y. Chen
B. Dey
A. Ghosh
Michael Kagan
Brian D. Nord
Nesar Ramachandra
72
7
0
05 Aug 2022
Parameter Averaging for Feature Ranking
Parameter Averaging for Feature Ranking
Talip Uçar
Ehsan Hajiramezanali
32
0
0
05 Aug 2022
PEA: Improving the Performance of ReLU Networks for Free by Using
  Progressive Ensemble Activations
PEA: Improving the Performance of ReLU Networks for Free by Using Progressive Ensemble Activations
Á. Utasi
49
0
0
28 Jul 2022
Image sensing with multilayer, nonlinear optical neural networks
Image sensing with multilayer, nonlinear optical neural networks
Tianyu Wang
Mandar M. Sohoni
Logan G. Wright
Martin M. Stein
Shifan Ma
Tatsuhiro Onodera
Maxwell G. Anderson
Peter L. McMahon
67
159
0
27 Jul 2022
Learning Hyper Label Model for Programmatic Weak Supervision
Learning Hyper Label Model for Programmatic Weak Supervision
Renzhi Wu
Sheng Chen
Jieyu Zhang
Xu Chu
130
17
0
27 Jul 2022
LGV: Boosting Adversarial Example Transferability from Large Geometric
  Vicinity
LGV: Boosting Adversarial Example Transferability from Large Geometric Vicinity
Martin Gubri
Maxime Cordy
Mike Papadakis
Yves Le Traon
Koushik Sen
AAML
77
55
0
26 Jul 2022
Efficient One Pass Self-distillation with Zipf's Label Smoothing
Efficient One Pass Self-distillation with Zipf's Label Smoothing
Jiajun Liang
Linze Li
Z. Bing
Borui Zhao
Yao Tang
Bo Lin
Haoqiang Fan
53
19
0
26 Jul 2022
Time Series Prediction under Distribution Shift using Differentiable
  Forgetting
Time Series Prediction under Distribution Shift using Differentiable Forgetting
Stefanos Bennett
J. Clarkson
OODAI4TS
30
4
0
23 Jul 2022
Improving Predictive Performance and Calibration by Weight Fusion in
  Semantic Segmentation
Improving Predictive Performance and Calibration by Weight Fusion in Semantic Segmentation
Timo Sämann
A. Hammam
Andrei Bursuc
Christoph Stiller
H. Groß
FedML
53
1
0
22 Jul 2022
Learning from Data with Noisy Labels Using Temporal Self-Ensemble
Learning from Data with Noisy Labels Using Temporal Self-Ensemble
Jun Ho Lee
J. Baik
Taebaek Hwang
J. Choi
NoLa
50
1
0
21 Jul 2022
Scene Text Recognition with Permuted Autoregressive Sequence Models
Scene Text Recognition with Permuted Autoregressive Sequence Models
Darwin Bautista
Rowel Atienza
108
173
0
14 Jul 2022
A Data-Efficient Deep Learning Framework for Segmentation and
  Classification of Histopathology Images
A Data-Efficient Deep Learning Framework for Segmentation and Classification of Histopathology Images
Pranav Singh
Jacopo Cirrone
117
11
0
13 Jul 2022
TCT: Convexifying Federated Learning using Bootstrapped Neural Tangent
  Kernels
TCT: Convexifying Federated Learning using Bootstrapped Neural Tangent Kernels
Yaodong Yu
Alexander Wei
Sai Praneeth Karimireddy
Yi-An Ma
Michael I. Jordan
FedML
80
31
0
13 Jul 2022
YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for
  real-time object detectors
YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
Chien-Yao Wang
Alexey Bochkovskiy
H. Liao
ObjD
179
6,645
0
06 Jul 2022
The alignment property of SGD noise and how it helps select flat minima:
  A stability analysis
The alignment property of SGD noise and how it helps select flat minima: A stability analysis
Lei Wu
Mingze Wang
Weijie Su
MLT
101
34
0
06 Jul 2022
ST-CoNAL: Consistency-Based Acquisition Criterion Using Temporal
  Self-Ensemble for Active Learning
ST-CoNAL: Consistency-Based Acquisition Criterion Using Temporal Self-Ensemble for Active Learning
J. Baik
In Young Yoon
J. Choi
66
0
0
05 Jul 2022
Federated Self-supervised Learning for Video Understanding
Federated Self-supervised Learning for Video Understanding
Yasar Abbas Ur Rehman
Yan Gao
Jiajun Shen
Pedro Porto Buarque de Gusmão
Nicholas D. Lane
FedML
75
15
0
05 Jul 2022
PoF: Post-Training of Feature Extractor for Improving Generalization
PoF: Post-Training of Feature Extractor for Improving Generalization
Ikuro Sato
Ryota Yamada
Masayuki Tanaka
Nakamasa Inoue
Rei Kawakami
39
4
0
05 Jul 2022
A Theoretical Analysis of the Learning Dynamics under Class Imbalance
A Theoretical Analysis of the Learning Dynamics under Class Imbalance
Emanuele Francazi
Marco Baity-Jesi
Aurelien Lucchi
100
18
0
01 Jul 2022
Improving Ensemble Distillation With Weight Averaging and Diversifying
  Perturbation
Improving Ensemble Distillation With Weight Averaging and Diversifying Perturbation
G. Nam
Hyungi Lee
Byeongho Heo
Juho Lee
UQCVFedML
62
7
0
30 Jun 2022
Modeling Teams Performance Using Deep Representational Learning on
  Graphs
Modeling Teams Performance Using Deep Representational Learning on Graphs
Francesco Carli
Pietro Foini
N. Gozzi
N. Perra
Rossano Schifanella
GNN
40
0
0
29 Jun 2022
Effective training-time stacking for ensembling of deep neural networks
Effective training-time stacking for ensembling of deep neural networks
P. Proskura
Alexey Zaytsev
34
7
0
27 Jun 2022
Woodscape Fisheye Object Detection for Autonomous Driving -- CVPR 2022
  OmniCV Workshop Challenge
Woodscape Fisheye Object Detection for Autonomous Driving -- CVPR 2022 OmniCV Workshop Challenge
Saravanabalagi Ramachandran
Ganesh Sistu
V. Kumar
J. McDonald
S. Yogamani
75
5
0
26 Jun 2022
Training Your Sparse Neural Network Better with Any Mask
Training Your Sparse Neural Network Better with Any Mask
Ajay Jaiswal
Haoyu Ma
Tianlong Chen
Ying Ding
Zhangyang Wang
CVBM
137
36
0
26 Jun 2022
HOPE: Hierarchical Spatial-temporal Network for Occupancy Flow
  Prediction
HOPE: Hierarchical Spatial-temporal Network for Occupancy Flow Prediction
Yi Hu
Wenxin Shao
Bo Jiang
Jiajie Chen
Siqi Chai
Zhening Yang
Jingyu Qian
Helong Zhou
Qiang Liu
AI4CE
76
14
0
21 Jun 2022
When Does Re-initialization Work?
When Does Re-initialization Work?
Sheheryar Zaidi
Tudor Berariu
Hyunjik Kim
J. Bornschein
Claudia Clopath
Yee Whye Teh
Razvan Pascanu
68
11
0
20 Jun 2022
Disentangling Model Multiplicity in Deep Learning
Disentangling Model Multiplicity in Deep Learning
Ari Heljakka
Martin Trapp
Arno Solin
Arno Solin
65
4
0
17 Jun 2022
Uncertainty-aware Evaluation of Time-Series Classification for Online
  Handwriting Recognition with Domain Shift
Uncertainty-aware Evaluation of Time-Series Classification for Online Handwriting Recognition with Domain Shift
Andreas Klass
Sven M. Lorenz
M. Lauer-Schmaltz
David Rügamer
Bernd Bischl
Christopher Mutschler
Felix Ott
77
10
0
17 Jun 2022
Previous
123...121314...192021
Next