ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.05407
  4. Cited By
Averaging Weights Leads to Wider Optima and Better Generalization
v1v2v3 (latest)

Averaging Weights Leads to Wider Optima and Better Generalization

14 March 2018
Pavel Izmailov
Dmitrii Podoprikhin
T. Garipov
Dmitry Vetrov
A. Wilson
    FedMLMoMe
ArXiv (abs)PDFHTML

Papers citing "Averaging Weights Leads to Wider Optima and Better Generalization"

50 / 1,040 papers shown
Title
Wide-minima Density Hypothesis and the Explore-Exploit Learning Rate
  Schedule
Wide-minima Density Hypothesis and the Explore-Exploit Learning Rate Schedule
Nikhil Iyer
V. Thejas
Nipun Kwatra
Ramachandran Ramjee
Muthian Sivathanu
89
29
0
09 Mar 2020
Rethinking Parameter Counting in Deep Models: Effective Dimensionality
  Revisited
Rethinking Parameter Counting in Deep Models: Effective Dimensionality Revisited
Wesley J. Maddox
Gregory W. Benton
A. Wilson
136
61
0
04 Mar 2020
Iterative Averaging in the Quest for Best Test Error
Iterative Averaging in the Quest for Best Test Error
Diego Granziol
Xingchen Wan
Samuel Albanie
Stephen J. Roberts
66
3
0
02 Mar 2020
Towards Robust and Reproducible Active Learning Using Neural Networks
Towards Robust and Reproducible Active Learning Using Neural Networks
Prateek Munjal
Nasir Hayat
Munawar Hayat
J. Sourati
Shadab Khan
UQCV
84
69
0
21 Feb 2020
Bayesian Deep Learning and a Probabilistic Perspective of Generalization
Bayesian Deep Learning and a Probabilistic Perspective of Generalization
A. Wilson
Pavel Izmailov
UQCVBDLOOD
175
658
0
20 Feb 2020
A survey on Semi-, Self- and Unsupervised Learning for Image
  Classification
A survey on Semi-, Self- and Unsupervised Learning for Image Classification
Lars Schmarje
M. Santarossa
Simon-Martin Schroder
Reinhard Koch
SSLVLM
98
165
0
20 Feb 2020
Pitfalls of In-Domain Uncertainty Estimation and Ensembling in Deep
  Learning
Pitfalls of In-Domain Uncertainty Estimation and Ensembling in Deep Learning
Arsenii Ashukha
Alexander Lyzhov
Dmitry Molchanov
Dmitry Vetrov
UQCVFedML
108
320
0
15 Feb 2020
LaProp: Separating Momentum and Adaptivity in Adam
LaProp: Separating Momentum and Adaptivity in Adam
Liu Ziyin
Zhikang T.Wang
Masahito Ueda
ODL
70
18
0
12 Feb 2020
Renofeation: A Simple Transfer Learning Method for Improved Adversarial
  Robustness
Renofeation: A Simple Transfer Learning Method for Improved Adversarial Robustness
Ting-Wu Chin
Cha Zhang
Diana Marculescu
AAML
26
1
0
07 Feb 2020
SQWA: Stochastic Quantized Weight Averaging for Improving the
  Generalization Capability of Low-Precision Deep Neural Networks
SQWA: Stochastic Quantized Weight Averaging for Improving the Generalization Capability of Low-Precision Deep Neural Networks
Sungho Shin
Yoonho Boo
Wonyong Sung
MQ
44
3
0
02 Feb 2020
The Case for Bayesian Deep Learning
The Case for Bayesian Deep Learning
A. Wilson
UQCVBDLOOD
132
114
0
29 Jan 2020
No Routing Needed Between Capsules
No Routing Needed Between Capsules
Adam Byerly
T. Kalganova
I. Dear
131
67
0
24 Jan 2020
Curriculum Labeling: Revisiting Pseudo-Labeling for Semi-Supervised
  Learning
Curriculum Labeling: Revisiting Pseudo-Labeling for Semi-Supervised Learning
Paola Cascante-Bonilla
Fuwen Tan
Yanjun Qi
Vicente Ordonez
ODL
109
23
0
16 Jan 2020
Stochastic Weight Averaging in Parallel: Large-Batch Training that
  Generalizes Well
Stochastic Weight Averaging in Parallel: Large-Batch Training that Generalizes Well
Vipul Gupta
S. Serrano
D. DeCoste
MoMe
83
60
0
07 Jan 2020
Relative Flatness and Generalization
Relative Flatness and Generalization
Henning Petzka
Michael Kamp
Linara Adilova
C. Sminchisescu
Mario Boley
87
78
0
03 Jan 2020
Searching for Stage-wise Neural Graphs In the Limit
Searching for Stage-wise Neural Graphs In the Limit
Xiaoxia Zhou
Dejing Dou
Boyang Albert Li
GNN
38
2
0
30 Dec 2019
Big Transfer (BiT): General Visual Representation Learning
Big Transfer (BiT): General Visual Representation Learning
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
J. Puigcerver
Jessica Yung
Sylvain Gelly
N. Houlsby
MQ
310
1,211
0
24 Dec 2019
TRADI: Tracking deep neural network weight distributions for uncertainty
  estimation
TRADI: Tracking deep neural network weight distributions for uncertainty estimation
Gianni Franchi
Andrei Bursuc
Emanuel Aldea
Séverine Dubuisson
Isabelle Bloch
UQCV
156
52
0
24 Dec 2019
Robustness of Brain Tumor Segmentation
Robustness of Brain Tumor Segmentation
Sabine Müller
Joachim Weickert
N. Graf
AAMLOOD
62
5
0
24 Dec 2019
Learning to Impute: A General Framework for Semi-supervised Learning
Learning to Impute: A General Framework for Semi-supervised Learning
Wei-Hong Li
Chuan-Sheng Foo
Hakan Bilen
SSL
73
10
0
22 Dec 2019
Deep Curvature Suite
Deep Curvature Suite
Diego Granziol
Xingchen Wan
T. Garipov
3DV
50
12
0
20 Dec 2019
Deep Ensembles: A Loss Landscape Perspective
Deep Ensembles: A Loss Landscape Perspective
Stanislav Fort
Huiyi Hu
Balaji Lakshminarayanan
OODUQCV
160
631
0
05 Dec 2019
On the Validity of Bayesian Neural Networks for Uncertainty Estimation
On the Validity of Bayesian Neural Networks for Uncertainty Estimation
John Mitros
Brian Mac Namee
UQCVBDL
98
30
0
03 Dec 2019
Semi-Supervised Learning for Text Classification by Layer Partitioning
Semi-Supervised Learning for Text Classification by Layer Partitioning
Alexander Hanbo Li
A. Sethy
40
12
0
26 Nov 2019
Orderless Recurrent Models for Multi-label Classification
Orderless Recurrent Models for Multi-label Classification
V. O. Yazici
Abel Gonzalez-Garcia
Arnau Ramisa
Bartlomiej Twardowski
Joost van de Weijer
SSL
115
93
0
22 Nov 2019
AssemblyNet: A large ensemble of CNNs for 3D Whole Brain MRI
  Segmentation
AssemblyNet: A large ensemble of CNNs for 3D Whole Brain MRI Segmentation
Pierrick Coupé
Boris Mansencal
Michael Clement
Rémi Giraud
B. D. D. Senneville
T. Thong
Vincent Lepetit
J. V. Manjón
137
118
0
20 Nov 2019
Information-Theoretic Local Minima Characterization and Regularization
Information-Theoretic Local Minima Characterization and Regularization
Zhiwei Jia
Hao Su
80
19
0
19 Nov 2019
Disentangle, align and fuse for multimodal and semi-supervised image
  segmentation
Disentangle, align and fuse for multimodal and semi-supervised image segmentation
A. Chartsias
G. Papanastasiou
Chengjia Wang
S. Semple
D. Newby
R. Dharmakumar
Sotirios A. Tsaftaris
64
13
0
11 Nov 2019
DC-S3GD: Delay-Compensated Stale-Synchronous SGD for Large-Scale
  Decentralized Neural Network Training
DC-S3GD: Delay-Compensated Stale-Synchronous SGD for Large-Scale Decentralized Neural Network Training
Alessandro Rigazzi
33
5
0
06 Nov 2019
Eye Semantic Segmentation with a Lightweight Model
Eye Semantic Segmentation with a Lightweight Model
V. Huynh
Soohyung Kim
Gueesang Lee
Hyung-Jeong Yang
VLM3DV
67
17
0
04 Nov 2019
Emotion and Theme Recognition in Music with Frequency-Aware
  RF-Regularized CNNs
Emotion and Theme Recognition in Music with Frequency-Aware RF-Regularized CNNs
Khaled Koutini
Shreyan Chowdhury
Verena Haunschmid
Hamid Eghbalzadeh
Gerhard Widmer
99
18
0
28 Oct 2019
Segmenting Ships in Satellite Imagery With Squeeze and Excitation U-Net
Segmenting Ships in Satellite Imagery With Squeeze and Excitation U-Net
R. Venkatesh
Anand Metha
SSeg
119
4
0
27 Oct 2019
Self-Correction for Human Parsing
Self-Correction for Human Parsing
Peike Li
Yunqiu Xu
Yunchao Wei
Yezhou Yang
101
339
0
22 Oct 2019
Compact Network Training for Person ReID
Compact Network Training for Person ReID
Hussam Lawen
Avi Ben-Cohen
M. Protter
Itamar Friedman
Lihi Zelnik-Manor
VOT
72
10
0
15 Oct 2019
BoTorch: A Framework for Efficient Monte-Carlo Bayesian Optimization
BoTorch: A Framework for Efficient Monte-Carlo Bayesian Optimization
Maximilian Balandat
Brian Karrer
Daniel R. Jiang
Sam Daulton
Benjamin Letham
A. Wilson
E. Bakshy
75
93
0
14 Oct 2019
Towards Understanding the Transferability of Deep Representations
Towards Understanding the Transferability of Deep Representations
Hong Liu
Mingsheng Long
Jianmin Wang
Michael I. Jordan
66
26
0
26 Sep 2019
On Model Stability as a Function of Random Seed
On Model Stability as a Function of Random Seed
Pranava Madhyastha
Dhruv Batra
104
63
0
23 Sep 2019
Object Segmentation using Pixel-wise Adversarial Loss
Object Segmentation using Pixel-wise Adversarial Loss
Ricard Durall
Franz-Josef Pfreundt
Ullrich Kothe
J. Keuper
GANSSeg
41
2
0
23 Sep 2019
Neural Oblivious Decision Ensembles for Deep Learning on Tabular Data
Neural Oblivious Decision Ensembles for Deep Learning on Tabular Data
Sergei Popov
S. Morozov
Artem Babenko
LMTD
172
319
0
13 Sep 2019
Dual Student: Breaking the Limits of the Teacher in Semi-supervised
  Learning
Dual Student: Breaking the Limits of the Teacher in Semi-supervised Learning
Zhanghan Ke
Daoye Wang
Qiong Yan
Jimmy S. J. Ren
Rynson W. H. Lau
78
216
0
03 Sep 2019
Nuclear Instance Segmentation using a Proposal-Free Spatially Aware Deep
  Learning Framework
Nuclear Instance Segmentation using a Proposal-Free Spatially Aware Deep Learning Framework
Navid Alemi Koohbanani
Mostafa Jahanifar
A. Gooya
Nasir M. Rajpoot
SSeg
52
37
0
27 Aug 2019
Visualizing and Understanding the Effectiveness of BERT
Visualizing and Understanding the Effectiveness of BERT
Y. Hao
Li Dong
Furu Wei
Ke Xu
150
186
0
15 Aug 2019
Pseudo-Labeling and Confirmation Bias in Deep Semi-Supervised Learning
Pseudo-Labeling and Confirmation Bias in Deep Semi-Supervised Learning
Eric Arazo
Diego Ortego
Paul Albert
Noel E. O'Connor
Kevin McGuinness
138
848
0
08 Aug 2019
Sound source detection, localization and classification using
  consecutive ensemble of CRNN models
Sound source detection, localization and classification using consecutive ensemble of CRNN models
Slawomir Kapka
M. Lewandowski
122
66
0
02 Aug 2019
A Hybrid Neural Network Model for Commonsense Reasoning
A Hybrid Neural Network Model for Commonsense Reasoning
Pengcheng He
Xiaodong Liu
Weizhu Chen
Jianfeng Gao
LRM
75
29
0
27 Jul 2019
Lookahead Optimizer: k steps forward, 1 step back
Lookahead Optimizer: k steps forward, 1 step back
Michael Ruogu Zhang
James Lucas
Geoffrey E. Hinton
Jimmy Ba
ODL
168
736
0
19 Jul 2019
Subspace Inference for Bayesian Deep Learning
Subspace Inference for Bayesian Deep Learning
Pavel Izmailov
Wesley J. Maddox
Polina Kirichenko
T. Garipov
Dmitry Vetrov
A. Wilson
UQCVBDL
97
144
0
17 Jul 2019
Confidence Calibration for Convolutional Neural Networks Using
  Structured Dropout
Confidence Calibration for Convolutional Neural Networks Using Structured Dropout
Zhilu Zhang
Adrian Dalca
M. Sabuncu
UQCVBDL
72
48
0
23 Jun 2019
Homogeneous Vector Capsules Enable Adaptive Gradient Descent in
  Convolutional Neural Networks
Homogeneous Vector Capsules Enable Adaptive Gradient Descent in Convolutional Neural Networks
Adam Byerly
T. Kalganova
61
13
0
20 Jun 2019
Modeling the Dynamics of PDE Systems with Physics-Constrained Deep
  Auto-Regressive Networks
Modeling the Dynamics of PDE Systems with Physics-Constrained Deep Auto-Regressive Networks
N. Geneva
N. Zabaras
AI4CE
75
276
0
13 Jun 2019
Previous
123...192021
Next