ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.05407
  4. Cited By
Averaging Weights Leads to Wider Optima and Better Generalization
v1v2v3 (latest)

Averaging Weights Leads to Wider Optima and Better Generalization

14 March 2018
Pavel Izmailov
Dmitrii Podoprikhin
T. Garipov
Dmitry Vetrov
A. Wilson
    FedMLMoMe
ArXiv (abs)PDFHTML

Papers citing "Averaging Weights Leads to Wider Optima and Better Generalization"

40 / 1,040 papers shown
Title
Large Scale Structure of Neural Network Loss Landscapes
Large Scale Structure of Neural Network Loss Landscapes
Stanislav Fort
Stanislaw Jastrzebski
74
84
0
11 Jun 2019
Understanding Generalization through Visualizations
Understanding Generalization through Visualizations
Wenjie Huang
Z. Emam
Micah Goldblum
Liam H. Fowl
J. K. Terry
Furong Huang
Tom Goldstein
AI4CE
51
80
0
07 Jun 2019
AssemblyNet: A Novel Deep Decision-Making Process for Whole Brain MRI
  Segmentation
AssemblyNet: A Novel Deep Decision-Making Process for Whole Brain MRI Segmentation
Pierrick Coupé
Boris Mansencal
Michael Clement
Rémi Giraud
B. D. D. Senneville
T. Thong
Vincent Lepetit
J. V. Manjón
114
17
0
05 Jun 2019
Modeling Uncertainty by Learning a Hierarchy of Deep Neural Connections
Modeling Uncertainty by Learning a Hierarchy of Deep Neural Connections
R. Y. Rohekar
Yaniv Gurwicz
Shami Nisimov
Gal Novik
BDLUQCV
127
13
0
30 May 2019
Leader Stochastic Gradient Descent for Distributed Training of Deep
  Learning Models: Extension
Leader Stochastic Gradient Descent for Distributed Training of Deep Learning Models: Extension
Yunfei Teng
Wenbo Gao
F. Chalus
A. Choromańska
Shiqian Ma
Adrian Weller
134
12
0
24 May 2019
Countering Noisy Labels By Learning From Auxiliary Clean Labels
Countering Noisy Labels By Learning From Auxiliary Clean Labels
Tsung Wei Tsai
Chongxuan Li
Jun Zhu
SSL
19
1
0
23 May 2019
Ensemble Model Patching: A Parameter-Efficient Variational Bayesian
  Neural Network
Ensemble Model Patching: A Parameter-Efficient Variational Bayesian Neural Network
Oscar Chang
Yuling Yao
David Williams-King
Hod Lipson
BDLUQCV
71
8
0
23 May 2019
ROI Regularization for Semi-supervised and Supervised Learning
ROI Regularization for Semi-supervised and Supervised Learning
H. Kaizuka
Yasuhiro Nagasaki
R. Sako
23
1
0
15 May 2019
Improving Model Training by Periodic Sampling over Weight Distributions
Improving Model Training by Periodic Sampling over Weight Distributions
S. Tripathi
Jiayi Liu
Unmesh Kurup
Mohak Shah
Sauptik Dhar
35
0
0
14 May 2019
Breast Tumor Cellularity Assessment using Deep Neural Networks
Breast Tumor Cellularity Assessment using Deep Neural Networks
Alexander Rakhlin
A. Tiulpin
Alexey A. Shvets
Alexandr A Kalinin
V. Iglovikov
Sergey I. Nikolenko
60
20
0
05 May 2019
SWALP : Stochastic Weight Averaging in Low-Precision Training
SWALP : Stochastic Weight Averaging in Low-Precision Training
Guandao Yang
Tianyi Zhang
Polina Kirichenko
Junwen Bai
A. Wilson
Christopher De Sa
85
97
0
26 Apr 2019
A neural network-based framework for financial model calibration
A neural network-based framework for financial model calibration
Shuaiqiang Liu
Anastasia Borovykh
L. Grzelak
C. Oosterlee
82
104
0
23 Apr 2019
UG$^{2+}$ Track 2: A Collective Benchmark Effort for Evaluating and
  Advancing Image Understanding in Poor Visibility Environments
UG2+^{2+}2+ Track 2: A Collective Benchmark Effort for Evaluating and Advancing Image Understanding in Poor Visibility Environments
Ye Yuan
Wenhan Yang
Wenqi Ren
Xin Liu
Cheng Chi
Haiquan Wang
3DV
147
238
0
09 Apr 2019
Parabolic Approximation Line Search for DNNs
Parabolic Approximation Line Search for DNNs
Max Mutschler
A. Zell
ODL
87
20
0
28 Mar 2019
Accelerating Self-Play Learning in Go
Accelerating Self-Play Learning in Go
David J. Wu
103
96
0
27 Feb 2019
Cyclical Stochastic Gradient MCMC for Bayesian Deep Learning
Cyclical Stochastic Gradient MCMC for Bayesian Deep Learning
Ruqi Zhang
Chunyuan Li
Jianyi Zhang
Changyou Chen
A. Wilson
BDL
88
278
0
11 Feb 2019
A Simple Baseline for Bayesian Uncertainty in Deep Learning
A Simple Baseline for Bayesian Uncertainty in Deep Learning
Wesley J. Maddox
T. Garipov
Pavel Izmailov
Dmitry Vetrov
A. Wilson
BDLUQCV
150
810
0
07 Feb 2019
Asymmetric Valleys: Beyond Sharp and Flat Local Minima
Asymmetric Valleys: Beyond Sharp and Flat Local Minima
Haowei He
Gao Huang
Yang Yuan
ODLMLT
84
150
0
02 Feb 2019
Hamiltonian Monte-Carlo for Orthogonal Matrices
Hamiltonian Monte-Carlo for Orthogonal Matrices
V. Yanush
D. Kropotov
30
1
0
23 Jan 2019
Certainty Driven Consistency Loss on Multi-Teacher Networks for
  Semi-Supervised Learning
Certainty Driven Consistency Loss on Multi-Teacher Networks for Semi-Supervised Learning
Lu Liu
R. Tan
82
32
0
17 Jan 2019
Normalized Flat Minima: Exploring Scale Invariant Definition of Flat
  Minima for Neural Networks using PAC-Bayesian Analysis
Normalized Flat Minima: Exploring Scale Invariant Definition of Flat Minima for Neural Networks using PAC-Bayesian Analysis
Yusuke Tsuzuku
Issei Sato
Masashi Sugiyama
84
77
0
15 Jan 2019
A Survey of Unsupervised Deep Domain Adaptation
A Survey of Unsupervised Deep Domain Adaptation
Garrett Wilson
D. Cook
OOD
195
824
0
06 Dec 2018
Projected BNNs: Avoiding weight-space pathologies by learning latent
  representations of neural network weights
Projected BNNs: Avoiding weight-space pathologies by learning latent representations of neural network weights
Melanie F. Pradier
Weiwei Pan
Jiayu Yao
S. Ghosh
Finale Doshi-velez
UQCVBDL
63
10
0
16 Nov 2018
AttentionXML: Label Tree-based Attention-Aware Deep Model for
  High-Performance Extreme Multi-Label Text Classification
AttentionXML: Label Tree-based Attention-Aware Deep Model for High-Performance Extreme Multi-Label Text Classification
Rafael M. O. Cruz
Zihan Zhang
R. Sabourin
Suyang Dai
Hiroshi Mamitsuka
Shanfeng Zhu
VLM
100
253
0
01 Nov 2018
A Closer Look at Deep Learning Heuristics: Learning rate restarts,
  Warmup and Distillation
A Closer Look at Deep Learning Heuristics: Learning rate restarts, Warmup and Distillation
Akhilesh Deepak Gotmare
N. Keskar
Caiming Xiong
R. Socher
ODL
105
277
0
29 Oct 2018
Collaborative Deep Learning Across Multiple Data Centers
Collaborative Deep Learning Across Multiple Data Centers
Kele Xu
Haibo Mi
Dawei Feng
Huaimin Wang
Chuan Chen
Zibin Zheng
Xu Lan
FedML
344
18
0
16 Oct 2018
DeepCMB: Lensing Reconstruction of the Cosmic Microwave Background with
  Deep Neural Networks
DeepCMB: Lensing Reconstruction of the Cosmic Microwave Background with Deep Neural Networks
J. Caldeira
W. L. K. Wu
Brian D. Nord
Camille Avestruz
Shubhendu Trivedi
K. Story
101
66
0
02 Oct 2018
Non-local NetVLAD Encoding for Video Classification
Non-local NetVLAD Encoding for Video Classification
Yongyi Tang
Xing Zhang
Jingwen Wang
Shaoxiang Chen
Lin Ma
Yu-Gang Jiang
64
41
0
29 Sep 2018
GPyTorch: Blackbox Matrix-Matrix Gaussian Process Inference with GPU
  Acceleration
GPyTorch: Blackbox Matrix-Matrix Gaussian Process Inference with GPU Acceleration
Jacob R. Gardner
Geoff Pleiss
D. Bindel
Kilian Q. Weinberger
A. Wilson
GP
151
1,106
0
28 Sep 2018
Discovering Low-Precision Networks Close to Full-Precision Networks for
  Efficient Embedded Inference
Discovering Low-Precision Networks Close to Full-Precision Networks for Efficient Embedded Inference
J. McKinstry
S. K. Esser
R. Appuswamy
Deepika Bablani
John V. Arthur
Izzet B. Yildiz
D. Modha
MQ
66
94
0
11 Sep 2018
A Survey of Modern Object Detection Literature using Deep Learning
A Survey of Modern Object Detection Literature using Deep Learning
K. Chahal
Kuntal Dey
ObjD
48
36
0
22 Aug 2018
Don't Use Large Mini-Batches, Use Local SGD
Don't Use Large Mini-Batches, Use Local SGD
Tao R. Lin
Sebastian U. Stich
Kumar Kshitij Patel
Martin Jaggi
123
432
0
22 Aug 2018
Make (Nearly) Every Neural Network Better: Generating Neural Network
  Ensembles by Weight Parameter Resampling
Make (Nearly) Every Neural Network Better: Generating Neural Network Ensembles by Weight Parameter Resampling
Jiayi Liu
S. Tripathi
Unmesh Kurup
Mohak Shah
UQCV
20
4
0
02 Jul 2018
Using Mode Connectivity for Loss Landscape Analysis
Using Mode Connectivity for Loss Landscape Analysis
Akhilesh Deepak Gotmare
N. Keskar
Caiming Xiong
R. Socher
71
28
0
18 Jun 2018
There Are Many Consistent Explanations of Unlabeled Data: Why You Should
  Average
There Are Many Consistent Explanations of Unlabeled Data: Why You Should Average
Ben Athiwaratkun
Marc Finzi
Pavel Izmailov
A. Wilson
281
244
0
14 Jun 2018
The Unusual Effectiveness of Averaging in GAN Training
The Unusual Effectiveness of Averaging in GAN Training
Yasin Yazici
Chuan-Sheng Foo
Stefan Winkler
Kim-Hui Yap
Georgios Piliouras
V. Chandrasekhar
132
175
0
12 Jun 2018
Online Regularized Nonlinear Acceleration
Online Regularized Nonlinear Acceleration
Damien Scieur
Edouard Oyallon
Alexandre d’Aspremont
Francis R. Bach
31
13
0
24 May 2018
Bias-Reduced Uncertainty Estimation for Deep Neural Classifiers
Bias-Reduced Uncertainty Estimation for Deep Neural Classifiers
Yonatan Geifman
Guy Uziel
Ran El-Yaniv
UQCV
73
142
0
21 May 2018
SmoothOut: Smoothing Out Sharp Minima to Improve Generalization in Deep
  Learning
SmoothOut: Smoothing Out Sharp Minima to Improve Generalization in Deep Learning
W. Wen
Yandan Wang
Feng Yan
Cong Xu
Chunpeng Wu
Yiran Chen
H. Li
79
51
0
21 May 2018
Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs
Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs
T. Garipov
Pavel Izmailov
Dmitrii Podoprikhin
Dmitry Vetrov
A. Wilson
UQCV
132
758
0
27 Feb 2018
Previous
123...192021