v1v2v3 (latest)

Averaging Weights Leads to Wider Optima and Better Generalization

14 March 2018

Dmitry Vetrov

Papers citing "Averaging Weights Leads to Wider Optima and Better Generalization"

50 / 1,040 papers shown

Title
Wide-minima Density Hypothesis and the Explore-Exploit Learning Rate Schedule Nikhil Iyer V. Thejas Nipun Kwatra Ramachandran Ramjee Muthian Sivathanu 89 29 0 09 Mar 2020
Rethinking Parameter Counting in Deep Models: Effective Dimensionality Revisited Wesley J. Maddox Gregory W. Benton A. Wilson 136 61 0 04 Mar 2020
Iterative Averaging in the Quest for Best Test Error Diego Granziol Xingchen Wan Samuel Albanie Stephen J. Roberts 66 3 0 02 Mar 2020
Towards Robust and Reproducible Active Learning Using Neural Networks Prateek Munjal Nasir Hayat Munawar Hayat J. Sourati Shadab Khan UQCV 84 69 0 21 Feb 2020
Bayesian Deep Learning and a Probabilistic Perspective of Generalization A. Wilson Pavel Izmailov UQCV BDL OOD 175 658 0 20 Feb 2020
A survey on Semi-, Self- and Unsupervised Learning for Image Classification Lars Schmarje M. Santarossa Simon-Martin Schroder Reinhard Koch SSL VLM 98 165 0 20 Feb 2020
Pitfalls of In-Domain Uncertainty Estimation and Ensembling in Deep Learning Arsenii Ashukha Alexander Lyzhov Dmitry Molchanov Dmitry Vetrov UQCV FedML 108 320 0 15 Feb 2020
LaProp: Separating Momentum and Adaptivity in Adam Liu Ziyin Zhikang T.Wang Masahito Ueda ODL 70 18 0 12 Feb 2020
Renofeation: A Simple Transfer Learning Method for Improved Adversarial Robustness Ting-Wu Chin Cha Zhang Diana Marculescu AAML 26 1 0 07 Feb 2020
SQWA: Stochastic Quantized Weight Averaging for Improving the Generalization Capability of Low-Precision Deep Neural Networks Sungho Shin Yoonho Boo Wonyong Sung MQ 44 3 0 02 Feb 2020
The Case for Bayesian Deep Learning A. Wilson UQCV BDL OOD 132 114 0 29 Jan 2020
No Routing Needed Between Capsules Adam Byerly T. Kalganova I. Dear 131 67 0 24 Jan 2020
Curriculum Labeling: Revisiting Pseudo-Labeling for Semi-Supervised Learning Paola Cascante-Bonilla Fuwen Tan Yanjun Qi Vicente Ordonez ODL 109 23 0 16 Jan 2020
Stochastic Weight Averaging in Parallel: Large-Batch Training that Generalizes Well Vipul Gupta S. Serrano D. DeCoste MoMe 83 60 0 07 Jan 2020
Relative Flatness and Generalization Henning Petzka Michael Kamp Linara Adilova C. Sminchisescu Mario Boley 87 78 0 03 Jan 2020
Searching for Stage-wise Neural Graphs In the Limit Xiaoxia Zhou Dejing Dou Boyang Albert Li GNN 38 2 0 30 Dec 2019
Big Transfer (BiT): General Visual Representation Learning Alexander Kolesnikov Lucas Beyer Xiaohua Zhai J. Puigcerver Jessica Yung Sylvain Gelly N. Houlsby MQ 310 1,211 0 24 Dec 2019
TRADI: Tracking deep neural network weight distributions for uncertainty estimation Gianni Franchi Andrei Bursuc Emanuel Aldea Séverine Dubuisson Isabelle Bloch UQCV 156 52 0 24 Dec 2019
Robustness of Brain Tumor Segmentation Sabine Müller Joachim Weickert N. Graf AAML OOD 62 5 0 24 Dec 2019
Learning to Impute: A General Framework for Semi-supervised Learning Wei-Hong Li Chuan-Sheng Foo Hakan Bilen SSL 73 10 0 22 Dec 2019
Deep Curvature Suite Diego Granziol Xingchen Wan T. Garipov 3DV 50 12 0 20 Dec 2019
Deep Ensembles: A Loss Landscape Perspective Stanislav Fort Huiyi Hu Balaji Lakshminarayanan OOD UQCV 160 631 0 05 Dec 2019
On the Validity of Bayesian Neural Networks for Uncertainty Estimation John Mitros Brian Mac Namee UQCV BDL 98 30 0 03 Dec 2019
Semi-Supervised Learning for Text Classification by Layer Partitioning Alexander Hanbo Li A. Sethy 40 12 0 26 Nov 2019
Orderless Recurrent Models for Multi-label Classification V. O. Yazici Abel Gonzalez-Garcia Arnau Ramisa Bartlomiej Twardowski Joost van de Weijer SSL 115 93 0 22 Nov 2019
AssemblyNet: A large ensemble of CNNs for 3D Whole Brain MRI Segmentation Pierrick Coupé Boris Mansencal Michael Clement Rémi Giraud B. D. D. Senneville T. Thong Vincent Lepetit J. V. Manjón 137 118 0 20 Nov 2019
Information-Theoretic Local Minima Characterization and Regularization Zhiwei Jia Hao Su 80 19 0 19 Nov 2019
Disentangle, align and fuse for multimodal and semi-supervised image segmentation A. Chartsias G. Papanastasiou Chengjia Wang S. Semple D. Newby R. Dharmakumar Sotirios A. Tsaftaris 64 13 0 11 Nov 2019
DC-S3GD: Delay-Compensated Stale-Synchronous SGD for Large-Scale Decentralized Neural Network Training Alessandro Rigazzi 33 5 0 06 Nov 2019
Eye Semantic Segmentation with a Lightweight Model V. Huynh Soohyung Kim Gueesang Lee Hyung-Jeong Yang VLM 3DV 67 17 0 04 Nov 2019
Emotion and Theme Recognition in Music with Frequency-Aware RF-Regularized CNNs Khaled Koutini Shreyan Chowdhury Verena Haunschmid Hamid Eghbalzadeh Gerhard Widmer 99 18 0 28 Oct 2019
Segmenting Ships in Satellite Imagery With Squeeze and Excitation U-Net R. Venkatesh Anand Metha SSeg 119 4 0 27 Oct 2019
Self-Correction for Human Parsing Peike Li Yunqiu Xu Yunchao Wei Yezhou Yang 101 339 0 22 Oct 2019
Compact Network Training for Person ReID Hussam Lawen Avi Ben-Cohen M. Protter Itamar Friedman Lihi Zelnik-Manor VOT 72 10 0 15 Oct 2019
BoTorch: A Framework for Efficient Monte-Carlo Bayesian Optimization Maximilian Balandat Brian Karrer Daniel R. Jiang Sam Daulton Benjamin Letham A. Wilson E. Bakshy 75 93 0 14 Oct 2019
Towards Understanding the Transferability of Deep Representations Hong Liu Mingsheng Long Jianmin Wang Michael I. Jordan 66 26 0 26 Sep 2019
On Model Stability as a Function of Random Seed Pranava Madhyastha Dhruv Batra 104 63 0 23 Sep 2019
Object Segmentation using Pixel-wise Adversarial Loss Ricard Durall Franz-Josef Pfreundt Ullrich Kothe J. Keuper GAN SSeg 41 2 0 23 Sep 2019
Neural Oblivious Decision Ensembles for Deep Learning on Tabular Data Sergei Popov S. Morozov Artem Babenko LMTD 172 319 0 13 Sep 2019
Dual Student: Breaking the Limits of the Teacher in Semi-supervised Learning Zhanghan Ke Daoye Wang Qiong Yan Jimmy S. J. Ren Rynson W. H. Lau 78 216 0 03 Sep 2019
Nuclear Instance Segmentation using a Proposal-Free Spatially Aware Deep Learning Framework Navid Alemi Koohbanani Mostafa Jahanifar A. Gooya Nasir M. Rajpoot SSeg 52 37 0 27 Aug 2019
Visualizing and Understanding the Effectiveness of BERT Y. Hao Li Dong Furu Wei Ke Xu 150 186 0 15 Aug 2019
Pseudo-Labeling and Confirmation Bias in Deep Semi-Supervised Learning Eric Arazo Diego Ortego Paul Albert Noel E. O'Connor Kevin McGuinness 138 848 0 08 Aug 2019
Sound source detection, localization and classification using consecutive ensemble of CRNN models Slawomir Kapka M. Lewandowski 122 66 0 02 Aug 2019
A Hybrid Neural Network Model for Commonsense Reasoning Pengcheng He Xiaodong Liu Weizhu Chen Jianfeng Gao LRM 75 29 0 27 Jul 2019
Lookahead Optimizer: k steps forward, 1 step back Michael Ruogu Zhang James Lucas Geoffrey E. Hinton Jimmy Ba ODL 168 736 0 19 Jul 2019
Subspace Inference for Bayesian Deep Learning Pavel Izmailov Wesley J. Maddox Polina Kirichenko T. Garipov Dmitry Vetrov A. Wilson UQCV BDL 97 144 0 17 Jul 2019
Confidence Calibration for Convolutional Neural Networks Using Structured Dropout Zhilu Zhang Adrian Dalca M. Sabuncu UQCV BDL 72 48 0 23 Jun 2019
Homogeneous Vector Capsules Enable Adaptive Gradient Descent in Convolutional Neural Networks Adam Byerly T. Kalganova 61 13 0 20 Jun 2019
Modeling the Dynamics of PDE Systems with Physics-Constrained Deep Auto-Regressive Networks N. Geneva N. Zabaras AI4CE 75 276 0 13 Jun 2019