v1v2v3 (latest)

Averaging Weights Leads to Wider Optima and Better Generalization

14 March 2018

Dmitry Vetrov

Papers citing "Averaging Weights Leads to Wider Optima and Better Generalization"

50 / 1,040 papers shown

Title
Dive into Deep Learning Aston Zhang Zachary Chase Lipton Mu Li Alexander J. Smola VLM 104 572 0 21 Jun 2021
Well-tuned Simple Nets Excel on Tabular Datasets Arlind Kadra Marius Lindauer Frank Hutter Josif Grabocka 68 201 0 21 Jun 2021
Multirate Training of Neural Networks Tiffany J. Vlaar Benedict Leimkuhler 55 4 0 20 Jun 2021
Humble Teachers Teach Better Students for Semi-Supervised Object Detection Yihe Tang Weifeng Chen Yijun Luo Yuting Zhang 89 186 0 19 Jun 2021
Effective Evaluation of Deep Active Learning on Image Classification Tasks Nathan Beck D. Sivasubramanian Apurva Dani Ganesh Ramakrishnan Rishabh K. Iyer VLM 78 39 0 16 Jun 2021
ScheduleNet: Learn to solve multi-agent scheduling problems with reinforcement learning Junyoung Park Sanjar Bakhtiyar Jinkyoo Park 70 39 0 06 Jun 2021
NODE-GAM: Neural Generalized Additive Model for Interpretable Deep Learning C. Chang R. Caruana Anna Goldenberg AI4CE 93 80 0 03 Jun 2021
SemEval-2021 Task 4: Reading Comprehension of Abstract Meaning Boyuan Zheng Xiaoyu Yang Yu-Ping Ruan Zhen-Hua Ling Quan Liu Si Wei Xiao-Dan Zhu ELM 44 13 0 31 May 2021
Informing Geometric Deep Learning with Electronic Interactions to Accelerate Quantum Chemistry Zhuoran Qiao Anders S. Christensen Matthew Welborn F. Manby Anima Anandkumar Thomas F. Miller 120 74 0 31 May 2021
Efficient and Accurate Gradients for Neural SDEs Patrick Kidger James Foster Xuechen Li Terry Lyons DiffM 113 66 0 27 May 2021
On Linear Stability of SGD and Input-Smoothness of Neural Networks Chao Ma Lexing Ying MLT 66 44 0 27 May 2021
Calibration and Uncertainty Quantification of Bayesian Convolutional Neural Networks for Geophysical Applications L. Mosser E. Naeini UQCV BDL 24 0 0 25 May 2021
Denoising Noisy Neural Networks: A Bayesian Approach with Compensation Yulin Shao Soung Chang Liew Deniz Gunduz 94 14 0 22 May 2021
Visual FUDGE: Form Understanding via Dynamic Graph Editing Brian L. Davis B. Morse Brian L. Price Chris Tensmeyer Curtis Wigington AI4CE 83 20 0 17 May 2021
Fast and Accurate Quantized Camera Scene Detection on Smartphones, Mobile AI 2021 Challenge: Report Andrey D. Ignatov Grigory Malivenko Radu Timofte Sheng Chen Xin Xia ... K. Lyda L. Khojoyan Abhishek Thanki Sayak Paul Shahid Siddiqui MQ 90 20 0 17 May 2021
Rethinking "Batch" in BatchNorm Yuxin Wu Justin Johnson BDL 123 66 0 17 May 2021
Advances in Multi-Variate Analysis Methods for New Physics Searches at the Large Hadron Collider A. Stakia T. Dorigo G. Banelli D. Bortoletto A. Casa ... G. Strong C. Tosciri J. Varela Pietro Vischia A. Weiler 31 3 0 16 May 2021
PingAn-VCGroup's Solution for ICDAR 2021 Competition on Scientific Table Image Recognition to Latex Yelin He Xianbiao Qi Jiaquan Ye Peng Gao Yihao Chen Bingcong Li Xin Tang Rong Xiao LMTD 50 11 0 05 May 2021
Russian News Clustering and Headline Selection Shared Task I. Gusev I. Smurov 43 7 0 03 May 2021
Self-supervised Augmentation Consistency for Adapting Semantic Segmentation Nikita Araslanov Stefan Roth 92 231 0 30 Apr 2021
Post-training deep neural network pruning via layer-wise calibration Ivan Lazarevich Alexander Kozlov Nikita Malinin 3DPC 80 27 0 30 Apr 2021
What Are Bayesian Neural Network Posteriors Really Like? Pavel Izmailov Sharad Vikram Matthew D. Hoffman A. Wilson UQCV BDL 81 389 0 29 Apr 2021
SelfReg: Self-supervised Contrastive Regularization for Domain Generalization Daehee Kim Seunghyun Park Jinkyu Kim Jaekoo Lee OOD SSL 137 273 0 20 Apr 2021
Rehearsal revealed: The limits and merits of revisiting samples in continual learning Eli Verwimp Matthias De Lange Tinne Tuytelaars CLL 59 108 0 15 Apr 2021
Span Pointer Networks for Non-Autoregressive Task-Oriented Semantic Parsing Akshat Shrivastava P. Chuang Arun Babu Shrey Desai Abhinav Arora Alexander Zotov Ahmed Aly 78 21 0 15 Apr 2021
Relating Adversarially Robust Generalization to Flat Minima David Stutz Matthias Hein Bernt Schiele OOD 105 67 0 09 Apr 2021
Predicting Inflation with Recurrent Neural Networks L. Paranhos AI4TS 71 6 0 08 Apr 2021
AST: Audio Spectrogram Transformer Yuan Gong Yu-An Chung James R. Glass ViT 206 887 0 05 Apr 2021
Adaptive Boosting for Domain Adaptation: Towards Robust Predictions in Scene Segmentation Zhedong Zheng Yi Yang 137 30 0 29 Mar 2021
Server Averaging for Federated Learning George Pu Yanlin Zhou D. Wu Xiaolin Li FedML 65 4 0 22 Mar 2021
Conversational Answer Generation and Factuality for Reading Comprehension Question-Answering Stanislav Peshterliev Barlas Oğuz Debojeet Chatterjee Hakan Inan Vikas Bhardwaj 39 4 0 11 Mar 2021
Why flatness does and does not correlate with generalization for deep neural networks Shuo Zhang Isaac Reid Guillermo Valle Pérez A. Louis 77 8 0 10 Mar 2021
MixMo: Mixing Multiple Inputs for Multiple Outputs via Deep Subnetworks Alexandre Ramé Rémy Sun Matthieu Cord UQCV 108 60 0 10 Mar 2021
Robustness to Pruning Predicts Generalization in Deep Neural Networks Lorenz Kuhn Clare Lyle Aidan Gomez Jonas Rothfuss Y. Gal 91 14 0 10 Mar 2021
Nondeterminism and Instability in Neural Network Optimization Cecilia Summers M. Dinneen 70 41 0 08 Mar 2021
Domain Generalization: A Survey Kaiyang Zhou Ziwei Liu Yu Qiao Tao Xiang Chen Change Loy OOD AI4CE 268 1,035 0 03 Mar 2021
Fixing Data Augmentation to Improve Adversarial Robustness Sylvestre-Alvise Rebuffi Sven Gowal D. A. Calian Florian Stimberg Olivia Wiles Timothy A. Mann AAML 121 276 0 02 Mar 2021
A Multiclass Boosting Framework for Achieving Fast and Provable Adversarial Robustness Jacob D. Abernethy Pranjal Awasthi Satyen Kale AAML 59 6 0 01 Mar 2021
A Survey on Deep Semi-supervised Learning Xiangli Yang Zixing Song Irwin King Zenglin Xu 114 594 0 28 Feb 2021
Consistent Sparse Deep Learning: Theory and Computation Y. Sun Qifan Song F. Liang BDL 85 30 0 25 Feb 2021
Loss Surface Simplexes for Mode Connecting Volumes and Fast Ensembling Gregory W. Benton Wesley J. Maddox Sanae Lotfi A. Wilson UQCV 126 70 0 25 Feb 2021
Provable Super-Convergence with a Large Cyclical Learning Rate Samet Oymak 64 12 0 22 Feb 2021
Learning Neural Network Subspaces Mitchell Wortsman Maxwell Horton Carlos Guestrin Ali Farhadi Mohammad Rastegari UQCV 105 88 0 20 Feb 2021
ISCL: Interdependent Self-Cooperative Learning for Unpaired Image Denoising Kanggeun Lee Won-Ki Jeong 56 36 0 19 Feb 2021
SWAD: Domain Generalization by Seeking Flat Minima Junbum Cha Sanghyuk Chun Kyungjae Lee Han-Cheol Cho Seunghyun Park Yunsung Lee Sungrae Park MoMe 311 460 0 17 Feb 2021
DEUP: Direct Epistemic Uncertainty Prediction Salem Lahlou Moksh Jain Hadi Nekoei V. Butoi Paul Bertin Jarrid Rector-Brooks Maksym Korablyov Yoshua Bengio PER UQLM UQCV UD 321 94 0 16 Feb 2021
Adversarially Robust Kernel Smoothing Jia-Jie Zhu Christina Kouridi Yassine Nemmour Bernhard Schölkopf 64 7 0 16 Feb 2021
Low Curvature Activations Reduce Overfitting in Adversarial Training Vasu Singla Sahil Singla David Jacobs Soheil Feizi AAML 102 47 0 15 Feb 2021
Consensus Control for Decentralized Deep Learning Lingjing Kong Tao R. Lin Anastasia Koloskova Martin Jaggi Sebastian U. Stich 53 79 0 09 Feb 2021
On the Reproducibility of Neural Network Predictions Srinadh Bhojanapalli Kimberly Wilber Andreas Veit A. S. Rawat Seungyeon Kim A. Menon Sanjiv Kumar 122 35 0 05 Feb 2021