v1v2v3 (latest)

Averaging Weights Leads to Wider Optima and Better Generalization

14 March 2018

Dmitry Vetrov

Papers citing "Averaging Weights Leads to Wider Optima and Better Generalization"

50 / 1,040 papers shown

Title
A Comprehensive Study on Robustness of Image Classification Models: Benchmarking and Rethinking Chang-Shu Liu Yinpeng Dong Wenzhao Xiang Xiaohu Yang Hang Su Junyi Zhu YueFeng Chen Yuan He H. Xue Shibao Zheng OOD VLM AAML 115 85 0 28 Feb 2023
Analyzing Populations of Neural Networks via Dynamical Model Embedding Jordan S. Cotler Kai Sheng Tai Felipe Hernández Blake Elias David Sussillo 100 4 0 27 Feb 2023
Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights? Ruisi Cai Zhenyu Zhang Zhangyang Wang AAML OOD 91 12 0 24 Feb 2023
Domain Generalisation via Domain Adaptation: An Adversarial Fourier Amplitude Approach Minyoung Kim Da Li Timothy M. Hospedales OOD 54 11 0 23 Feb 2023
Personalized Privacy-Preserving Framework for Cross-Silo Federated Learning Van Tuan Tran Huy Hieu Pham Kok-Seng Wong FedML 98 8 0 22 Feb 2023
Seasoning Model Soups for Robustness to Adversarial and Natural Distribution Shifts Francesco Croce Sylvestre-Alvise Rebuffi Evan Shelhamer Sven Gowal AAML 79 18 0 20 Feb 2023
Why is parameter averaging beneficial in SGD? An objective smoothing perspective Atsushi Nitanda Ryuhei Kikuchi Shugo Maeda Denny Wu FedML 51 0 0 18 Feb 2023
Calibrating the Rigged Lottery: Making All Tickets Reliable Bowen Lei Ruqi Zhang Dongkuan Xu Bani Mallick UQCV 111 7 0 18 Feb 2023
Scalable Bayesian optimization with high-dimensional outputs using randomized prior networks Mohamed Aziz Bhouri M. Joly Robert Yu S. Sarkar P. Perdikaris BDL UQCV AI4CE 77 1 0 14 Feb 2023
A Modern Look at the Relationship between Sharpness and Generalization Maksym Andriushchenko Francesco Croce Maximilian Müller Matthias Hein Nicolas Flammarion 3DH 138 63 0 14 Feb 2023
Revisiting Weighted Aggregation in Federated Learning with Neural Networks Zexi Li Tao R. Lin Xinyi Shang Chao-Xiang Wu FedML 102 65 0 14 Feb 2023
FilFL: Client Filtering for Optimized Client Participation in Federated Learning Fares Fourati Salma Kharrat Vaneet Aggarwal Mohamed-Slim Alouini Marco Canini FedML 75 4 0 13 Feb 2023
Contour-based Interactive Segmentation Danil Galeev Polina Popenova Anna Vorontsova Anton Konushin 84 5 0 13 Feb 2023
Autoselection of the Ensemble of Convolutional Neural Networks with Second-Order Cone Programming Buse Çisil Güldoğuş Abdullah Nazhat Abdullah Muhammad Ammar Ali Süreyya Özögür-Akyüz 73 0 0 12 Feb 2023
Sparse Mutation Decompositions: Fine Tuning Deep Neural Networks with Subspace Evolution Tim Whitaker L. D. Whitley 57 0 0 12 Feb 2023
Data efficiency and extrapolation trends in neural network interatomic potentials Joshua A Vita Daniel Schwalbe-Koda 73 17 0 12 Feb 2023
Making Substitute Models More Bayesian Can Enhance Transferability of Adversarial Examples Qizhang Li Yiwen Guo W. Zuo Hao Chen AAML 125 37 0 10 Feb 2023
Toward Degree Bias in Embedding-Based Knowledge Graph Completion Harry Shomer Wei Jin Wentao Wang Jiliang Tang 47 25 0 10 Feb 2023
Better Diffusion Models Further Improve Adversarial Training Zekai Wang Tianyu Pang Chao Du Min Lin Weiwei Liu Shuicheng Yan DiffM 106 228 0 09 Feb 2023
Generalization in Graph Neural Networks: Improved PAC-Bayesian Bounds on Graph Diffusion Haotian Ju Dongyue Li Aneesh Sharma Hongyang R. Zhang 59 41 0 09 Feb 2023
Exploring and Exploiting Decision Boundary Dynamics for Adversarial Robustness Yuancheng Xu Yanchao Sun Micah Goldblum Tom Goldstein Furong Huang AAML 92 38 0 06 Feb 2023
Flat Seeking Bayesian Neural Networks Van-Anh Nguyen L. Vuong Hoang Phan Thanh-Toan Do Dinh Q. Phung Trung Le BDL 100 10 0 06 Feb 2023
Variational Inference on the Final-Layer Output of Neural Networks Yadi Wei Roni Khardon BDL UQCV 91 0 0 05 Feb 2023
Generalized Uncertainty of Deep Neural Networks: Taxonomy and Applications Chengyu Dong OOD UQCV BDL AI4CE 128 0 0 02 Feb 2023
A Survey of Deep Learning: From Activations to Transformers Johannes Schneider Michalis Vlachos ViT MedIm AI4TS AI4CE 112 10 0 01 Feb 2023
A Comprehensive Survey of Continual Learning: Theory, Method and Application Liyuan Wang Xingxing Zhang Hang Su Jun Zhu KELM CLL 236 714 0 31 Jan 2023
Cross-Architectural Positive Pairs improve the effectiveness of Self-Supervised Learning P. Singh Jacopo Cirrone SSL 117 0 0 27 Jan 2023
Exploring the Effect of Multi-step Ascent in Sharpness-Aware Minimization Hoki Kim Jinseong Park Yujin Choi Woojin Lee Jaewook Lee 44 9 0 27 Jan 2023
Backward Compatibility During Data Updates by Weight Interpolation Raphael Schumann Elman Mansimov Yi-An Lai Nikolaos Pappas Xibin Gao Yi Zhang 44 5 0 25 Jan 2023
Model soups to increase inference without increasing compute time Charles Dansereau Milo Sobral Maninder Bhogal Mehdi Zalai 23 2 0 24 Jan 2023
Stability Analysis of Sharpness-Aware Minimization Hoki Kim Jinseong Park Yujin Choi Jaewook Lee 78 13 0 16 Jan 2023
Training trajectories, mini-batch losses and the curious role of the learning rate Mark Sandler A. Zhmoginov Max Vladymyrov Nolan Miller ODL 90 12 0 05 Jan 2023
Audio-Visual Efficient Conformer for Robust Speech Recognition Maxime Burchi Radu Timofte VLM 78 35 0 04 Jan 2023
Recent Advances on Federated Learning: A Systematic Survey Bingyan Liu Nuoyan Lv Yuanchun Guo Yawen Li FedML 118 89 0 03 Jan 2023
Self-Activating Neural Ensembles for Continual Reinforcement Learning Sam Powers Eliot Xing Abhinav Gupta KELM CLL 86 5 0 31 Dec 2022
Do Bayesian Variational Autoencoders Know What They Don't Know? Misha Glazunov Apostolis Zarras UQCV BDL 63 5 0 29 Dec 2022
Frequency Regularization for Improving Adversarial Robustness Binxiao Huang Chaofan Tao R. Lin Ngai Wong AAML 34 4 0 24 Dec 2022
Training Integer-Only Deep Recurrent Neural Networks V. Nia Eyyub Sari Vanessa Courville M. Asgharian MQ 96 2 0 22 Dec 2022
Revisiting Residual Networks for Adversarial Robustness: An Architectural Perspective Shihua Huang Zhichao Lu Kalyanmoy Deb Vishnu Boddeti OOD 102 45 0 21 Dec 2022
KL Regularized Normalization Framework for Low Resource Tasks Neeraj Kumar Ankur Narang Brejesh Lall 58 1 0 21 Dec 2022
Model Ratatouille: Recycling Diverse Models for Out-of-Distribution Generalization Alexandre Ramé Kartik Ahuja Jianyu Zhang Matthieu Cord Léon Bottou David Lopez-Paz MoMe OODD 128 86 0 20 Dec 2022
Variational Factorization Machines for Preference Elicitation in Large-Scale Recommender Systems Jill-Jênn Vie Tomas Rigaux H. Kashima BDL 131 1 0 20 Dec 2022
Dataless Knowledge Fusion by Merging Weights of Language Models Xisen Jin Xiang Ren Daniel Preoţiuc-Pietro Pengxiang Cheng FedML MoMe 99 250 0 19 Dec 2022
A Probabilistic Framework for Lifelong Test-Time Adaptation Dhanajit Brahma Piyush Rai TTA 68 36 0 19 Dec 2022
The Underlying Correlated Dynamics in Neural Training Rotem Turjeman Tom Berkov I. Cohen Guy Gilboa 70 3 0 18 Dec 2022
Bayesian posterior approximation with stochastic ensembles Oleksandr Balabanov Bernhard Mehlig Hampus Linander BDL UQCV 120 5 0 15 Dec 2022
Generative Robust Classification Xuwang Yin TPM 53 0 0 14 Dec 2022
Efficient Bayesian Uncertainty Estimation for nnU-Net Yidong Zhao Changchun Yang Artur M. Schweidtmann Qian Tao UQCV BDL 62 22 0 12 Dec 2022
Improving Generalization of Pre-trained Language Models via Stochastic Weight Averaging Peng Lu I. Kobyzev Mehdi Rezagholizadeh Ahmad Rashid A. Ghodsi Philippe Langlais MoMe 100 11 0 12 Dec 2022
A New Linear Scaling Rule for Private Adaptive Hyperparameter Optimization Ashwinee Panda Xinyu Tang Saeed Mahloujifar Vikash Sehwag Prateek Mittal 126 12 0 08 Dec 2022